- Fixed issue with missing content from LibreOffice files. See #40
- Fixed order of entry reading from LibreOffice OOXML files. See #41
- Fixes issues with long attribute values (> 65k) in OO XML. See #37
- Propogate errors from XML failures into promise rejections. See #38
- Changed the XML parser dependency for maintenance and fixes. See #39
- Added a new method for reading textbox content. See #35
- Added separation between headers and footers. See #34
- Major refactoring of the OLE code to use promises internally
- Added support for Open Office XML-based (.docx) Word files. See #1
- Added support for reading direct from a Buffer. See #11
- Removed event-stream dependency. See #19
- Fixed an issue with not closing files properly. See #23
- Corrected handling of extracting files with files. See #31
- Corrected handling of extracting files with deleted text. See #32
- Fixed issues with extracting multiple rows of table data. See #33
This is a major release, and while there are no incompatible API changes, it seemed best to bump the version so as not to pick up updates automatically. However, all old applications should not require any code changes to use this version.
- Re-fixed the bad loop in the OLE code. See #15, #18
- A few errors previously rejected as strings, they're now errors
- Updated dependencies to safe versions. See #20
- Added a new getEndnotes method. See #16
- Fixed a bad loop in the OLE code
- Removed coffeescript and mocha, now using jest and plain ES6
- Removed partial work on .docx (for now)
- Fixed a documentation issue.
extract
returns a Promise. See #6 - Corrected table cell delimiters to be tabs. See #9
- Fixed an issue where replacements weren't being applied right.
- Added the missing
lib
folder - Added a missing dependency to
package.json
- Fixed a bug with text boundary calculations
- Added endpoints
getHeaders
,getFootnotes
,getAnnotations
- Initial release to npm