Skip to content

Commit

Permalink
Add office document binary extraction. (#346)
Browse files Browse the repository at this point in the history
- Add Word Processor DF and binary extraction
- Add Spreadsheets DF and binary extraction
- Add Presentation Program DF and binary extraction
- Add Text files DF and binary extraction
- Add tests for new DF and binary extractions
- Add test fixtures for new DF and binary extractions
- Resolves #303
- Resolves #304
- Resolves #305
- Use aut-resources repo to distribute our shaded tika-parsers 1.22
- Close TikaInputStream
- Add RDD filters on MimeTypeTika values
- Add CodeCov configuration yaml
- Includes work by @jrwiebe, see #346 for all commits before squash
  • Loading branch information
ruebot authored and ianmilligan1 committed Aug 16, 2019
1 parent 39831c2 commit c824ad8
Show file tree
Hide file tree
Showing 11 changed files with 602 additions and 50 deletions.
26 changes: 26 additions & 0 deletions .codecov.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,26 @@
codecov:
notify:
require_ci_to_pass: yes

coverage:
precision: 2
round: down
range: "50...80"

status:
project: yes
patch: yes
changes: no

parsers:
gcov:
branch_detection:
conditional: yes
loop: yes
method: no
macro: no

comment:
layout: "header, diff"
behavior: default
require_changes: no
4 changes: 4 additions & 0 deletions pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -65,6 +65,10 @@
<id>maven</id>
<url>http://repo.maven.apache.org/maven2/</url>
</repository>
<repository>
<id>mvn-repo</id>
<url>https://raw.githubusercontent.com/archivesunleashed/aut-resources/master/mvn-repo</url>
</repository>
<repository>
<id>jitpack.io</id>
<url>https://jitpack.io</url>
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -38,6 +38,7 @@ object DetectMimeTypeTika {
} else {
val tis = TikaInputStream.get(content)
val mimetype = tika.detect(tis)
tis.close()
mimetype
}
}
Expand Down

This file was deleted.

Loading

0 comments on commit c824ad8

Please sign in to comment.