Ignore thesis, reports and other documents out of scopes #98

lfoppiano · 2020-08-18T05:08:13Z

AFAIK we should have the possibilities to harvest only articles, journals and ignore thesis and other documents that are not going to be ingested correctly or that are not supported by all the components of the pipeline.

E.g. Thesis is in average 200Mb up to Gbs for a PDF.

lfoppiano added the enhancement label Aug 18, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ignore thesis, reports and other documents out of scopes #98

Ignore thesis, reports and other documents out of scopes #98

lfoppiano commented Aug 18, 2020

Ignore thesis, reports and other documents out of scopes #98

Ignore thesis, reports and other documents out of scopes #98

Comments

lfoppiano commented Aug 18, 2020