-
-
Notifications
You must be signed in to change notification settings - Fork 47
/
Copy pathcex-demo.html
36 lines (36 loc) · 2.15 KB
/
cex-demo.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
<?xml version="1.0" encoding="UTF-8"?>
<div class="templates:surround?with=templates/page.html&at=content">
<h1>Content Extraction and Binary Resource Indexing</h1>
<div class="templates:if-module-missing?uri=http://exist-db.org/xquery/contentextraction&at=java:org.exist.contentextraction.xquery.ContentExtractionModule">
<div class="alert alert-warning">
<p>The content extraction module does not appear to be available in your eXist installation.
To enable it, stop eXist, edit <code>$EXIST_HOME/extensions/build.properties</code> and set the corresponding property to true:</p>
<pre>
# Binary Content and Metadata Extraction Module
include.feature.contentextraction = true</pre>
<p>Next, call <code>build.sh/build.bat</code> from eXist's top directory to build the module. You should see in the output how the various
libraries required are downloaded and installed.</p>
</div>
</div>
<p>This page demonstrates how to query binary documents which have been indexed with Lucene after
their text content has been extracted. The app defines a trigger on the "binary" collection below the "data" collection in the
app root collection. To test the indexing, upload a pdf to the collection and its contents
will be extracted and indexed automatically.</p>
<form action="" method="GET" class="form form-inline">
<span class="input-group col-md-3 col-xs-12">
<input name="query" type="search" data-template="templates:form-control" class="form-control" placeholder="Search string"/>
<span class="input-group-btn">
<button id="f-btn-search" type="submit" class="btn btn-primary">
<span class="glyphicon glyphicon-search"/>
</button>
</span>
</span>
</form>
<div class="templates:if-parameter-set?param=query">
<div class="cex:query"/>
</div>
<div class="source-links">
<p>View source: <a href="cex-demo.html" class="templates:load-source">this page</a>,
<a href="modules/cex.xql" class="templates:load-source">cex.xql</a>.</p>
</div>
</div>