This tool is outdated and no longer used in Jibiki, please refer to the docker setup instructions instead.
This is a small tool to import the sentences, links, audio and tags from the Tatoeba project into a PostgreSQL database. This tool was written for Jibiki which is a free Japanese dictionary website integrating many data sources into one.
This tool requires the textsearch_ja extension to be installed. Without this extension, the Japanese full text search will not work.
Installing textsearch_ja
If you are on Windows, you need to install MinGW to use make
- Download textsearch_ja as ZIP and extract it
- Open any command line utility
cd
into the textsearch_ja folder- Run
make
- Run
make install
- Done!
- Download the jar from the releases tap of this page
- Download and install Java runtime 8 or above
- Open any command line utility (CMD on Windows and Terminal on Linux and Mac)
- Navigate to the directory you downloaded the tool to using
cd DIRECTORY_HERE
- Run the following command to run the tool
java -jar TatoebaPostgreSQL.jar
- Follow the prompts on screen and then wait for it to install
- Done!
SELECT sentence FROM sentences WHERE tsv @@ to_tsquery('I like dogs!');
Read more about tsvector and tsquery