This program is Search Engine using Term Frequency and Inverse Document Frequency (TF-IDF) with Cosine Similarity ranking and PageRank algorithm.
- Clone this repository to your local computer.
$ git clone https://github.com/hideUW/java_search_engine.git
-
Run Eclipse. Select the "File" > "Import" option from the menu:
-
Select the "Gradle" > "Existing Gradle Project" option.
-
Hit "Ok" and "Next" until you encounter the below dialog. Click the "Browse" button:
-
Navigate to where your project is located and select it.
-
Run Main.java
-
Then, the program starts indexing website in data folder and the user can use small search engine.
-
By editing Main.java, the user can change what data they use in search engine.