Search Engine using Term Frequency and Inverse Document Frequency (TF-IDF) with Cosine Similarity ranking and PageRank algorithm

hide27k/java-searchengine

java_search_engine

This program is a search engine that ranks documents using Term Frequency and Inverse Document Frequency (TF-IDF) with cosine similarity, combined with the PageRank algorithm.
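To make the TF-IDF and cosine similarity part concrete, here is a minimal sketch of how such a ranking can be computed. It is an illustration only, not the code in this repository: the class name `TfIdfSketch`, its method names, the naive tokenizer, and the choice of IDF as ln(N / df) are all assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;

// Hypothetical illustration; none of these names come from the repository.
class TfIdfSketch {

    // Naive tokenizer (an assumption): lowercase, split on non-alphanumerics.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase().split("[^a-z0-9]+")) {
            if (!t.isEmpty()) tokens.add(t);
        }
        return tokens;
    }

    // IDF(t) = ln(N / df(t)), where df(t) is the number of documents containing t.
    static Map<String, Double> idf(List<List<String>> docs) {
        Map<String, Integer> df = new HashMap<>();
        for (List<String> doc : docs) {
            for (String term : new HashSet<>(doc)) {
                df.merge(term, 1, Integer::sum);
            }
        }
        Map<String, Double> idf = new HashMap<>();
        for (Map.Entry<String, Integer> e : df.entrySet()) {
            idf.put(e.getKey(), Math.log((double) docs.size() / e.getValue()));
        }
        return idf;
    }

    // TF-IDF vector: relative term frequency times IDF.
    // Terms that never occur in the corpus have no IDF and are skipped.
    static Map<String, Double> tfIdf(List<String> tokens, Map<String, Double> idf) {
        Map<String, Integer> counts = new HashMap<>();
        for (String t : tokens) counts.merge(t, 1, Integer::sum);
        Map<String, Double> vec = new HashMap<>();
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            Double weight = idf.get(e.getKey());
            if (weight != null) {
                vec.put(e.getKey(), e.getValue() / (double) tokens.size() * weight);
            }
        }
        return vec;
    }

    // Cosine similarity of two sparse vectors; 0.0 if either vector has zero length.
    static double cosine(Map<String, Double> a, Map<String, Double> b) {
        double dot = 0.0, normA = 0.0, normB = 0.0;
        for (Map.Entry<String, Double> e : a.entrySet()) {
            dot += e.getValue() * b.getOrDefault(e.getKey(), 0.0);
            normA += e.getValue() * e.getValue();
        }
        for (double v : b.values()) normB += v * v;
        if (normA == 0.0 || normB == 0.0) return 0.0;
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    public static void main(String[] args) {
        List<String> texts = Arrays.asList(
                "the cat sat on the mat",
                "dogs chase cats in the park",
                "stock markets fell sharply today");
        List<List<String>> docs = new ArrayList<>();
        for (String text : texts) docs.add(tokenize(text));

        Map<String, Double> idf = idf(docs);
        Map<String, Double> query = tfIdf(tokenize("cat mat"), idf);
        for (int i = 0; i < docs.size(); i++) {
            double score = cosine(query, tfIdf(docs.get(i), idf));
            System.out.println(score + "  " + texts.get(i));
        }
    }
}
```

Documents are then sorted by descending score. Because the tokenizer does no stemming, "cat" and "cats" count as different terms here; a real engine would usually normalize such variants.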

Getting Started

  1. Clone this repository to your local computer:
$ git clone https://github.com/hideUW/java_search_engine.git
  2. Run Eclipse and select "File" > "Import" from the menu.

  3. Select "Gradle" > "Existing Gradle Project".

  4. Click "OK" and "Next" until a dialog with a "Browse" button appears, then click "Browse".

  5. Navigate to where your project is located and select it.

  6. Run Main.java.

  7. The program then indexes the websites in the data folder, after which you can use the small search engine.

  8. To change the data the search engine uses, edit Main.java.
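The indexing step above is also where a link-based score such as PageRank would typically be computed. The following is a minimal sketch of the standard power-iteration form of PageRank, not the implementation in this repository: the class name `PageRankSketch`, the adjacency-array input format, the damping factor of 0.85, and the even redistribution of rank from pages without outgoing links are assumptions made for this example.

```java
import java.util.Arrays;

// Hypothetical illustration; none of these names come from the repository.
class PageRankSketch {

    // links[i] holds the indices of the pages that page i links to.
    static double[] pageRank(int[][] links, double damping, int iterations) {
        int n = links.length;
        double[] rank = new double[n];
        Arrays.fill(rank, 1.0 / n); // start from a uniform distribution

        for (int it = 0; it < iterations; it++) {
            double[] next = new double[n];
            double dangling = 0.0; // total rank held by pages with no outgoing links

            for (int i = 0; i < n; i++) {
                if (links[i].length == 0) {
                    dangling += rank[i];
                    continue;
                }
                // Each page splits its rank evenly among the pages it links to.
                double share = rank[i] / links[i].length;
                for (int target : links[i]) next[target] += share;
            }

            // Apply damping and spread the dangling rank evenly (an assumption).
            for (int j = 0; j < n; j++) {
                next[j] = (1.0 - damping) / n + damping * (next[j] + dangling / n);
            }
            rank = next;
        }
        return rank;
    }

    public static void main(String[] args) {
        // Pages 0 and 1 link to page 2, page 2 links back to page 0, page 3 has no links.
        int[][] links = { {2}, {2}, {0}, {} };
        double[] rank = pageRank(links, 0.85, 50);
        System.out.println(Arrays.toString(rank));
    }
}
```

In this small graph page 2 ends up with the highest score because two pages link to it. A search engine can combine such a score with the cosine similarity of the query, for example as a weighted sum; how this project combines them is not stated here.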
