Skip to content

Latest commit

 

History

History
32 lines (20 loc) · 980 Bytes

File metadata and controls

32 lines (20 loc) · 980 Bytes

Natural-language-processing-web-scrapping

It extracts the frequency of terms used in the web pages, documents etc.

Natural Language Processing (NLP) has been sccessfully implemented in earch engines (Google, Yahoo), social media (Facebook, twitter). Natural language toolkit (NLTK) is the library for natural language processing (NLP).

Install nltk using

pip install nltk

Install packages

import nltk
nltk.download()

Image of objects

Outputs

Web page-Human behavior

https://en.wikipedia.org/wiki/Human_behavior

Image of objects

Web page-Wearable technology

https://en.wikipedia.org/wiki/Wearable_technology

Image of objects