A bespoke NLP Chatbot trained using a corpus of Reddit data.
-
Updated
Apr 2, 2020 - Python
A bespoke NLP Chatbot trained using a corpus of Reddit data.
Program that performs textual analysis of Reddit data (approx. 300 GB) preprocessed by another team member. Uses Hadoop's Mapreduce to classify comments as either positive or negative based on certain keywords, negation, etc.
Subreddit data and information for r/seanpm2001 @ https://www.reddit.com/r/seanpm2001/
Program to scrape reddit data using reddit-api
Reddar is a retrieval system based on elasticsearch and python flask for Reddit data.
A sample dataset of over 1000 Reddit posts , extracted using the Bright Data API, ideal for sentiment analysis, consumer monitoring, trend identification, and competitor analysis.
Add a description, image, and links to the reddit-data topic page so that developers can more easily learn about it.
To associate your repository with the reddit-data topic, visit your repo's landing page and select "manage topics."