You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Este repositório contém scripts da minha monografia, em que analisei dados do MBL (Movimento Brasil Livre) no Youtube. Incluem um arquivo com tags de vídeos do canal, usadas para criar wordclouds e um arquivo de nós para análise no Gephi. Um segundo script realiza modelagem de tópicos com base nos comentários de vídeos selecionados.
This project tests if machine learning provides a sufficient accuracy level for predicting topic classification on unseen text. LDA and Naive Bayes algorithms used. Data cleaned and uploaded to AWS S2 storage and imported to google colab using PySpark for analysis.
Create a solution that will help in identifying the type of complaint ticket raised by the customers of a multinational bank using NLP and Topic Modelling (NMF)
Develop a model which will make it possible to identify specific subject matter being discussed/present on web pages by using a combination of web crawling and natural language processing.
Welcome to my repository for the British Airways Data Science Virtual Internship ✈️! Here, I applied data science techniques like web scraping 🌐, sentiment analysis 😃😡, and predictive modeling 🤖 to real airline data. Join me in exploring how data insights can enhance customer experiences! 🚀