Welcome to the Webz.io News Dataset Repository! This repository is created by Webz.io and is dedicated to providing free datasets of publicly available news articles. We release new datasets weekly, each containing around 1,000 news articles focused on various themes, topics, or metadata characteristics like sentiment analysis, and top IPTC categories such as finance, sports, and politics.
To get ongoing free access to online news data, you can use Webz.io's free News API Lite. Here is an open-source demo demonstrating what can be done with it.
- Weekly Releases: New dataset available every week.
- Thematic Focus: Datasets based on specific themes, topics, or metadata.
- Rich Metadata: Includes sentiment analysis, categories, publication dates.
- Diverse Sources: Articles from a wide range of news websites.
The datasets are free for academic, research, and journalistic purposes:
- Data Analysis: For statistical analyses, trend identification, and pattern recognition.
- Machine Learning: Suitable for training NLP models, sentiment analysis, etc.
- Journalistic Research: Helps journalists in data-driven storytelling.
- Browse the repository.
- Find a dataset that suits your needs.
- Download the dataset with its detailed description and metadata file.
- We created a simple React application that you can use to preview the data
Contributions are welcome! If you have suggestions or want to contribute, please open an issue or a pull request.
For questions or support, raise an issue in the repository.
By using the Dataset Repository you agree to the following TOU.