Skip to content

Webhose/free-news-datasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Webz.io News Dataset Repository

webz.io Logo

Introduction

Welcome to the Webz.io News Dataset Repository! This repository is created by Webz.io and is dedicated to providing free datasets of publicly available news articles. We release new datasets weekly, each containing around 1,000 news articles focused on various themes, topics, or metadata characteristics like sentiment analysis, and top IPTC categories such as finance, sports, and politics.

To get ongoing free access to online news data, you can use Webz.io's free News API Lite. Here is an open-source demo demonstrating what can be done with it.

Dataset Overview

  • Weekly Releases: New dataset available every week.
  • Thematic Focus: Datasets based on specific themes, topics, or metadata.
  • Rich Metadata: Includes sentiment analysis, categories, publication dates.
  • Diverse Sources: Articles from a wide range of news websites.

Usage

The datasets are free for academic, research, and journalistic purposes:

  • Data Analysis: For statistical analyses, trend identification, and pattern recognition.
  • Machine Learning: Suitable for training NLP models, sentiment analysis, etc.
  • Journalistic Research: Helps journalists in data-driven storytelling.

Accessing the Datasets

  • Browse the repository.
  • Find a dataset that suits your needs.
  • Download the dataset with its detailed description and metadata file.
  • We created a simple React application that you can use to preview the data

Contribution

Contributions are welcome! If you have suggestions or want to contribute, please open an issue or a pull request.

Support

For questions or support, raise an issue in the repository.

License/Terms of Use

By using the Dataset Repository you agree to the following TOU.


About

Weekly free datasets from global news sites

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published