A sample dataset of over 2,000 leading Instagram influencers who post about GitHub and coding. The dataset was extracted using the Bright Data Collector.
Each profile includes the following data points:
- followers count
- profile type
- account type
- engagement score
- categories
- location
- external/bio links
- hashtags used
- brand affiliation
- bio
- highlights
- posts
This sample is a subset derived from the "All Instagram account, business & nonbusiness (public data)" dataset, which includes 614,000,000 Instagram profiles.
In this example, the large dataset was filtered down to a smaller subset using the smart filter queries available on the Bright Data control panel:
- $or: [{"post_hashtags":"github"},{"bio_hashtags":"github"}]
- followers: {"$gt":100}
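To make the filter logic concrete, here is a minimal Python sketch that applies the same two conditions to records already loaded in memory. It is not the Bright Data control panel's implementation. It assumes that the two conditions are combined with AND, that `post_hashtags` and `bio_hashtags` hold either a single string or a list of strings, and that `followers` is a number. The `account` field and the sample records are made up for illustration.

```python
from typing import Any


def _as_list(value: Any) -> list:
    """Normalize a hashtag field to a list (assumed shapes: None, str, or list)."""
    if value is None:
        return []
    if isinstance(value, str):
        return [value]
    return list(value)


def matches_filter(profile: dict[str, Any]) -> bool:
    """Mirror the two filter conditions, combined with AND (an assumption)."""
    # $or: [{"post_hashtags": "github"}, {"bio_hashtags": "github"}]
    has_github_tag = (
        "github" in _as_list(profile.get("post_hashtags"))
        or "github" in _as_list(profile.get("bio_hashtags"))
    )
    # followers: {"$gt": 100}
    followers = profile.get("followers") or 0
    return has_github_tag and followers > 100


# Hypothetical records, invented only to exercise the filter.
sample_profiles = [
    {"account": "dev_one", "followers": 2500,
     "post_hashtags": ["github", "python"], "bio_hashtags": []},
    {"account": "dev_two", "followers": 80,
     "post_hashtags": ["github"], "bio_hashtags": None},
    {"account": "dev_three", "followers": 400,
     "post_hashtags": ["travel"], "bio_hashtags": "github"},
    {"account": "dev_four", "followers": 900,
     "post_hashtags": ["githubber"]},
]

filtered = [p for p in sample_profiles if matches_filter(p)]
print([p["account"] for p in filtered])
```

Exact tag matching is used rather than substring matching, so a tag such as "githubber" does not count as "github".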
Additional filter query values include: posts count, country, verified account, multiple hashtag combinations, and more.
Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet.
Dataset delivery type options: API download, Amazon S3, Google Cloud, Microsoft Azure, SFTP.
Data enrichment is available as an addition to the extracted data points: average post engagement rate, brand affiliation, and more.
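Of the file formats listed above, NDJSON / JSON Lines stores one JSON object per line, so it can be read incrementally with the Python standard library alone. The sketch below is only an illustration: the `account` and `followers` field names and the in-memory demo content are assumptions, not the actual schema of the delivered files.

```python
import io
import json
from typing import Any, Iterator, TextIO


def read_json_lines(stream: TextIO) -> Iterator[dict[str, Any]]:
    """Yield one parsed record per non-empty line of an NDJSON / JSON Lines stream."""
    for line in stream:
        line = line.strip()
        if line:  # skip blank lines
            yield json.loads(line)


# Hypothetical two-record file, held in memory for the demo.
demo_file = io.StringIO(
    '{"account": "dev_one", "followers": 2500}\n'
    "\n"
    '{"account": "dev_two", "followers": 80}\n'
)

records = list(read_json_lines(demo_file))
print(len(records))
```

For a real download, the same function can be given a file object opened with `open(path, encoding="utf-8")`, where `path` points to the delivered file.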
Get the full Instagram dataset on Bright Data's page.
- 614,000,000 "Instagram profiles"
- 21,100,000 "Instagram posts"
- 88,800 "Instagram reels"
The Bright Initiative offers access to Bright Data's Web Scraper APIs to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application here.