LlamaIndex is the leading framework for building LLM-powered agents over your data.
-
Updated
Apr 26, 2025 - Python
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects.
(https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
📙 中华新华字典数据库。包括歇后语,成语,词语,汉字。
🧙 Build, run, and manage data pipelines for integrating and transforming data.
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Superduper: End-to-end framework for building custom AI applications and agents.
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data.humdata.org among many other sites.
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A synthetic data generator for text recognition
Preswald is a framework for building and deploying interactive data apps, internal tools, and dashboards with Python. With one command, you can launch, share, and deploy locally or in the cloud, turning Python scripts into powerful shareable apps.
A Doctor for your data
Extract data from a wide range of Internet sources into a pandas DataFrame.
Compare tables within or across databases
🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!