LlamaIndex is the leading framework for building LLM-powered agents over your data.
Updated Mar 3, 2025 - Python
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects.
(https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
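📙 Chinese Xinhua Dictionary database. Includes xiehouyu (two-part allegorical sayings), chengyu (idioms), words, and Chinese characters.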
AKShare is an elegant and simple open-source financial data interface library for Python, built for human beings!
🧙 Build, run, and manage data pipelines for integrating and transforming data.
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share, and use data. It powers catalog.data.gov, open.canada.ca/data, and data.humdata.org, among many other sites.
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
A synthetic data generator for text recognition
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Extract data from a wide range of Internet sources into a pandas DataFrame.
Compare tables within or across databases
🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping easy again!
A Doctor for your data
PyPika is a Python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.