The Data Science Ontology is a knowledge base about data science that aims to:
- catalog and formalize the concepts of data science
- semantically annotate popular software packages for data science
- power new AI assistants for data scientists
To learn more about the Data Science Ontology, start here.
The Data Science Ontology is young but growing! We welcome contributions of concepts and annotations. Learn how to contribute. For improvements to the web frontend, please visit the dedicated frontend repository.
Ensure jq
, pandoc-citeproc
, and npm
are installed.
To install the JavaScript-based dependencies: npm install
To build the ontology into the build
folder: npm run build
To validate the ontology after building: npm run validate
The following steps assume using the IBM Cloud free tier, but can be adjusted to other CouchDB services.
- Create a Cloudant resource.
- On the Service Details page, choose Launch Cloudant Dashboard.
- On the Databases page, choose Create Database.
- Name your database
data-science-ontology
, choose Non-partitioned and choose Create. - On the Account page, under the Settings tab, copy the External Endpoint
(preferred) value, and assign it to the
COUCH_URL
environment variable (note: do not use a trailing slash). - Use IAM to set up an API key and assign it
to the
IAM_API_KEY
environment variable. - Run
npm run upload-couchdb
.
If you want to re-run step 7 after a new build, run npm run clean-couchdb
first. Note that this removes all non-design documents from your database.