Commit

adding youtube video
elliewix authored Oct 29, 2018
1 parent 6eb4429 commit d83cda6
Showing 1 changed file with 11 additions and 1 deletion.
12 changes: 11 additions & 1 deletion readme.md
@@ -4,15 +4,25 @@ This is a tool created as a project for the Spring 2016 GSLIS Data Cleaning cour

The products of this tool will sit somewhere between auto-documentation and data profiling.

## PyData presentation

This tool was presented at PyData Chicago 2016. Talk recording: https://www.youtube.com/watch?v=Hb7nvHbwNAw&t=4s

## Basic purpose

Point the tool at a folder of files and it will create a markdown file with basic statistics about each column along with template areas for you to write a narrative about each column. You can then render that into HTML or simply include it in your data package as documentation.
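
The workflow described above can be sketched roughly as follows. This is a minimal illustration and not the tool's actual code: the function names `profile_csv` and `build_markdown`, the restriction to `.csv` files, and the two statistics shown (non-empty count and distinct count) are all assumptions made for the example.

```python
import csv
from pathlib import Path


def profile_csv(path):
    """Return (column, non-empty count, distinct count) tuples for one CSV file.

    Hypothetical helper: the statistics chosen here are assumptions,
    not the ones the real tool reports.
    """
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        columns = reader.fieldnames or []
        rows = list(reader)
    stats = []
    for column in columns:
        # Treat empty strings and missing trailing fields as empty values.
        values = [row[column] for row in rows if row[column] not in ("", None)]
        stats.append((column, len(values), len(set(values))))
    return stats


def build_markdown(folder):
    """Assemble one markdown document covering every .csv file in `folder`."""
    lines = []
    for path in sorted(Path(folder).glob("*.csv")):
        lines.append(f"## {path.name}")
        lines.append("")
        for column, filled, distinct in profile_csv(path):
            lines.append(f"### {column}")
            lines.append("")
            lines.append(f"- Non-empty values: {filled}")
            lines.append(f"- Distinct values: {distinct}")
            lines.append("")
            # Template area for the hand-written narrative.
            lines.append("*Write a narrative about this column here.*")
            lines.append("")
    return "\n".join(lines)
```

Calling `build_markdown("some_folder")` returns the markdown text, which could then be written to a file and rendered to HTML or shipped with the data package.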

## Python 3 is in progress

`data_profilepy3.py` is the version updated for Python 3 and contains the most up-to-date code. It is used the same way as the Python 2 version.

Consider the Python 2 version deprecated.

## Core caveats

Still mostly a proof of concept.

Path issues for windows.
Path issues for windows. This was hopefully fixed.

Unknown bugs.
