Skip to content

Commit

Permalink
added readme
Browse files Browse the repository at this point in the history
  • Loading branch information
Juan Cazala committed Jun 20, 2015
1 parent 1a5c7ac commit e9d7dd0
Showing 1 changed file with 108 additions and 2 deletions.
110 changes: 108 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,2 +1,108 @@
# mnist
I just want to make an easy-to-use library of MNIST digits, specifically for training neural networks in javascript (either broswer or node.js). This is incomplete tho, I just uploaded the raw data for a thousand examples (28x28) of each digits (0-9), I ll try to finish it soon.
MNIST Digits
============

The goal of this library is to provide an easy-to-use way for training and testing MNIST digits to neural networks (either in the browser or node.js). I built this in order to work out of the box with [Synaptic](https://github.com/cazala/synaptic).


### Installation

for node.js: `npm install mnist --save`

for the browser: `bower install mnist --save`

### Usage

The most important method is `MNIST.set(trainingAmmt, testAmmt)` which takes the ammount of samples for the training and test sets, and returns an object with the two sets of samples (one for training and the other one for testing). Both sets are shuffled, and there are no samples repeated in both sets.

For example:

```
var mnist = require('mnist'); // this line is not needed in the browser
var set = mnist.set(8000, 2000);
var trainingSet = set.training;
var testSet = set.test;
```

That would create a random training set of 8000 mnist digits, and a test set with other random 2000 mnist digits, and there are not going to be any sample in the training set repeated in the test set.

Every set consist of an array of elements, which each of them will look like this:

```
{
input: [0,0,0,1,1, ... ,0,0], // a 784-length array of floats representing each pixel of the 28 x 28 image, normalized between 0 and 1
output: [0,0,0,0,0,0,1,0,0,0] // a 10-length binary array that tells which digits (from 0 to 9) is in that image
}
```


### Helpers

Every digit has a set of helper funcitions which are listed below. To access these functions you have to target the desired digit like this: `mnist[digit]`.

So, for example, to get a single '5' digit, you would call `mnist[5].get()`.

These are all the helper function:

#####.get([index])

Returns a single sample of that digit, given its index. If no index is provided, a random sample is returned.

```
mnist[0].get(100) // [0,0,0,1,1...,0,0]
```

#####.length

Returns the number of samples available for that digit

```
mnist[6].length // 1009
```

####.range(start, end)

Returns an array of samples, corresponding to the range specified between the `start` and `end` arguments

```
mnist[9].range(10, 17) // [ array with 7 samples of the digit 9 ]
```

####.set(start, end)

Returns an dataset of samples, corresponding to the range specified between the `start` and `end` arguments, every element of the array has an `input` and `output` properties, beight `input` the normalized data of each image, and output a 10-length binary array representing which digit is in that image.

```
mnist[4].set(200, 250) // [ dataset with 50 samples of the digit 4 ready for training/testing a neural network ]
```

####.raw

This property provides access to all the raw data for all the samples of that digit

```
mnist[3].raw // all the raw data for digit 3
```

### Drawing a digit (only in the browser)

This lets you draw a given digit in a canvas context: `mnist.draw(digit, context [,offsetX, offsetY])`

```
var digit = mnist[1].get();
var context = document.getElementById('myCanvas').getContext('2d');
mnist.draw(digit, context); // draws a '1' mnist digit in the canvas
```

The digit provided has to be a 784-length array of normalized values (0-1). All of the following approaches will work:

```
var digit = mnist[0].get(); // single digit
var digit = mnist[1].range(0, 100)[50]; // extracting the digit from a range
var digit = mnist[4].set(0, 100)[20].input; // extracting the digit from a dataset
var digit = mnist.set(8000, 2000).training[0].input; // extracting the digit from a training set
var digit = mnist.set(8000, 2000).test[15].input; // extracting the digit from a test set
```

0 comments on commit e9d7dd0

Please sign in to comment.