HiDi: Pipelines for Latent Factor Modeling

https://circleci.com/gh/VEVO/hidi/tree/master.svg?style=svg

HiDi is a library for high-dimensional latent factor modeling for collaborative filtering applications.

How Do I Use It?

This will get you started.

from hidi import inout, clean, matrix, pipeline


# CSV file with link_id and item_id columns
in_files = ['hidi/examples/data/user-item.csv']

# File to write output data to
outfile = 'latent-factors.csv'

transforms = [
    inout.ReadTransform(in_files),      # Read data from disk
    clean.DedupeTransform(),            # Dedupe it
    matrix.SparseTransform(),           # Make a sparse user*item matrix
    matrix.SimilarityTransform(),       # To item*item similarity matrix
    matrix.SVDTransform(),              # Perform SVD dimensionality reduction
    matrix.ItemsMatrixToDFTransform(),  # Make a DataFrame with an index
    inout.WriteTransform(outfile)       # Write results to csv
]

pl = pipeline.Pipeline(transforms)
pl.run()

Setup

Requirements

HiDi is tested against CPython 2.7, 3.4, 3.5, and 3.6. It may work with different version of CPython.

Installation

To install HiDi, simply run

$ pip install hidi

Run the Tests

$ pip install tox
$ tox

Name	Name	Last commit message	Last commit date
Latest commit kahnvex Update pandas astype Categorical to new API May 9, 2018 f5bd480 · May 9, 2018 History 118 Commits
docs/source	docs/source	Embedding -> latent factors	May 2, 2017
hidi	hidi	Update pandas astype Categorical to new API	May 9, 2018
tests	tests	change the builddataset transform and its test case	May 9, 2017
.gitignore	.gitignore	Initial commit	Apr 18, 2017
CHANGELOG.rst	CHANGELOG.rst	Version bump to 0.0.3	Apr 27, 2017
LICENSE	LICENSE	Initial commit	Apr 18, 2017
Makefile	Makefile	Add more tasks to makefile	Apr 20, 2017
README.rst	README.rst	Embedding -> latent factors	May 2, 2017
circle.yml	circle.yml	Add more tasks to makefile	Apr 20, 2017
requirements.testing.txt	requirements.testing.txt	Add circleci testing configuration	Apr 20, 2017
requirements.txt	requirements.txt	Change doc theme	Apr 21, 2017
setup.cfg	setup.cfg	Add setup.cfg	Apr 18, 2017
setup.py	setup.py	Embedding -> latent factors	May 2, 2017
tox.ini	tox.ini	Add rednose and mock to tox.ini	Apr 18, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HiDi: Pipelines for Latent Factor Modeling

How Do I Use It?

Setup

Requirements

Installation

Run the Tests

About

Releases

Packages

Contributors 2

Languages

License

kahnvex/hidi

Folders and files

Latest commit

History

Repository files navigation

HiDi: Pipelines for Latent Factor Modeling

How Do I Use It?

Setup

Requirements

Installation

Run the Tests

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages