KGCN-pytorch-updated

This is an updated PyTorch implementation of the KGCN model. It builds on two previous works: TensorFlow KGCN and PyTorch KGCN. The former is the official implementation, while the latter is a PyTorch port by @zzaebok. This repo directly modifies his PyTorch KGCN.

1.1. KGCN at a Glance

KGCN (Knowledge Graph Convolutional Networks) is a model for recommender systems. It applies graph convolutional network (GCN) techniques to knowledge graphs in order to make recommendations.

Figure 1: KGCN Framework
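
As a rough summary of the core computation, paraphrased from the referenced paper rather than from this repo's code: u is a user embedding, v an item (entity) embedding, r_{v,e} the relation between v and a neighbor e, N(v) the set of neighbors of v, and S(v) the fixed-size sample of N(v) that is actually used. Only the sum aggregator is shown.

```latex
% User-specific relation score and its normalization over the neighborhood
\pi^{u}_{r_{v,e}} = g(\mathbf{u}, \mathbf{r}_{v,e}), \qquad
\tilde{\pi}^{u}_{r_{v,e}} =
  \frac{\exp\left(\pi^{u}_{r_{v,e}}\right)}
       {\sum_{e' \in \mathcal{N}(v)} \exp\left(\pi^{u}_{r_{v,e'}}\right)}

% Weighted combination of the neighbors' embeddings
\mathbf{v}^{u}_{\mathcal{N}(v)} = \sum_{e \in \mathcal{N}(v)} \tilde{\pi}^{u}_{r_{v,e}} \, \mathbf{e}

% Sum aggregator over the sampled neighborhood, and the final prediction
\mathrm{agg}_{\mathrm{sum}} =
  \sigma\left(\mathbf{W} \cdot \left(\mathbf{v} + \mathbf{v}^{u}_{\mathcal{S}(v)}\right) + \mathbf{b}\right), \qquad
\hat{y}_{uv} = f\left(\mathbf{u}, \mathbf{v}^{u}\right)
```

Here g scores how much a relation matters to a user (for example, an inner product), and f maps the user and the aggregated item representation to a click probability.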

Reference:

Knowledge Graph Convolutional Networks for Recommender Systems. Hongwei Wang, Miao Zhao, Xing Xie, Wenjie Li, Minyi Guo. In Proceedings of The 2019 Web Conference (WWW 2019).

ACM: https://dl.acm.org/citation.cfm?id=3313417

arXiv: https://arxiv.org/abs/1904.12575

Paper With Code: https://paperswithcode.com/paper/190412575

1.2. Running the Code

To see the results for one specific hyperparameter setting, use KGCN.ipynb or KGCN.py.

For batch experiments, use batch_experiments.ipynb.

p.s. KGCN.ipynb and KGCN.py have the same functionality, but the latter is modularized for easy debugging and reuse.

1.3. Dataset

1.3.1. `movie`

The raw rating file for the movie dataset is too large to be included in this repo.

Download the rating data first:

$ wget http://files.grouplens.org/datasets/movielens/ml-20m.zip
$ unzip ml-20m.zip
$ mv ml-20m/ratings.csv data/movie/

1.3.2. `music`

Nothing to do

1.3.3. `product`

This dataset is built upon the Rec-Tmall dataset. Check ./data/product/preprocessing.ipynb for more information.

1.3.4. Other Dataset

If you want to use your own dataset, you need to prepare three files.

Rating Data
- Each row should contain a (user, item, rating) record.
- In this repo, it is stored as a pandas DataFrame (see data_loader.py).
Knowledge Graph
- The knowledge graph consists of (head, relation, tail) triples.
- In this repo, it is stored as a dictionary (see data_loader.py).
Item Index to Entity Index Mapping
- See ./data/product/preprocessing.ipynb for my approach.
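
A minimal sketch of how the three pieces could fit together, using plain Python containers. The adjacency-list layout (entity to a list of (relation, neighbor) pairs), the undirected edges, and all names and values below are assumptions for illustration; check data_loader.py for the structure the repo actually uses.

```python
from collections import defaultdict

# Hypothetical (head, relation, tail) triples, already converted to integer indices.
triples = [(0, 0, 3), (1, 0, 3), (2, 1, 4)]

# Hypothetical mapping from item index (rating data) to entity index (knowledge graph).
item_index_to_entity_index = {101: 0, 102: 1, 103: 2}


def build_kg(triples):
    """Map each entity to the list of (relation, neighbor) pairs it touches."""
    kg = defaultdict(list)
    for head, relation, tail in triples:
        # Assumption: treat the graph as undirected so both endpoints see the edge.
        kg[head].append((relation, tail))
        kg[tail].append((relation, head))
    return dict(kg)


kg = build_kg(triples)

# Rating rows as (user, item, rating); translate each item index to its entity index.
ratings = [(7, 101, 1), (7, 103, 0), (8, 102, 1)]
ratings_as_entities = [
    (user, item_index_to_entity_index[item], label) for user, item, label in ratings
]
```

Keeping the mapping as a separate dictionary means the rating file can keep its original item IDs while the model only ever sees entity indices.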

1.4. Structure

Core files:

data_loader.py
- data loader class for the movie / music datasets
- you don't need it if you build a custom dataset
aggregator.py
- aggregator class, which implements 3 aggregation functions
- and 2 mixers
model.py
- KGCN model network
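
For orientation, the KGCN paper describes three aggregators: sum, concat, and neighbor. The sketch below shows one way they could be written in PyTorch. `ToyAggregator` is a hypothetical class, not the one in aggregator.py, and the tanh activation is an arbitrary choice made for the example.

```python
import torch
import torch.nn as nn


class ToyAggregator(nn.Module):
    """Hypothetical aggregator for illustration; not the class in aggregator.py."""

    def __init__(self, dim, mode="sum"):
        super().__init__()
        if mode not in ("sum", "concat", "neighbor"):
            raise ValueError(f"unknown aggregator: {mode}")
        self.mode = mode
        # concat places both vectors side by side, so its input is twice as wide.
        in_dim = 2 * dim if mode == "concat" else dim
        self.linear = nn.Linear(in_dim, dim)

    def forward(self, self_vectors, neighbor_vectors):
        # Both inputs have shape (batch_size, dim); neighbor_vectors is the
        # already mixed representation of each entity's neighborhood.
        if self.mode == "sum":
            combined = self_vectors + neighbor_vectors
        elif self.mode == "concat":
            combined = torch.cat([self_vectors, neighbor_vectors], dim=-1)
        else:  # "neighbor": ignore the entity's own vector
            combined = neighbor_vectors
        return torch.tanh(self.linear(combined))


# Usage: every mode maps (batch_size, dim) inputs to a (batch_size, dim) output.
v = torch.randn(4, 8)
n = torch.randn(4, 8)
out = ToyAggregator(8, "concat")(v, n)
```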

Figure 2: Dependency Graph

1.5. Comparison with PyTorch KGCN

1.5.1. Add a new handcrafted dataset `product`

Dataset source: Rec-Tmall
Preprocessing script: preprocessing.ipynb

1.5.2. Add a new mixer `transe`

In most cases, the transe mixer performs better in terms of convergence speed, time complexity, loss and AUC at convergence, and final-epoch loss and AUC.

See _mix_neighbor_vectors_TransE() in aggregator.py

1.5.3. Add `batch_experiments.ipynb` for convenient experiment

Automatically conduct multiple experiments:

- Configure many parameter sets at once
- Run
- Check the results in a well-formatted print-out
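
The workflow above can be sketched as a small parameter grid. This is not the notebook's code: the parameter names, the values, and the `run_experiment` placeholder are all hypothetical.

```python
from itertools import product

# Hypothetical search space; the keys may not match the notebook's parameter names.
search_space = {
    "dataset": ["music", "movie"],
    "aggregator": ["sum", "concat"],
    "n_neighbor": [4, 8],
}


def expand_grid(space):
    """Return one dict per combination of the listed values."""
    keys = list(space)
    return [dict(zip(keys, values)) for values in product(*(space[k] for k in keys))]


def run_experiment(config):
    """Placeholder for a real training run; returns the config with an empty metric."""
    return {**config, "auc": None}


configs = expand_grid(search_space)
results = [run_experiment(config) for config in configs]

# One line per experiment, with the settings and the metric side by side.
for row in results:
    print(" | ".join(f"{key}={value}" for key, value in row.items()))
```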

1.5.4. Fix RuntimeError: indices should be either on cpu or on the same device as the indexed tensor (CPU)

Before

https://github.com/zzaebok/KGCN-pytorch/blob/3b0bb56da4b6759d204de06f1d4547e9b4abe3ce/model.py#L79-L80

After

            neighbor_entities = torch.LongTensor(self.adj_ent[entities[h].cpu()]).view((self.batch_size, -1)).to(self.device)
            neighbor_relations = torch.LongTensor(self.adj_rel[entities[h].cpu()]).view((self.batch_size, -1)).to(self.device)
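
This error is raised when a tensor stored on the CPU is indexed with indices that live on another device. A minimal sketch of the pattern used in the fix, with toy sizes and standalone variables standing in for the model's attributes (all hypothetical):

```python
import torch

# Hypothetical stand-ins for the model's attributes.
num_ent, n_neighbor, batch_size = 6, 2, 3
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# The adjacency table is kept on the CPU.
adj_ent = torch.arange(num_ent * n_neighbor, dtype=torch.long).view(num_ent, n_neighbor) % num_ent

# A batch of entity indices that already sits on the training device.
entities = torch.tensor([[0], [3], [5]], dtype=torch.long, device=device)

# Move the indices to the CPU before indexing the CPU table,
# then move the gathered neighbors back to the training device.
neighbor_entities = adj_ent[entities.cpu()].view(batch_size, -1).to(device)
```

An alternative would be to keep the adjacency tables on the training device; the approach shown here keeps them on the CPU and only transfers the gathered rows.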

1.5.5. Fix IndexError: index out of range in self

Before

https://github.com/zzaebok/KGCN-pytorch/blob/3b0bb56da4b6759d204de06f1d4547e9b4abe3ce/model.py#L34-L35

After

        self.adj_ent = torch.zeros(self.num_ent, self.n_neighbor, dtype=torch.long)
        self.adj_rel = torch.zeros(self.num_ent, self.n_neighbor, dtype=torch.long)
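
One plausible reading of this fix, stated as an assumption: rows of the adjacency tables that are never filled (entities with no entry in the knowledge graph) could hold arbitrary values if the tensors were allocated without initialization, and such values can fall outside the embedding table. With zero-initialized tensors, every row holds a valid index from the start. The toy graph and sizes below are hypothetical.

```python
import random

import torch

random.seed(0)

# Hypothetical toy graph: entity -> list of (relation, neighbor) pairs.
# Entities 3 and 4 have no entry, so their rows are never overwritten.
kg = {0: [(0, 1), (1, 2)], 1: [(0, 0)], 2: [(1, 0)]}
num_ent, n_neighbor = 5, 2

# Zero-initialized tables: every row holds a valid index from the start.
adj_ent = torch.zeros(num_ent, n_neighbor, dtype=torch.long)
adj_rel = torch.zeros(num_ent, n_neighbor, dtype=torch.long)

for entity, edges in kg.items():
    # Sample with replacement when an entity has fewer than n_neighbor edges.
    if len(edges) >= n_neighbor:
        sampled = random.sample(edges, n_neighbor)
    else:
        sampled = random.choices(edges, k=n_neighbor)
    adj_rel[entity] = torch.tensor([rel for rel, _ in sampled])
    adj_ent[entity] = torch.tensor([ent for _, ent in sampled])

# Every stored index is now safe to use in an embedding lookup.
entity_emb = torch.nn.Embedding(num_ent, 4)
neighbor_vectors = entity_emb(adj_ent)  # shape: (num_ent, n_neighbor, 4)
```

Note that unfilled rows then point at entity 0 and relation 0, which is a valid index but not a real neighbor.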
