Skip to content

Recostruct a graph of the relationships between theorems in a math paper in PDF format.

Notifications You must be signed in to change notification settings

gbruno16/Papers_Graph_Generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Graph generator from scientific papers

The script scrapes a given pdf trying to recostruct a graph containing the references between the pages of the paper. This can be particularly useful for long mathematical papers to understand the relationships between many results (theorems, lemmas, definitions...) and the structure of the underlying theory.

How does it work?

The code reconstructs a dictionary with the extracted info by each link. Then plots a graph of the corresponding adjancy matrix (in the next picture the node labels are the page numbers).

Reverse engineering the pdf is quite complex, in particular we can only know the origin page of the link and not from which proof it comes from. On the other hand it's possible to extract the name of the destination object. This allows us to build a table of the most referenced results. (Note that the links have to be clickable)

About

Recostruct a graph of the relationships between theorems in a math paper in PDF format.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published