The project is an implementation of a generic parser that takes a lexicon file, a grammar file and an input file and produces a concise abstract syntax tree. The desired structure of the tree is defined by the grammar file.

iensen/genparser

To start using the project, download the zip archive: https://github.com/iensen/genparser/archive/master.zip

CONTENTS

  • docs/main/ -- documentation
  • src/astgen/ -- implementation consisting of several python modules
  • .gitignore -- the files ignored by this git repository
  • README.md -- this readme file

PREREQUISITES

  • Python version 3.4 or higher

INSTALL

Unzip the archive and change to the src/astgen/ folder.


EXECUTE

The command line syntax is:

python main.py path_to_lexicon_file path_to_grammar_file path_to_input_file [-s] [-b]

The lexicon file should contain declarations of lexeme types, one per line, as defined in section 2.1 of https://github.com/iensen/genparser/blob/master/docs/main/astgen.pdf?raw=true

The grammar file should contain grammar rules, one per line, as defined in section 2.2 of https://github.com/iensen/genparser/blob/master/docs/main/astgen.pdf?raw=true

The input file is an ASCII file, as defined in section 3.3 of https://github.com/iensen/genparser/blob/master/docs/main/astgen.pdf?raw=true

The optional argument -s tells the parser not to skip spaces (by default, all lexemes of type 'spaces' are removed from the sequence before parsing).

The optional argument -b tells the lexer to add the built-in lexemes 'num', 'id' and 'spaces' to the lexicon (by default, they are not added).
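The command line described above can be sketched with Python's standard argparse module. This is only an illustration of the documented syntax, not the code in main.py: the attribute names keep_spaces and add_builtins, the helper drop_spaces, and the representation of lexemes as (type, text) pairs are all assumptions made for the example.

```python
import argparse


def build_arg_parser():
    """Mirror the documented syntax: three file paths plus optional -s and -b."""
    parser = argparse.ArgumentParser(prog="main.py")
    parser.add_argument("lexicon_file", help="path to the lexicon file")
    parser.add_argument("grammar_file", help="path to the grammar file")
    parser.add_argument("input_file", help="path to the input file")
    # The dest names are hypothetical; only the flags themselves are documented.
    parser.add_argument("-s", dest="keep_spaces", action="store_true",
                        help="do not skip lexemes of type 'spaces'")
    parser.add_argument("-b", dest="add_builtins", action="store_true",
                        help="add the built-in lexemes 'num', 'id' and 'spaces'")
    return parser


def drop_spaces(tokens):
    """Remove (type, text) pairs whose type is 'spaces' (the default behaviour)."""
    return [tok for tok in tokens if tok[0] != "spaces"]


args = build_arg_parser().parse_args(
    ["tests/arith_expr/lexicon", "tests/arith_expr/grammar",
     "tests/arith_expr/input", "-b"])

# A made-up lexeme sequence, assumed to be a list of (type, text) pairs.
tokens = [("num", "1"), ("spaces", " "), ("num", "2")]
if not args.keep_spaces:
    tokens = drop_spaces(tokens)

print(args.add_builtins, tokens)
```

With -b given and -s omitted, the sketch reports that the built-in lexemes were requested and that the 'spaces' entry has been filtered out of the sequence.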

EXAMPLES

Examples can be found in the src/astgen/tests folder of the distribution.

The execution traces for two of them are given below:

:~/src/astgen$ python3 main.py tests/arith_expr/lexicon tests/arith_expr/grammar tests/arith_expr/input -b
['add', ('num', '1'), ['mult', ('num', '2'), ('num', '3')]]
:~/src/astgen$ python3 main.py tests/chess/lexicon tests/chess/grammar tests/chess/input
['game',
 ['move',
  ('move_id', '1.'),
  ['pawn_move',
   ('cell', 'e4')
  ],
  ['pawn_move',
   ('cell', 'e5')
  ]
 ],
 ['move',
  ('move_id', '2.'),
  ['move',
   ['fig',
    ('figure', 'Q')
   ],
   ('cell', 'h5')
  ],
  ['move',
   ['fig',
    ('figure', 'N')
   ],
   ('cell', 'c6')
  ]
 ],
 ['move',
  ('move_id', '3.'),
  ['move',
   ['fig',
    ('figure', 'B')
   ],
   ('cell', 'c4')
  ],
  ['move',
   ['fig',
    ('figure', 'N')
   ],
   ('cell', 'f6')
  ]
 ]
]
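The traces above print the tree as nested Python literals: an inner node is a list whose first element is a label, and a leaf is a (type, text) tuple. Assuming that shape, the printed text can be read back with ast.literal_eval and walked recursively. The functions leaves and evaluate below are hypothetical helpers, not part of the project, and treating 'add' and 'mult' as addition and multiplication is an assumption based on the label names.

```python
import ast


def leaves(node):
    """Collect the (type, text) leaves of a tree in left-to-right order."""
    if isinstance(node, tuple):
        return [node]
    found = []
    for child in node[1:]:  # node[0] is the label of the inner node
        found.extend(leaves(child))
    return found


def evaluate(node):
    """Evaluate an arithmetic tree built from 'add', 'mult' and 'num' nodes."""
    if isinstance(node, tuple):
        kind, text = node
        if kind != "num":
            raise ValueError("unexpected leaf type: " + kind)
        return int(text)
    label, left, right = node
    if label == "add":
        return evaluate(left) + evaluate(right)
    if label == "mult":
        return evaluate(left) * evaluate(right)
    raise ValueError("unexpected node label: " + label)


# The first trace, copied as text and parsed back into lists and tuples.
printed = "['add', ('num', '1'), ['mult', ('num', '2'), ('num', '3')]]"
tree = ast.literal_eval(printed)

print(leaves(tree))    # [('num', '1'), ('num', '2'), ('num', '3')]
print(evaluate(tree))  # 7
```

The same leaves function applies unchanged to the chess trace, since it relies only on the list-versus-tuple distinction and not on any particular label.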
