"Open-source code for TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models" accepted by the main conference of EMNLP 2023.
sympy == 1.7.1
curl https://raw.githubusercontent.com/leanprover/elan/master/elan-init.sh -sSf | sh
source $HOME/.elan/env
python3 -m pip install mathlibtools
cd auto_generate_atp/lean_gym
bash scripts/setup.sh
cd auto_generate_atp/lean_gym
lean --make src
python generate_lemma_from_rule.py
python generate_lemma_from_trigo.py
Generate more complex combinations of numbers, typically these numbers are sampled from a range that is larger than the range of the original generated data.
python generate_lemma_from_rule_generalizaiton_number.py
cd generated_data
python generate_and_emerge_data_from_rules.py
cd ..
python delete_useless_data_step1.py
python generate_train_data_step1.py