Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL

This repository is the official implementation of the paper, providing a step-by-step tutorial to reproduct the main experiment for a 7b-scale base model with 4xA100 GPUs (~1.5d).

Synthetic CoT data (Sec 3.2) and preprocessed database prompt for Bird dataset (Appendix E) can be downloaded via Google Drive for reproduction.

Code is coming in a while... (not too soon)

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL

About

Releases

Packages

RUCKBReasoning/DPO_Text2SQL

Folders and files

Latest commit

History

Repository files navigation

Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages