Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Models: Add Nous-Hermes-2-Mistral-7B-DPO.Q4_0.gguf #2027

Merged

Conversation

ThiloteE
Copy link
Collaborator

@ThiloteE ThiloteE commented Feb 25, 2024

Adds Nous-Hermes-2-Mistral-7B-DPO.Q4_0.gguf, which is the new 7b flagship model of NousResearch, to models3.json.

Original Model location:

https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO-GGUF

Model description:

Nous Hermes 2 on Mistral 7B DPO is the new flagship 7B Hermes! This model was DPO'd from Teknium/OpenHermes-2.5-Mistral-7B and has improved across the board on all benchmarks tested - AGIEval, BigBench Reasoning, GPT4All, and TruthfulQA.

The model prior to DPO was trained on 1,000,000 instructions/chats of GPT-4 quality or better, primarily synthetic data as well as other high quality datasets, available from the repository teknium/OpenHermes-2.5.

Original Dataset Location:

https://huggingface.co/datasets/teknium/OpenHermes-2.5

Dataset description:

This is the dataset that made OpenHermes 2.5 and Nous Hermes 2 series of models.

The Open Hermes 2/2.5 and Nous Hermes 2 models have made significant advancements of SOTA LLM's over recent months, and are underpinned by this exact compilation and curation of many open source datasets and custom created synthetic datasets.

The Open Hermes 2.5 dataset is a continuation of the Open Hermes 1 dataset, at a much larger scale, much more diverse, and much higher quality compilation, reaching 1M, primarily synthetically generated instruction and chat samples.

Describe your changes

Issue ticket number and link

Checklist before requesting a review

  • I have performed a self-review of my code.
  • If it is a core feature, I have added thorough tests.
  • I have added thorough documentation for my code.
  • I have tagged PR with relevant project labels. I acknowledge that a PR without labels may be dismissed.
  • If this PR addresses a bug, I have provided both a screenshot/video of the original bug and the working solution.

Screenshots

image
image

Notes

  • I inspected the OpenHermes-2.5 dataset, which to 99.9% does not contain any system prompts, so this is a model that should work really well with various system prompts.
  • sideloaded and tested a little (nothing extensive) on Windows 10 with Nvidia 1060 3GB and AMD Ryzen 5 5600 with GPT4All 2.7.1

Adds Nous-Hermes-2-Mistral-7B-DPO.Q4_0.gguf, which is the new 7b flagship model of NousResearch.

**Original Model location:**

https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO-GGUF

**Model description:**

Nous Hermes 2 on Mistral 7B DPO is the new flagship 7B Hermes! This model was DPO'd from Teknium/OpenHermes-2.5-Mistral-7B and has improved across the board on all benchmarks tested - AGIEval, BigBench Reasoning, GPT4All, and TruthfulQA.

The model prior to DPO was trained on 1,000,000 instructions/chats of GPT-4 quality or better, primarily synthetic data as well as other high quality datasets, available from the repository teknium/OpenHermes-2.5.

**Original Dataset Location:**

https://huggingface.co/datasets/teknium/OpenHermes-2.5

**Dataset description:**

This is the dataset that made OpenHermes 2.5 and Nous Hermes 2 series of models.

The Open Hermes 2/2.5 and Nous Hermes 2 models have made significant advancements of SOTA LLM's over recent months, and are underpinned by this exact compilation and curation of many open source datasets and custom created synthetic datasets.

The Open Hermes 2.5 dataset is a continuation of the Open Hermes 1 dataset, at a much larger scale, much more diverse, and much higher quality compilation, reaching 1M, primarily synthetically generated instruction and chat samples.



Signed-off-by: ThiloteE <73715071+ThiloteE@users.noreply.github.com>
@ThiloteE ThiloteE added the models.json This requires a change to the official model list. label Feb 25, 2024
@ThiloteE ThiloteE changed the title Add-Nous-Hermes-2-Mistral-7B-DPO.Q4_0.gguf Add Nous-Hermes-2-Mistral-7B-DPO.Q4_0.gguf Feb 25, 2024
@ThiloteE ThiloteE changed the title Add Nous-Hermes-2-Mistral-7B-DPO.Q4_0.gguf Models: Add Nous-Hermes-2-Mistral-7B-DPO.Q4_0.gguf Feb 25, 2024
@Titaniumtown
Copy link

Been using this model for a bit now. Big improvement over OpenOrca.

@manyoso manyoso self-requested a review February 27, 2024 14:28
@manyoso manyoso merged commit 713afb7 into nomic-ai:main Feb 27, 2024
2 checks passed
@ThiloteE ThiloteE deleted the Add-Nous-Hermes-2-Mistral-7B-DPO.Q4_0.gguf branch June 24, 2024 08:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
models.json This requires a change to the official model list.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants