Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SparkNLP 1004 - Introducing MiniCPM #14205

Conversation

prabod
Copy link
Contributor

@prabod prabod commented Mar 13, 2024

Description

MiniCPM is a series of edge-side large language models, with the base model, MiniCPM-2B, having 2.4B non-embedding parameters. It ranks closely with Mistral-7B on comprehensive benchmarks (with better performance in Chinese, mathematics, and coding abilities), surpassing models like Llama2-13B, MPT-30B, and Falcon-40B. On the MTBench benchmark, which is closest to user experience, MiniCPM-2B also outperforms many representative open-source models such as Llama2-70B-Chat, Vicuna-33B, Mistral-7B-Instruct-v0.1, and Zephyr-7B-alpha.

Screenshots (if appropriate):

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)

Checklist:

  • My code follows the code style of this project.
  • My change requires a change to the documentation.
  • I have updated the documentation accordingly.
  • I have read the CONTRIBUTING page.
  • I have added tests to cover my changes.
  • All new and existing tests passed.

@prabod prabod self-assigned this Mar 13, 2024
@maziyarpanahi maziyarpanahi added new-feature Introducing a new feature new model DON'T MERGE Do not merge this PR labels Mar 20, 2024
@prabod prabod force-pushed the SPARKNLP-1004-Implement-MiniCPMForCasualLM-similar-to-Llama-2 branch from 5dc52d8 to a9155e5 Compare July 22, 2024 10:05
@prabod prabod marked this pull request as ready for review July 22, 2024 10:10
@maziyarpanahi maziyarpanahi changed the base branch from master to release/550-release-candidate September 1, 2024 17:58
@maziyarpanahi maziyarpanahi merged commit 2102e2d into release/550-release-candidate Sep 1, 2024
4 checks passed
@coveralls
Copy link

Pull Request Test Coverage Report for Build 10656184612

Warning: This coverage report may be inaccurate.

This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.

Details

  • 0 of 0 changed or added relevant lines in 0 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage remained the same at 62.422%

Totals Coverage Status
Change from base Build 10655996011: 0.0%
Covered Lines: 8970
Relevant Lines: 14370

💛 - Coveralls

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
DON'T MERGE Do not merge this PR new model new-feature Introducing a new feature
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants