Fixes #12414: Add cheaper model and long context model for Qwen2.5-72B-Instruct from siliconflow #12415

Merged 1 commit into langgenius:main on Jan 7, 2025

@senseb (Contributor) commented Jan 7, 2025

Summary

Fixes #12414

Add cheaper model and long context model for Qwen2.5-72B-Instruct from siliconflow

Screenshots

Before After
... ...

Checklist

Important

Please review the checklist below before submitting your pull request.

  • This change requires a documentation update, included: Dify Document
  • I understand that this PR may be closed in case there was no previous discussion or issues. (This doesn't apply to typos!)
  • I've added a test for each change that was introduced, and I tried as much as possible to make a single atomic change.
  • I've updated the documentation accordingly.
  • I ran `dev/reformat` (backend) and `cd web && npx lint-staged` (frontend) to appease the lint gods

@dosubot (bot) added the size:L (This PR changes 100-499 lines, ignoring generated files.) and ⚙️ feat:model-runtime labels Jan 7, 2025
@dosubot (bot) added the lgtm (This PR has been approved by a maintainer) label Jan 7, 2025
@crazywoola merged commit dc650c5 into langgenius:main Jan 7, 2025
5 checks passed
@senseb deleted the siliconflow-qwen2.5-72b branch January 7, 2025 05:59
Scorpion1221 added a commit to yybht155/dify that referenced this pull request Jan 11, 2025
* commit 'c236f05f4b575884b42fc45819ad71b05a22115c': (135 commits)
  chore: bump version to 0.15.0 (langgenius#12297)
  refactor: enhance API token validation with session locking and last used timestamp update (langgenius#12426)
  Fix newline characters in tables during document parsing (langgenius#12112)
  fix:deepseek tool call not working correctly (langgenius#12437)
  add fish-speech-1.5 from siliconflow (langgenius#12425)
  chore(docker/.env.example): Add TOP_K_MAX_VALUE to the `.env.example`… (langgenius#12422)
  refactor: use tiktoken for token calculation (langgenius#12416)
  fix: [PromptEditorHeightResizeWrap] Bug  langgenius#12410  (langgenius#12406)
  Fixes langgenius#12414: Add cheaper model and long context model for Qwen2.5-72B-Instruct from siliconflow (langgenius#12415)
  Support TTS and Speech2Text for Model Provider GPUStack (langgenius#12381)
  mark deprecated models in siliconflow langgenius#12399 (langgenius#12405)
  fix: http method can be upper case and lower case close langgenius#11877 (langgenius#12401)
  fix: enhance chunk list management with new invalidation keys and imp… (langgenius#12396)
  [Fix] revert sagemaker llm to support model hub (langgenius#12378)
  sandbox doesn't provide auto disable log (langgenius#12388)
  feat: support config top max value by env (langgenius#12375)
  chore: chat app textarea auto focus (langgenius#12366)
  feat: support configuration of refresh token expiration by environment variable (langgenius#12335)
  fix: resolve issue with the opening statement generated by the `AutomaticRes` component failing to sync between states. (langgenius#12349)
  Fix the issue where `TextGeneration` component does not correctly clear input data. (langgenius#12351)
  ...

# Conflicts:
#	api/core/helper/code_executor/code_executor.py
#	api/poetry.lock
alexcodelf pushed a commit to alexcodelf/dify that referenced this pull request Jan 21, 2025