This repository has been archived by the owner on Jun 24, 2024. It is now read-only.
Structural Overhaul #162
Merged
Changes from 33 commits
- d84aa7f Create a Model trait (danforbes)
- e0713a1 Bloom model (danforbes)
- 6bfda75 cargo fmt (danforbes)
- 73f59c3 Rename llama-rs to llm-base (danforbes)
- e670c25 Clippy (danforbes)
- c4b4176 Remove redundant associated Model type from Model trait (danforbes)
- 1cf305f Remove associated Layer type from Model trait (danforbes)
- 0d4dde9 cargo fmt (danforbes)
- 849c28d Docs (danforbes)
- 54ad890 Tests and examples (danforbes)
- 4ba7c1c Layers are private (danforbes)
- dcf85ff Merge branch 'main' of github.com:rustformers/llama-rs into dfo/model… (philpax)
- 43ecac1 Merge branch 'main' into dfo/model/bloom (philpax)
- 440bd69 Fix build (philpax)
- 5658484 refactor: introduce llm(-cli) (philpax)
- bcf5627 Fix model name in LLaMA inference example (danforbes)
- 5ac4b79 feat: wire up both bloom/llama to CLI (philpax)
- 1601240 Merge branch 'dfo/model/bloom' of github.com:danforbes/llama-rs into … (philpax)
- 1761512 Add example for testing BLOOM inference (danforbes)
- 8d2d9c6 cargo fmt (danforbes)
- 813bdd1 Add launch.json for debugging loading and inference (danforbes)
- c608b4b Merge branch 'main' into dfo/model/bloom (danforbes)
- e19418c Check tensor dimensions when loading (danforbes)
- e35f93b `Model` -> `KnownModel`, `ErasedModel -> Model` (danforbes)
- 288df7f Merge branch 'main' into dfo/model/bloom (danforbes)
- 0aea8f7 Refactor ggml stuff into a single crate (danforbes)
- 8594ac8 Use latest upstream ggml with alibi (danforbes)
- a542c98 Improve examples (danforbes)
- 16fca15 Latest upstream ggml (danforbes)
- 974d2f7 Cleanup README (danforbes)
- 1abaa41 Rebase fix (danforbes)
- f994fa8 GPT2/Cerebras loading and inference (danforbes)
- ff99a80 Rebase & remove BLOOM (danforbes)
- 454f3a9 GitHub Action should support Git submodules (danforbes)
- e69d487 Fix binary file name in README (danforbes)
- 608090b ggml-rs -> ggml (danforbes)
- 78db42c Add back BLOOM (danforbes)
- 1eb2e11 feat: re-enable BLOOM for now (philpax)
- 181d823 refactor: reintroduce ggml-sys and bindgen tool (philpax)
- 9314c68 fix: check out submodules for clippy CI (philpax)
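
Commit e35f93b in the list above renames `Model` to `KnownModel` and `ErasedModel` to `Model`, which suggests a split between an architecture-specific trait and a type-erased one. The following is only a minimal sketch of that general pattern, not the PR's actual API: every method name, struct, and hyperparameter field here is a hypothetical chosen for illustration.

```rust
use std::fmt::Debug;

// Hypothetical sketch: names and signatures are illustrative assumptions,
// not the traits defined in this PR.

/// Architecture-specific trait. The associated type means it cannot be used
/// as a plain `dyn KnownModel` without naming that type.
trait KnownModel {
    type Hyperparameters: Debug;
    fn architecture(&self) -> &'static str;
    fn hyperparameters(&self) -> &Self::Hyperparameters;
}

/// Object-safe trait that callers can hold as `Box<dyn Model>`.
trait Model {
    fn describe(&self) -> String;
}

/// Blanket impl: every `KnownModel` is usable through the erased `Model` trait.
impl<M: KnownModel> Model for M {
    fn describe(&self) -> String {
        format!("{}: {:?}", self.architecture(), self.hyperparameters())
    }
}

#[derive(Debug)]
struct LlamaParams {
    n_layer: usize,
}

struct Llama {
    params: LlamaParams,
}

impl KnownModel for Llama {
    type Hyperparameters = LlamaParams;
    fn architecture(&self) -> &'static str {
        "llama"
    }
    fn hyperparameters(&self) -> &LlamaParams {
        &self.params
    }
}

#[derive(Debug)]
struct BloomParams {
    n_head: usize,
}

struct Bloom {
    params: BloomParams,
}

impl KnownModel for Bloom {
    type Hyperparameters = BloomParams;
    fn architecture(&self) -> &'static str {
        "bloom"
    }
    fn hyperparameters(&self) -> &BloomParams {
        &self.params
    }
}

/// Stores two different architectures behind the same erased trait.
fn example_models() -> Vec<Box<dyn Model>> {
    let llama: Box<dyn Model> = Box::new(Llama { params: LlamaParams { n_layer: 32 } });
    let bloom: Box<dyn Model> = Box::new(Bloom { params: BloomParams { n_head: 16 } });
    vec![llama, bloom]
}
```

With a split like this, a CLI could pick an architecture at runtime and keep the result in one `Box<dyn Model>`, while each model crate still exposes its own strongly typed hyperparameters.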
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
[submodule "ggml-rs/ggml"] | ||
path = ggml-rs/ggml | ||
url = git@github.com:ggerganov/ggml.git |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,44 @@ | ||
{ | ||
// Use IntelliSense to learn about possible attributes. | ||
// Hover to view descriptions of existing attributes. | ||
// For more information, visit: https://go.microsoft.com/fwlink/?linkid=830387 | ||
"version": "0.2.0", | ||
"configurations": [ | ||
{ | ||
"type": "lldb", | ||
"request": "launch", | ||
"name": "Debug example 'gpt2_inference'", | ||
"cargo": { | ||
"args": [ | ||
"build", | ||
"--example=gpt2_inference", | ||
"--package=gpt2" | ||
], | ||
"filter": { | ||
"name": "gpt2_inference", | ||
"kind": "example" | ||
} | ||
}, | ||
"args": ["${env:HOME}/.ggml-models/cerebras-gpt-13b.bin"], | ||
"cwd": "${workspaceFolder}" | ||
}, | ||
{ | ||
"type": "lldb", | ||
"request": "launch", | ||
"name": "Debug example 'llama_inference'", | ||
"cargo": { | ||
"args": [ | ||
"build", | ||
"--example=llama_inference", | ||
"--package=llama" | ||
], | ||
"filter": { | ||
"name": "llama_inference", | ||
"kind": "example" | ||
} | ||
}, | ||
"args": ["${env:HOME}/.ggml-models/gpt4all-7b.bin"], | ||
"cwd": "${workspaceFolder}" | ||
} | ||
] | ||
} |
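
Both debug configurations above pass a single program argument: the path to a model file. As a hedged illustration of how an example binary might consume that argument, here is a small sketch. The helper name `model_path_from_args` and the usage message are assumptions made for this example; the actual `gpt2_inference` and `llama_inference` examples may parse their arguments differently.

```rust
use std::path::PathBuf;

/// Hypothetical helper: treats the first positional argument as the model path.
/// The iterator is expected to yield the program name first, as `std::env::args()` does.
fn model_path_from_args<I>(mut args: I) -> Result<PathBuf, String>
where
    I: Iterator<Item = String>,
{
    // The first item is the program name; fall back to a placeholder if it is absent.
    let program = args.next().unwrap_or_else(|| "example".to_string());
    match args.next() {
        Some(path) => Ok(PathBuf::from(path)),
        None => Err(format!("usage: {program} <path-to-model.bin>")),
    }
}
```

An example's `main` could call `model_path_from_args(std::env::args())` and print the error before exiting when no path is supplied, which is the case the debugger configuration avoids by always providing one.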
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,16 +1,20 @@ | ||
[workspace] | ||
members = [ | ||
"ggml-sys", | ||
"ggml", | ||
"ggml-format", | ||
"llama-rs", | ||
"llama-cli", | ||
"generate-ggml-bindings" | ||
# Crates | ||
"ggml-rs", | ||
"llm-base", | ||
"gpt2", | ||
"llama", | ||
"llm", | ||
"llm-cli", | ||
] | ||
resolver = "2" | ||
| ||
[workspace.package] | ||
version = "0.1.0" | ||
| ||
[workspace.dependencies] | ||
bytemuck = "1.13.1" | ||
log = "0.4" | ||
rand = "0.8.5" | ||
serde = { version = "1.0", features = ["derive"] } |
This file was deleted.
I'm thinking we'll remove this wording as we grow to accommodate more LLMs. I'll revise the wording on this after this PR lands, so nothing for you to do here - just mentioning it.