This work provides extensive empirical results on training LMs to count. We find that while traditional RNNs trivially achieve inductive counting, Transformers must rely on positional embeddings to count out of domain. Modern RNN variants (e.g., RWKV, Mamba) also largely underperform traditional RNNs at generalizing counting inductively.
The Language Model project is a Java-based N-Gram language model. It predicts up to two words from a single-word input and provides detailed text-analysis statistics. Built to demonstrate object-oriented programming and design principles, it is a useful tool for predictive text input and linguistic analysis.
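
The core idea behind "predict up to two words from a single word" maps naturally onto a bigram frequency table. Below is a minimal, self-contained sketch of that technique; the class and method names (`BigramPredictor`, `train`, `predict`) are illustrative placeholders, not the project's actual API.

```java
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative sketch of a bigram-style predictor: given one input word,
// return up to the two most frequent words observed to follow it in training text.
public class BigramPredictor {
    // Maps each word to a frequency table of the words that follow it.
    private final Map<String, Map<String, Integer>> following = new HashMap<>();

    // Count consecutive word pairs in the training text.
    public void train(String text) {
        String[] words = text.toLowerCase().split("\\s+");
        for (int i = 0; i + 1 < words.length; i++) {
            following
                .computeIfAbsent(words[i], k -> new HashMap<>())
                .merge(words[i + 1], 1, Integer::sum);
        }
    }

    // Return up to two successors of `word`, most frequent first.
    public List<String> predict(String word) {
        Map<String, Integer> counts = following.getOrDefault(word.toLowerCase(), Map.of());
        return counts.entrySet().stream()
                .sorted(Map.Entry.<String, Integer>comparingByValue(Comparator.reverseOrder()))
                .limit(2)
                .map(Map.Entry::getKey)
                .toList();
    }

    public static void main(String[] args) {
        BigramPredictor model = new BigramPredictor();
        model.train("the quick brown fox jumps over the lazy dog the quick red fox");
        System.out.println(model.predict("the"));   // e.g. [quick, lazy]
        System.out.println(model.predict("quick")); // e.g. [brown, red] (tie order may vary)
    }
}
```

A hash map keyed on the preceding word keeps lookups O(1) per prediction; returning the top two entries by count is what bounds the output at "up to two words" when a word has fewer than two observed successors.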