CodeGPTPlus/deepseek-coder-1.3b-typescript

irthomasthomas · 2024-02-01T15:15:32Z

CodeGPTPlus/deepseek-coder-1.3b-typescript · Hugging Face

This model was fine-tuned by the CodeGPT team for TypeScript code generation. It starts from deepseek-ai/deepseek-coder-1.3b-base and was trained on a dataset of 0.5B tokens.

The model uses a 16K-token context window and was trained with an additional fill-in-the-middle task, which supports project-level code completion.

How to Use

This model is for code completion only. The examples below show how to use it:

Running the model on a GPU

from transformers import AutoTokenizer, AutoModelForCausalLM

# Load the tokenizer and the model, then move the model to the GPU.
tokenizer = AutoTokenizer.from_pretrained("CodeGPTPlus/deepseek-coder-1.3b-typescript", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("CodeGPTPlus/deepseek-coder-1.3b-typescript", trust_remote_code=True).cuda()

# Fill-in-the-middle prompt: the model generates the code that belongs at the hole marker.
input_text = """<｜fim▁begin｜>function quickSort(arr: number[]): number[] {
  if (arr.length <= 1) {
    return arr;
  }
  const pivot = arr[0];
  const left = [];
  const right = [];
<｜fim▁hole｜>
  return [...quickSort(left), pivot, ...quickSort(right)];
}<｜fim▁end｜>"""

inputs = tokenizer(input_text, return_tensors="pt").to(model.device)
# max_length counts the prompt tokens plus the generated tokens.
outputs = model.generate(**inputs, max_length=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Running with Ollama

Model: https://ollama.ai/codegpt/deepseek-coder-1.3b-typescript
Command: ollama run codegpt/deepseek-coder-1.3b-typescript
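
Once the model is running under Ollama, it can also be queried from a script. The following is a minimal sketch, assuming Ollama is serving its REST API on the default local address (http://localhost:11434) and that the model has already been pulled with the command above. The helper names build_request and complete are illustrative and not part of the model card.

```python
import json
import urllib.request

# Assumption: Ollama's default local endpoint for one-shot generation.
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL = "codegpt/deepseek-coder-1.3b-typescript"


def build_request(prompt: str) -> urllib.request.Request:
    """Build a non-streaming generate request for the local Ollama server."""
    payload = {"model": MODEL, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt: str) -> str:
    """Send the prompt and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(prompt)) as resp:
        return json.loads(resp.read())["response"]
```

Calling complete("function add(a: number, b: number): number {") should return the model's continuation of that prompt, provided the server is reachable.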

Running with Ollama and CodeGPT Autocomplete in VSCode

Documentation: https://docs.codegpt.co/docs/tutorial-features/code_autocompletion
Select "Ollama - codegpt/deepseek-coder-1.3b-typescript" in the autocomplete model selector.

Fill In the Middle (FIM)

<｜fim▁begin｜>function quickSort(arr: number[]): number[] {
  if (arr.length <= 1) {
    return arr;
  }
  const pivot = arr[0];
  const left = [];
  const right = [];
<｜fim▁hole｜>
  return [...quickSort(left), pivot, ...quickSort(right)];
}<｜fim▁end｜>
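
The prompt layout above (begin marker, code before the gap, hole marker, code after the gap, end marker) can be assembled programmatically. This is a minimal sketch, assuming the sentinel tokens are spelled with the fullwidth bar and ▁ characters used by the DeepSeek Coder tokenizer; check the tokenizer's special tokens if in doubt. The helper names build_fim_prompt and fill_hole are hypothetical.

```python
# Assumption: sentinel spellings as in the DeepSeek Coder tokenizer.
FIM_BEGIN = "<｜fim▁begin｜>"
FIM_HOLE = "<｜fim▁hole｜>"
FIM_END = "<｜fim▁end｜>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Wrap the code before and after the gap in the FIM sentinel tokens."""
    return f"{FIM_BEGIN}{prefix}{FIM_HOLE}{suffix}{FIM_END}"


def fill_hole(prefix: str, completion: str, suffix: str) -> str:
    """Splice the model's completion back between the prefix and the suffix."""
    return prefix + completion + suffix


prefix = "function add(a: number, b: number): number {\n"
suffix = "\n}"
prompt = build_fim_prompt(prefix, suffix)
```

The string in prompt can be passed to the tokenizer in place of input_text, and fill_hole rebuilds the complete source once the generated middle section is available.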

Training Procedure

The model was trained using the following hyperparameters:

learning_rate: 2e-05
train_batch_size: 20
eval_batch_size: 20
seed: 42
gradient_accumulation_steps: 2
total_train_batch_size: 40
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-06
lr_scheduler_type: cosine
lr_scheduler_warmup_steps: 261
num_epochs: 1
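
Two of these values are linked by simple arithmetic: the total batch size of 40 is the per-device batch size (20) times the gradient accumulation steps (2), which is consistent with training on a single device (an assumption; the model card does not state the device count). The sketch below also shows what a cosine schedule with linear warmup typically looks like. The total number of training steps is not given in the source, so total_steps is a hypothetical parameter, and the function is an illustration rather than the authors' training code.

```python
import math

LEARNING_RATE = 2e-5      # learning_rate
WARMUP_STEPS = 261        # lr_scheduler_warmup_steps
TRAIN_BATCH_SIZE = 20     # train_batch_size
GRAD_ACCUM_STEPS = 2      # gradient_accumulation_steps


def effective_batch_size(per_device: int, accum: int, num_devices: int = 1) -> int:
    """Samples contributing to one optimizer step (num_devices=1 is assumed)."""
    return per_device * accum * num_devices


def lr_at(step: int, total_steps: int) -> float:
    """Linear warmup to LEARNING_RATE, then half-cosine decay to zero."""
    if step < WARMUP_STEPS:
        return LEARNING_RATE * step / max(1, WARMUP_STEPS)
    progress = (step - WARMUP_STEPS) / max(1, total_steps - WARMUP_STEPS)
    return LEARNING_RATE * 0.5 * (1.0 + math.cos(math.pi * progress))
```

With these definitions, effective_batch_size(TRAIN_BATCH_SIZE, GRAD_ACCUM_STEPS) gives 40, matching total_train_batch_size.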

For more information, visit the model page.

Suggested labels

{ "label-name": "TypeScript-Code-Generation", "description": "Model for generating TypeScript code", "repo": "CodeGPTPlus/deepseek-coder-1.3b-typescript", "confidence": 70.59 }


irthomasthomas added the labels llm (Large Language Models) and finetuning (Tools for finetuning of LLMs, e.g. SFT or RLHF) on Feb 13, 2024

ShellLM mentioned this issue on Apr 22, 2024: GPTScore: A Novel Evaluation Framework for Text Generation Models #811 (open)

ShellLM mentioned this issue on May 1, 2024: Measuring inference speed metrics for hosted and local LLM #822 (open)

ShellLM mentioned this issue on May 9, 2024: DeepSeek-V2: A Strong, Economical, and Efficient MoE LLM of 236B total parameters #831 (open)
