End-to-End Named Entity Recognition (NER) with BERT Transformed learning on GCP (Docker & CircleCI)

This repository implements a complete Named Entity Recognition (NER) pipeline using a pre-trained Hugging Face Transformers model (BERT). It enables to identify and classify named entities (e.g. people, organizations, locations) within text data. The pipeline leverages the power of Google Cloud Platform (GCP) for deployment and scalability, containerized with Docker for portability, and streamlined with CircleCI for continuous integration and continuous delivery (CI/CD).

Features

Leverages pre-trained BERT model from Hugging Face Transformers for efficient and accurate NER.
Provides a user-friendly interface to process text data and extract named entities.
Scales seamlessly on GCP for handling large text datasets.
Encapsulated in Docker containers for easy deployment across various environments.
Automated CI/CD pipeline through CircleCI for streamlined development and deployment.

Flow Diaglram of MLops pipeline:

Workflows

constants
config_entity
artifact_entity
components
pipeline
app.py

Git commands

git add .

git commit -m "Updated"

git push origin main

GCP Configuration

#Gcloud cli download link: https://cloud.google.com/sdk/docs/install#windows

gcloud init

How to run?

conda create -n nerproj python=3.8 -y

conda activate nerproj

pip install -r requirements.txt

python app.py

GCP CICD Deployment with CircleCI:

artifact registry --> create a repository
change line 42,50,72,76,54 in circleci config
Opne circleci --> create a project

Set Environment variables in CircleCI

GCLOUD_SERVICE_KEY --> service account

GOOGLE_COMPUTE_ZONE = asia-south1

GOOGLE_PROJECT_ID

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.circleci		.circleci
flowchart		flowchart
ner		ner
notebooks		notebooks
scrips		scrips
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
setup.py		setup.py
template.py		template.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

End-to-End Named Entity Recognition (NER) with BERT Transformed learning on GCP (Docker & CircleCI)

Features

Flow Diaglram of MLops pipeline:

Workflows

Git commands

GCP Configuration

How to run?

GCP CICD Deployment with CircleCI:

Set Environment variables in CircleCI

Create a VM instances & setup scripts

About

Releases

Packages

Languages

License

data-pioneer/MLops-Name-Entity-Recognition-End-to-End-main

Folders and files

Latest commit

History

Repository files navigation

End-to-End Named Entity Recognition (NER) with BERT Transformed learning on GCP (Docker & CircleCI)

Features

Flow Diaglram of MLops pipeline:

Workflows

Git commands

GCP Configuration

How to run?

GCP CICD Deployment with CircleCI:

Set Environment variables in CircleCI

Create a VM instances & setup scripts

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages