README

Project Structure

.
├── final_data/              # Contains preprocessed data
│   ├── test_df.pkl         # Preprocessed test data
│   └── train_df.pkl        # Preprocessed train data
├── final_features/         # Feature csvs
│   ├── 512_RoBERTa_attention/
│   ├── gemini_attention/
│   ├── period_longformer_att/
│   ├── time_series/
│   └── word_attention_glove/
├── models/                 # Models Stored here (empty now)
└── *.ipynb                # Jupyter notebooks for different components

Running the Code

Prerequisites

Python 3.x
Required packages (Python, TensorFlow, PyTorch, xgboost, catboost, Transformers)

Data Preparation

The preprocessed data is stored in final_data directory in pickle format (.pkl). If you need to preprocess raw data:

Open preprocessing.ipynb
Modify the following variables to point to your raw data:
- train_path: Path to your training CSV file
- test_path: Path to your test CSV file
Run the notebook to generate the preprocessed data files

Model Training and Evaluation

All metaclassifier notebooks are self-contained and can be run directly from their respective locations. They expect the preprocessed data to be in the final_features directory.

Available metaclassifiers:

MetaClassifier_CatBoost.ipynb
MetaClassifier_LSTM.ipynb
MetaClassifier_TabNet_PCA.ipynb
MetaClassifier_TabNet.ipynb
MetaClassifier_XGBoost.ipynb

Feature Engineering

Additional feature engineering notebooks are provided:

feature_engineering.ipynb
512_RoBERTA_attention.ipynb
period_gemini_att.ipynb
period_longformer_attention.ipynb
word_glove_attention.ipynb
gemini_embed_generator.ipynb

Each notebook is self contained and can be run independently as long as the preprocessed data is available in the expected location. For generating gemini embeddings an API key is required.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
code		code
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
report.pdf		report.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README

Project Structure

Running the Code

Prerequisites

Data Preparation

Model Training and Evaluation

Feature Engineering

About

Releases

Packages

Languages

shashuat/EnsLM

Folders and files

Latest commit

History

Repository files navigation

README

Project Structure

Running the Code

Prerequisites

Data Preparation

Model Training and Evaluation

Feature Engineering

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages