OdedMous/README.md

About

Hi there! 👋

I like to explore the past to shape a smarter future.

Selected Projects

In this project, I developed an NLP classifier for predicting medical domains in texts using a Siamese Neural Network.

The approach trains a simple classifier on a compact representation of each text: a vector of distances from the text to a set of text prototypes. The distance measure is learned by the Siamese (twin) network, which is built on a PubMedBERT model.
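
A minimal sketch of the prototype-distance representation, under stated assumptions: `euclidean` is only a stand-in for the distance that the project learns with the Siamese network, and the 2-D embeddings and the names `prototype_features` and `prototypes` are hypothetical, chosen for illustration.

```python
import math

def euclidean(u, v):
    """Stand-in distance (assumption); the project learns this measure instead."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def prototype_features(embedding, prototypes, distance=euclidean):
    """Represent a text by its distance to each prototype embedding."""
    return [distance(embedding, p) for p in prototypes]

# Hypothetical 2-D embeddings for two domain prototypes.
prototypes = [[0.0, 0.0], [3.0, 4.0]]
features = prototype_features([0.0, 0.0], prototypes)
```

The resulting `features` vector has one entry per prototype and is what a simple downstream classifier would consume.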

NLP models require large amounts of annotated data, which are often difficult to obtain, especially in specialized fields like medicine. 💊

To address this challenge, we propose a multi-head BERT-based architecture, trained simultaneously on a 'main task' (which has limited data) and 'supporting tasks' (which have abundant data), enabling cross-task knowledge transfer.
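
A toy sketch of the shared-encoder, multi-head idea, assuming a weighted sum of per-task losses. Everything here is hypothetical: `shared_encoder` stands in for the BERT encoder, the linear heads and squared-error loss are placeholders, and the task names and weights are made up for illustration.

```python
def shared_encoder(tokens):
    """Hypothetical stand-in for the shared encoder: maps tokens to features."""
    return [float(len(tokens)), float(sum(len(t) for t in tokens))]

def make_head(weights):
    """Build a task-specific linear head over the shared features."""
    def head(features):
        return sum(w * f for w, f in zip(weights, features))
    return head

# One head per task; all heads read the same encoder output.
heads = {"main": make_head([0.5, 0.1]), "support": make_head([-0.2, 0.3])}

def joint_loss(batch_by_task, task_weights):
    """Weighted sum of per-task squared errors (placeholder loss)."""
    total = 0.0
    for task, examples in batch_by_task.items():
        for tokens, target in examples:
            pred = heads[task](shared_encoder(tokens))
            total += task_weights[task] * (pred - target) ** 2
    return total

batch = {"main": [(["a", "bc"], 1.0)], "support": [(["abc"], 0.0)]}
loss = joint_loss(batch, {"main": 1.0, "support": 0.5})
```

Because every head's error flows through the same encoder, gradient updates driven by the data-rich supporting tasks also shape the features used by the main task.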

This approach is inspired by how humans generalize knowledge across different domains. For example, learning to play the piano enhances hand-eye coordination, which can then improve performance in other activities, such as basketball.

Large Language Models are pre-trained on vast public data collected from the Internet, which likely contains private or sensitive information. Combined with the fact that large models tend to memorize training data, this scenario poses a potential risk of data leakage. 🔓

In this project, I reimplemented a decoding method that ensures privacy with high probability, based on the paper "Differentially Private Decoding in Large Language Models" (Majmudar et al., 2022), and experimented on GPT-2 and ViLT (Vision-and-Language Transformer) models.
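
A minimal sketch of the decoding step, assuming the perturbation is a linear interpolation between the model's next-token distribution and the uniform distribution (my reading of the paper's mechanism). The function names and the parameter name `lam` are hypothetical, and the privacy accounting itself is not shown.

```python
import random

def perturb_distribution(probs, lam):
    """Mix a next-token distribution with the uniform distribution.

    lam = 1.0 keeps the original distribution; lam = 0.0 yields pure uniform.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must be in [0, 1]")
    uniform = 1.0 / len(probs)
    return [lam * p + (1.0 - lam) * uniform for p in probs]

def sample_token(probs, lam, rng=random):
    """Sample a token index from the perturbed distribution."""
    mixed = perturb_distribution(probs, lam)
    return rng.choices(range(len(mixed)), weights=mixed, k=1)[0]

mixed = perturb_distribution([0.7, 0.2, 0.1], lam=0.5)
token = sample_token([0.7, 0.2, 0.1], lam=0.5, rng=random.Random(0))
```

Smaller values of `lam` pull the distribution closer to uniform, trading output quality for a stronger privacy guarantee.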

Sepsis is a life-threatening medical condition that can lead to organ failure and death without timely treatment in the Intensive Care Unit (ICU). Managing septic patients involves the administration of several medications, but there is no universal treatment policy due to patient variability and the complexity of the disease.

In this project, I developed a Reinforcement Learning (RL)-based agent that adjusts medication doses based on real-time clinical information from patients.
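
To illustrate how such an agent can learn from experience, here is a toy sketch of a single tabular Q-learning update. This is a stand-in chosen for brevity and not necessarily the algorithm used in the project; the states, dose levels, reward, and hyperparameters are all hypothetical.

```python
def q_update(q_table, state, action, reward, next_state, alpha=0.1, gamma=0.99):
    """One Q-learning step: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(q_table[next_state].values())
    td_target = reward + gamma * best_next
    q_table[state][action] += alpha * (td_target - q_table[state][action])
    return q_table[state][action]

# Hypothetical discretized patient states and dose levels (toy values only).
doses = ["none", "low", "high"]
q_table = {s: {a: 0.0 for a in doses} for s in ["stable", "critical"]}
new_value = q_update(q_table, "critical", "high", reward=1.0, next_state="stable")
```

A real clinical setting would need a far richer state description than this two-state toy table.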

Blogs

Fast inference is crucial for deep learning models: for deployment on resource-constrained devices, for real-time responses to user requests, and for cost and sustainability reasons. 🌿

In this blog post, I review the “early exiting” method, which reduces inference time by letting samples exit at different depths of the network. Many “easier” samples can exit early and skip unnecessary computation while accuracy is maintained.
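
A minimal sketch of confidence-based early exiting, assuming each stage pairs a layer with its own exit classifier and that a sample exits once the top softmax probability reaches a threshold. The `(layer, classifier)` callables, the `threshold` value, and the toy stages are hypothetical; other exit criteria exist.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def early_exit_predict(x, stages, threshold=0.9):
    """Run stages in order and stop at the first confident exit.

    Returns (predicted_class, exit_depth); the last stage always answers.
    """
    if not stages:
        raise ValueError("at least one stage is required")
    for depth, (layer, classifier) in enumerate(stages, start=1):
        x = layer(x)
        probs = softmax(classifier(x))
        if max(probs) >= threshold:
            break
    return probs.index(max(probs)), depth

# Toy two-stage network on scalar inputs.
stages = [
    (lambda x: x * 2, lambda h: [h, 0.0]),
    (lambda x: x + 1, lambda h: [0.0, h]),
]
easy = early_exit_predict(5.0, stages)
hard = early_exit_predict(0.1, stages)
```

Here the "easy" input is confident after the first stage and skips the second, while the "hard" input runs through the full network.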

Pinned

  1. Cross-Task-Learning-for-Low-Resource-NLP (Public)

    Forked from NivAm12/Enhancing-By-Subtasks-Components

    In this project, we propose to deal with the data scarcity problem in a specific NLP task by harnessing existing annotated datasets from related tasks. Our approach involves training a multi-head a…

    Python 1

  2. Medical-Text-Classification (Public)

    Developed an NLP classifier for detecting medical domains in texts using a Siamese Neural Network

    Jupyter Notebook 8 2

  3. DP-Decoding-in-LLM (Public)

    Experiments with a differentially private decoding strategy for LLMs.

    HTML 2 1

  4. Sepsis-RL (Public)

    My implementation of a Reinforcement Learning framework for Sepsis management.

    Python

  5. Medical-Image-Registration (Public)

    Implement geometry-based and intensity-based registration algorithms between pairs of retinal 2D scans. Libraries: numpy, opencv, skimage

    Python

  6. ElectionsDataAnalysis (Public)

    Analyse the Israeli elections of 2019a, 2019b, 2020 and 2021.

    Jupyter Notebook