Skip to content
@CRC-1597-Small-Data

CRC 1597 Small Data

Official GitHub of the Collaborative Research Center 1597 Small Data funded by the German Research Foundation (DFG)

CRC 1597 Small Data!

Welcome to the SmallData Github, where we showcase projects that have been made in our Collaborative Research Center (CRC). In SmallData, we address data analysis and modeling in small data settings, i.e., when there is only little information in a dataset at hand, due to a small number of observations that carry relevant information, relative to the complexity of novel patterns to be uncovered or the level of heterogeneity across observations.

We focus on

  • Similarity for pulling in additional data of the same type (Project Area A),
  • Transfer for transferring additional information to the dataset at hand, such as from data of different type (Project Area B),
  • Uncertainty for quantifying and reducing uncertainty in particular in similarity and transfer (Project Area C).

This is enabled by a joint methods framework, with a focus on combining knowledge-driven and data-driven modeling. For more information, please visit our website. Subscribe to our event mailing list by following the instructions here.

Public Repository Disclosure

This organization provides a comprehensive overview of all repositories established with funding from SmallData. For the most current iteration of the repository, kindly refer to the original source.

Popular repositories Loading

  1. LatentDynamics.jl LatentDynamics.jl Public

    Forked from maren-ha/LatentDynamics.jl

    Hackenberg M, Pechmann A, Kreutz C, Kirschner J, Binder H. A statistical approach to latent dynamic modeling with differential equations. 2023. arXiv:2311.16286

    Julia

  2. .github .github Public

  3. ovqa ovqa Public

    Forked from lmb-freiburg/ovqa

    Ging S, Bravo MA, Brox T. Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy. arXiv preprint arXiv:2402.07270. 2024 Feb 11.

    Python

  4. LatentSubgroups LatentSubgroups Public

    Forked from kianaf/LatentSubgroups

    Farhadyar K, Bonofiglio F, Hackenberg M, Behrens M, Zöller D, Binder H. Combining propensity score methods with variational autoencoders for generating synthetic data in presence of latent sub-grou…

    Jupyter Notebook

  5. DontWasteYourTime-early-stopping DontWasteYourTime-early-stopping Public

    Forked from automl/DontWasteYourTime-early-stopping

    Bergman E, Purucker L, Hutter F. Don't Waste Your Time: Early Stopping Cross-Validation. arXiv preprint arXiv:2405.03389. 2024 May 6.

    Python

  6. CVPR24-MedSAM-on-Laptop CVPR24-MedSAM-on-Laptop Public

    Forked from automl/CVPR24-MedSAM-on-Laptop

    Pfefferle A, Purucker L, Hutter F. DAFT: Data-Aware Fine-Tuning of Foundation Models for Efficient and Effective Medical Image Segmentation. InCVPR 2024: Segment Anything In Medical Images On Lapto…

    Python

Repositories

Showing 10 of 26 repositories
  • MetaPB2 Public Forked from automl/MetaPB2

    "Meta-learning Population-based Methods for Reinforcement Learning"

    CRC-1597-Small-Data/MetaPB2’s past year of commit activity
    Jupyter Notebook 0 Apache-2.0 1 0 0 Updated Mar 24, 2025
  • BALViT Public Forked from robot-learning-freiburg/BALViT

    "Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters"

    CRC-1597-Small-Data/BALViT’s past year of commit activity
    0 1 0 0 Updated Mar 11, 2025
  • FunRed Public Forked from SysPsyHertel/FunRed

    R package for computing functional redundancy measures - "Characterising Measures of Functional Redundancy in, Microbiome Communities via Relative Entropy"

    CRC-1597-Small-Data/FunRed’s past year of commit activity
    R 0 GPL-3.0 2 0 0 Updated Mar 6, 2025
  • BGA-Classification Public Forked from CarolaHeinzel/BGA-Classification

    Heinzel CS, Purucker L, Hutter F, Pfaffelhuber P. Advancing Biogeographical Ancestry Predictions Through Machine Learning. bioRxiv. 2025.

    CRC-1597-Small-Data/BGA-Classification’s past year of commit activity
    Python 0 1 0 0 Updated Feb 1, 2025
  • scManifoldDynamics Public Forked from laia-cg/scManifoldDynamics

    Hackenberg M, Canal Guitart L, Backofen R, Binder H. Evaluating discrepancies in dimensionality reduction for time-series single-cell RNA-sequencing data, bioRxiv. Feb 2025.

    CRC-1597-Small-Data/scManifoldDynamics’s past year of commit activity
    Jupyter Notebook 0 1 0 0 Updated Jan 27, 2025
  • tabpfn-client Public Forked from PriorLabs/tabpfn-client

    "Accurate predictions on small data with a tabular foundation model" - ⚡ Easy API access to the tabular foundation model TabPFN ⚡

    CRC-1597-Small-Data/tabpfn-client’s past year of commit activity
    Python 0 Apache-2.0 16 0 0 Updated Jan 9, 2025
  • TabPFN Public Forked from PriorLabs/TabPFN

    "Accurate predictions on small data with a tabular foundation model" - ⚡ TabPFN: Foundation Model for Tabular Data ⚡

    CRC-1597-Small-Data/TabPFN’s past year of commit activity
    Python 0 282 0 0 Updated Jan 9, 2025
  • tabpfn-extensions Public Forked from PriorLabs/tabpfn-extensions

    "Accurate predictions on small data with a tabular foundation model" - Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN!

    CRC-1597-Small-Data/tabpfn-extensions’s past year of commit activity
    Python 0 Apache-2.0 18 0 0 Updated Jan 9, 2025
  • HW-GPT-Bench Public Forked from automl/HW-GPT-Bench

    Sukthanker RS, Zela A, Staffler B, et al. HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models [Internet]. In: NeurIPS 2024 Track Datasets and Benchmarks Poster. 2024. Available from: https://openreview.net/pdf?id=urJyyMKs7E

    CRC-1597-Small-Data/HW-GPT-Bench’s past year of commit activity
    Python 0 Apache-2.0 3 0 0 Updated Dec 6, 2024
  • BoostingAutoencoder Public Forked from NiklasBrunn/BoostingAutoencoder

    Brunn N, Hackenberg M, Vogel T, Binder H. Sparse dimensionality reduction for analyzing single-cell-resolved interactions [Internet]. 2024;Available from: https://www.biorxiv.org/content/10.1101/2024.12.01.626228v1

    CRC-1597-Small-Data/BoostingAutoencoder’s past year of commit activity
    Julia 0 MIT 2 0 0 Updated Nov 29, 2024

Top languages

Loading…

Most used topics

Loading…