Releases · vdemichev/DiaNN

29 Jan 08:46

vdemichev

2.0

d06f8ad

DIA-NN 2.0 Latest

Latest

We are excited to announce DIA-NN 2.0, the most significant milestone in the history of DIA-NN development.

Key Breakthroughs

Proteoform Confidence mode: DIA-NN 2.0 solves the long-standing challenge of DIA proteomics, combining the proteomic depth of DIA with DDA-like identification confidence. First, it features a major improvement in achieving peptidoform confidence: almost all identifications are now peptidoform-confident on modern instruments. Second, the new mode in DIA-NN also extends peptidoform confidence to protein sequences.
Fine-Tuning of Deep Learning Models for specific data and PTMs. This makes DIA-NN applicable to a wide range of PTM-focused applications.

Main advances

Phosphoproteomics and Ubiquitinomics: major improvement in identification numbers.
Scanning PASEF methods support: DIA-NN 2.0 implements a universal algorithm that automatically takes advantage of any kind of Q1 information present in the data, including various schemes based on window overlaps, with or without ion mobility dimension.
Tunable decoy models enable, for the first time, correct handling of peptidomics data that involves peptides with shared sequence patterns.

Improved algorithms

Major changes in DIA-NN architecture (new search logic, decoy generation, neural network module and calibration module), resulting in higher identification numbers.
Fold-change reduction of RAM usage when processing multiplexed data with large libraries.
Support for non-specific digests: with the new Proteoform confidence mode and major RAM usage reductions this finally makes sense for DIA. We envision applications to immunopeptidomics and protease specificity mapping.
Improved QuantUMS quantification for multiplexed DIA.

Reporting and documentation

DIA-NN's PDF report now comprises a set of per-run QC plots, which include distributions of PSMs over RT and IM dimensions, as well as information on peak widths and MS1 mass accuracy. We will add certain extra QC plots in the future. We will also be grateful for any feedback and suggestions from the proteomic community here: DIA-NN calculates a wide variety of QC metrics internally, it can report almost anything.
We have introduced a major update to the DIA-NN documentation, making it significantly more detailed as well as including some tips and best practices when it comes to bioinformatics.

Future roadmap

DIA-NN 2.0 fulfills all the major goals announced with the DIA-NN 1.9 release. In the future, we will switch from major releases to frequent minor updates, continuously incorporating feedback and suggestions from the proteomics community. In addition:

We plan to put stronger emphasis on leveraging experiment-specific deep learning models for boosting identification performance and data completeness.
We have some key improvements of protein quantification in works.
There are some new ways of doing DIA in works.
We will migrate to new Thermo libraries and add native .raw support on Linux
We consider changing the pipeline format to .json, allowing for easy editing of pipelines with scripts

Any updated information on DIA-NN 2.0 as well as release notes for updated versions will be posted here #1366.

Get DIA-NN

The attached binaries are for academic use (please see LICENSE.txt).
We also start the distribution of DIA-NN Enterprise for Industry. To purchase or get a trial license, please contact Aptila Biotech aptila.bio.

Assets 9

21 Oct 08:03

vdemichev

1.9.2

af0e13d

DIA-NN 1.9.2

DIA-NN 1.9.2 is a major update with several key performance and functionality improvements.

Notes

DIA-NN 1.9.2 is a free Academia-only version.
Our spin-off Aptila Biotech is now preparing an enterprise version of DIA-NN for Industry.

Identification performance

Major phosphoproteomics improvement.
Redesigned neural network classifier with on average better performance.
Completely redesigned and improved mass calibration, in particular on Orbitrap and Astral instruments. The algorithm is highly effective but we know how to improve it further in future DIA-NN versions.

Quantification performance

Major improvement of protein quantification with QuantUMS.
The normalisation algorithm has been changed. It is now more reliant on the majority of the proteins being unchanged between samples but yields significantly higher precision and more proteins differentially expressed in most cases.
Improved quantification precision on timsTOF when using MBR.

Speed and memory

Improved ultra-fast mode. Combined with MBR it can now yield near-optimal performance on some phosphoproteomics datasets acquired on Orbitrap/Astral or timsTOF instruments, while providing a several-fold speedup.
Up to several-fold faster analysis of blanks/failed runs.
More than twice reduction of memory usage for the internal representation of the spectral library, this is relevant for large libraries, e.g. for phospho. Library RAM usage will be further reduced in future versions of DIA-NN.
Better control of memory consumption during search with large libraries.

FDR control

New 'Conservative' machine learning mode (experimental), which imposes the theoretical upper bound of a factor of 2 on the possible q-value deflation due to ML overfitting, if any. The mode is meant to be used with MBR.
The --nn-fold 4 option (experimental) that ensures that each neural network in an ensemble is only used for prediction on samples it has not been trained on.
Of note, these functions are normally not needed on 99% of datasets, however if the purpose is to benchmark the software and too-conservative q-values are, due to the design of the experiment, preferable to optimistic q-values, then using these options is recommended.

Usability improvements

Online Skyline installations support. An Administrative install of Skyline is still necessary, but DIA-NN will use it to find and launch the online install, if available.
Ability to run multiple Viewer instances to compare different peptides or runs side-by-side.
A fragment ion coverage plot is added to the Viewer. This is to be used for quick visual reference only, for making meaningful conclusions please rely directly on the extracted chromatograms shown rather than on this plot.
The name of an in silico predicted library to be generated is shown in the GUI with the correct extension.

Fixes

The bug on Linux which manifested as a crash when using the --matrices option has been fixed.
The bug that caused incorrect results when using on-the-fly in silico prediction from FASTA combined with raw files searching in the same DIA-NN run and with peptidoform scoring enabled.

Notes

The documentation will be updated after 1.9.3 release.
An update of the Linux binary was added on October 31, 2024, fixing an issue with memory allocation (no functional changes).
Replacement of library spectra, RT and IM values with in silico predicted ones must not be combined with raw data analysis in this version but instead needs to be carried out in a separate step.

Assets 6

15 Jul 15:22

vdemichev

1.9.1

e4720a4

DIA-NN 1.9.1 Pre-release

Pre-release

DIA-NN 1.9.1 is a minor update of 1.9.

Linux version included
Faster processing of large Slice-PASEF runs
Emprical DIA-based libraries are now saved in .parquet format instead of .tsv: less disk space, precise real numbers
Fixed a bug in the implementation of --no-cut-after-mod
Fixed a bug in processing contaminants when doing on-the-fly FASTA digest and raw data analysis
Default output location in the GUI is now C:/Temp, if exists, instead of the DIA-NN installation folder
The .protein_description.tsv output file contains the protein information used by DIA-NN
The .manifest.txt output file now provides a description of output files produced
Adjusted settings for launching Skyline

For the future DIA-NN roadmap, see https://github.com/vdemichev/DiaNN/releases/tag/1.9

Assets 5

09 Jun 18:26

vdemichev

1.9

0206df3

DIA-NN 1.9

DIA-NN 1.9 release summary

DIA-NN 1.9 is the biggest improvement of DIA-NN so far. Below is the summary of key features, please see the documentation for details.

Peptidoforms
Data-dependent acquisition (DDA) has so far maintained one key advantage over data-independent acquisition (DIA): confidence in peptidoform assignment. That is, with DDA one can be reasonably confident that a peptide is matched to the spectrum of the correct peptidoform (i.e. without amino acid substitutions or other modifications), and that the set of reported modifications (phosphorylation, etc) is correct. Now we achieve this also with DIA, while maintaining all the advantages of DIA and largely preserving its deep proteome coverage. We expect a range of applications, from pQTL analysis in population proteomics to metaproteomics. A preprint describing the new peptidoform-scoring module in DIA-NN is to follow.

Phosphoproteomics
We use the new peptidoform scoring module to significantly improve phosphoproteomics workflows. Moreover, DIA-NN now reports site-specific localisation confidence along with site-level quantities, in a convenient format, greatly simplifying its use for phosphoproteomics.

Multiplexing
DIA-NN 1.9 features a second-generation plexDIA (multiplexing) module, with a significantly enhanced ability to gain channel-specific confidence in peptide and protein identifications. Further, processing of multiplexed DIA data is greatly simplified by convenient output, including channel-specific protein group quantities obtained with QuantUMS.

timsTOF proteomics
DIA-NN 1.9 implements Slice-PASEF as well as features preliminary support for midia-PASEF and Synchro-PASEF.

Quantification
DIA-NN 1.9 features a second-generation QuantUMS module, wherein quantities are optimised with machine learning and statistically-justified accuracy estimates are available for individual quantities.

Visualisation
This has been the most often requested feature since the conception of DIA-NN. Now supported via either Skyline integration or via a dedicated DIA-NN Viewer.

General performance
Better identification numbers and stricter control of false discoveries, along with extensive options to tailor the identification and quantification confidence control to a specific experiment.

Speed and code quality
DIA-NN has been overhauled to match the modern coding practices using C++20, with a focus on efficient memory use and better multithreading. DIA-NN 1.9 features code optimisations which yield roughly 1.3x-2x speed gains for library-free search. Large predicted libraries (tens of millions of precursors) are now often 10x+ quicker to generate.

Timeline. This is a Windows release of DIA-NN 1.9, Linux support is to follow shortly. Further, we have a number of features and performance improvements under active development and will likely release a series of updates implementing these in the near future. We will also be grateful for any feedback on DIA-NN 1.9 as well as feature requests, which we will do our best to implement.

Future roadmap

DIA-NN is under active development, towards (i) enabling new technologies as well as (ii) achieving better performance for existing workflows. In the latter case, we have the following planned or under development:

While DIA-NN performs remarkably well in library-free setting already, there is a room for even better performance. Specifically, DIA-NN will in the future implement experiment-specific transfer learning, similar to the concept recently introduced in AlphaDIA.
DIA-NN already implements a low RAM usage mode, which restricts the amount of system memory it needs for its search. Currently, the biggest factor in RAM usage by DIA-NN in the lib-free mode is the storage of the predicted library in memory, especially when using multiplexing. DIA-NN will in the future implement a different format for internal library storage, with fold-change lower memory requirements.
The ultra-fast mode in DIA-NN is great for preliminary analyses (up to 5x faster), although it does sacrifice identification performance, as it implements a spectrum centric-like search strategy, which is inherently less sensitive. We have a different fast search mode in works, which will have minimum performance trade offs.
We have a number of algorithms in works, which will fully explore the potential (in terms of both identification and quantification performance) of Slice-PASEF, midia- and Synchro-PASEF, Scanning SWATH and Orbitrap Astral. While the current algorithms perform remarkably well already, showing the potential of these technologies, we work on specific improvements that will further boost the performance.
DIA-NN will in the future incorporate a module for detailed QC analysis of DIA runs.
Together with our collaborators, we are developing some exciting new workflows combining different tools.

Assets 4

57 Join discussion

15 Apr 13:36

vdemichev

1.8.1

719be81

DIA-NN 1.8.1

Multiplexing support
Improved dia-PASEF performance
'Peak height' quantification mode
Fixed handling of cases when a spectral library annotates the precursor ion as fragment or includes y1/b1 fragments
Stability issues under Linux solved

Assets 5

28 Jun 16:31

vdemichev

1.8

63b9b33

DIA-NN 1.8

A major improvement in terms of both performance and functionality. Some key changes:

More peptide & protein IDs.
Stringent control of global precursor and protein FDR, validated on thousands of samples.
Transformatively better library-free analysis mode - in many cases you will no longer need a spectral library, even for the most challenging samples.
Full support for PTM confidence scoring and site localisation. Validated workflows for phosphoproteomics and ubiquitinomics.
DIA-NN is now considerably faster and requires less memory.
Fully functional Linux builds - now with support for deep learning & native support for dia-PASEF data.

This release is also considerably better in some aspects than the beta versions we have been sharing.

Of note, we received multiple feature requests. Unfortunately, only a few could be implemented in the limited amount of time we had, as this release had to be scheduled for today due to the need to reference it in a publication.

Multiple papers describing the new DIA-NN features implemented in this version are to appear in the near future.

Assets 5

28 Jun 16:21

vdemichev

1.7.18

63b9b33

DIA-NN 1.7.18 development build Pre-release

Pre-release

Update README.md

Assets 3

28 Jun 16:20

vdemichev

1.7.17

63b9b33

DIA-NN 1.7.17 development build Pre-release

Pre-release

Update README.md

Assets 3

28 Mar 17:29

vdemichev

1.7.16

38db564

DIA-NN 1.7.16 development build Pre-release

Pre-release

Significant improvement of multiple algorithms & functionality changes since 1.7.12, please contact by email (provided in the manual) for details on the new functionality. Minor improvement of identification performance in comparison to 1.7.15. An updated manual describing all the new features will be available in the future.

Lack of manual is the primary reason why this version is marked as a 'development build' and not a 'release'.

Assets 3

08 Mar 12:21

vdemichev

1.7.15

c652420

DIA-NN 1.7.15 development build Pre-release

Pre-release

Significant improvement of multiple algorithms & functionality changes, please contact by email (provided in the manual) for details on the new functionality. An updated manual describing all the new features will be available in the future.

Assets 3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Key Breakthroughs

Main advances

Improved algorithms

Reporting and documentation

Future roadmap

Get DIA-NN

DIA-NN 1.9 release summary

Future roadmap

Releases: vdemichev/DiaNN

DIA-NN 2.0

Key Breakthroughs

Main advances

Improved algorithms

Reporting and documentation

Future roadmap

Get DIA-NN

DIA-NN 1.9.2

DIA-NN 1.9.1

DIA-NN 1.9

DIA-NN 1.9 release summary

Future roadmap

DIA-NN 1.8.1

DIA-NN 1.8

DIA-NN 1.7.18 development build

DIA-NN 1.7.17 development build

DIA-NN 1.7.16 development build

DIA-NN 1.7.15 development build