For the further characterization of a given molecule in a targeted lipidomics workflow for the validation and quantification of lipids, specific precursor m/z values and select potential fragment m/z values (transitions in an inclusion list) are tracked using robust and comparably inexpensive triple-quad MS instruments in selective reaction monitoring (SRM) mode, which allows for the identification and quantitation of lipids on the class level. Orbitrap-type or time-of-flight (TOF) MS instruments with a higher mass resolution and the ability to perform a full-scan acquisition in parallel reaction monitoring (PRM) mode for selected precursors, measuring all fragment ions simultaneously, can be used for targeted lipidomics to achieve a deeper MS fragment coverage, allowing for species or subspecies identification.
In untargeted lipidomics workflows for discovery applications, no previous inclusion list is provided, thus requiring MS instruments that can operate in a data-dependent acquisition (DDA) or data-independent acquisition (DIA) mode to obtain a full-scan precursor and fragment mass spectra of either top-k m/z signals with the highest intensity or all ions contained in predefined m/z windows. Such experiments are often performed on instruments with high mass resolutions to further reduce ambiguities caused by isobaric lipids.
2. Software for Lipid Identification from Mass Spectrometry
The recent development of lipid identification tools has aimed to propel the rapidly emerging field of lipidomics by improving the quality and performance of applied algorithms, while integrating novel separation techniques and high-resolution mass spectrometers.
WResearche
rs reviewed a total of 31 openly available software tools for lipidomics data processing and identification that were published between 2006 and the end of 2021.
WeResearchers evaluate the usability of common data formats and, specifically, of PSI standard data formats as either input or output formats and their support for at least one of the lipidomics workflows (see
Table 1 for reference).
Table 1. Overview of software for lipid identification from mass spectrometry. Abbreviations: U: Untargeted, T: Targeted, C: Chromatography, CE: Capillary Electrophoresis, IM: Ion Mobility, DI: Direct Infusion (Shotgun), I: Imaging. $: targeted includes Selected Reaction and Multiple Reaction Monitoring (MRM), untargeted includes DDA and DIA approaches. *: Only the most important ones relevant to this re
vise
warch. All tools use some form of configuration file format, e.g., text-based (TXT) or other formats for libraries or fragmentation rules. Workflow assignment designates the primary workflow a tool was designed for and this was stated by the authors; others may be available.
WeResearchers use direct infusion as a more generic synonym for what is usually referred to as ‘shotgun lipidomics’. Comma-separated values (CSV) is a tabular, spreadsheet-like format. If tab characters are used as separators, the format is TSV. Hypertext markup language (HTML) is a format viewable with an internet browser. XLSX: MS office XML-based spreadsheet format. MSP: NIST mass spectral library format. MGF: Mascot Generic Format. BLIB: Binary mass spectral library format. PDF: Portable Document Format. #: rule-based validation often includes spectral scores, ratios and thresholds, scores denote spectral similarity functions, such as the commonly used dot product/cosine variants. Remarks: (1) The software is no longer available. (2) Lipid class separation chromatography, e.g., HILIC or supercritical fluid chromatography. (3) XCMS input recommended, LIPID MAPS class assignment of suspect ions. (3) Software is provided as a web application without further information. (4) Supports phospholipids only. (5) XCMS input recommended, LIPID MAPS class assignment of suspect ions. (6) After release 3.0, LipidMatch is available as LipidMatch Flow (latest version 3.5, but without source code). (7) Supports oxidized phospholipids only. (8) Identification and quantification use other tools’ methods. (9) The source code is provided for download, but no code license is defined.
Workflow $ |
Name |
Handling |
MS * |
Identification # |
Quant |
Input |
Output |
Last Release |
Open-Source |
License |
Programming Language |
T |
LIMSA |
C, DI |
MS | 1 | , MS | 2 |
Compound/Fragment library |
yes |
XLSX, CSV, HTML |
NA |
2006 |
NA (1) |
GPL v3 |
C++, VBA, Excel |
T |
LipidomeDB |
DI, C |
MS | 1 | , MS | 2 |
m | / | z | Library + Transitions + rule-based |
yes |
XLSX |
XLSX, HTML |
2019 |
no |
NA |
Java |
T |
LipidQuant |
C (2) |
MS | 1 |
m | / | z | library + rule-based |
yes |
TXT |
XLSX |
2021 |
yes |
CC-BY 4 |
VBA, Excel |
U |
ALEX and ALEX 123 |
DI |
MS | 1 | , MS | 2 | , MS | 3 |
Manual |
no |
manual input of parameters |
HTML |
2017 |
no |
NA |
NA (3) |
U |
Greazy (4) |
C, DI |
MS | 1 | , MS | 2 |
Fragment/Spectral Library + score |
no |
vendor, mzML |
mzTab (via LipidLama) |
2022 |
yes |
Apache v2 |
C# |
U |
LDA2 |
C |
MS | 1 | , MS | 2 |
Rule-based |
yes |
mzML, TXT |
XLSX, mzTab-M |
2021 |
yes |
GPL v3 |
Java |
U |
LipidBlast |
C |
MS | 1 | , MS | 2 |
Spectral Library + score |
no |
MSP, MGF, XLSX |
MGF, XLSX |
2014 |
yes |
CC-BY |
EXCEL |
U |
LipiDex |
C |
MS | 1 | , MS | 2 |
Spectral Library + rule-based |
yes |
MGF, mzXML, CSV |
CSV |
2018 |
yes |
MIT |
Java |
U |
LipidFinder |
C |
MS | 1 |
Rule-based, LMSD |
no |
CSV, JSON (5) |
PDF |
2021 |
yes |
MIT |
Python |
U |
LipidHunter (4) |
C, DI |
MS | 1 | , MS | 2 |
Rule-based |
yes |
mzML, XLSX, TXT |
XLSX, HTML, TXT |
2020 |
yes |
GPL v2, Proprietary |
Python |
U |
LipidIMMS |
C, IM |
MS | 1 | + CCS, MS | 2 |
CCS Library + Spectral Library + score |
no |
MSP, MGF |
CSV, HTML |
2020 |
no |
NA |
NA (3) |
U |
LipidMatch (6) |
C, I, DI |
MS | 1 | , MS | 2 | , MSE/DIA |
Compound/Fragment library + rule-based |
yes |
CSV, MS2 (ProteoWizard) |
CSV |
2020 |
yes |
CC BY 4.0 |
R |
U |
LipidMiner |
C |
MS | 1 | , MS | 2 |
Compound/Fragment library + rule-based |
yes |
raw |
XLSX, CSV |
2014 |
no |
NA |
C#, Python |
U |
LipidMS |
C |
MS | 1 | , MS | 2 | , MSE/DIA |
Compound/Fragment library + rule-based |
yes |
mzXML, CSV |
CSV |
2022 |
yes |
GPL v3 |
R |
U |
Lipid-Pro |
C |
MSE/DIA |
Compound/Fragment library |
yes |
CSV |
XLSX, TXT |
2015 |
no |
Proprietary |
C# |
U |
LipidXplorer |
DI |
MS | 1 | , MS | 2 | , MS | 3 |
Rule based |
no |
mzML (MS | 1 | + MS | 2 | ) |
CSV, HTML |
2019 |
yes |
GPL v2 |
Python |
U |
LiPydomics |
C, IM |
MS | 1 |
CCS Library + | m | / | z | Library + HILIC RT Library + rule-based |
yes |
CSV |
XLSX |
2021 |
yes |
MIT |
Python |
U |
LIQUID |
C |
MS | 1 | , MS | 2 |
Spectral Library + rule-based |
yes |
RAW, mzML |
TSV, mzTab, MSP |
2021 |
yes |
Apache v2 |
C# |
U |
LOBSTAHS |
C |
MS | 1 |
Spectral Library + rule-based |
yes |
mzML, mzXML, mzData, CSV |
XLSX, CSV |
2021 |
yes |
GPL v3 |
R |
U |
LPPTiger (7) |
C |
MS | 1 | , MS | 2 |
Spectral Library + score |
yes |
mzML, XLSX, TXT |
XLSX, HTML |
2021 |
yes |
GPL v2, Proprietary |
Python |
U |
MassPix |
I |
MS | 1 |
m | / | z | Library + rule-based |
no |
imzML |
CSV |
2017 |
yes |
NA |
R |
U |
MS-DIAL 4 |
C, CE, IM |
MS | 1 | , MS | 2 | , MSE/DIA |
Spectral Library + rule-based |
yes |
vendor, mzML |
CSV, mzTab-M, XLSX |
2022 |
yes |
GPL v3 |
C# |
U |
MZmine 2 |
C |
MS | 1 | , MS | 2 |
Spectral Library + rule-based |
yes |
vendor, mzML, mzXML, mzData, CSV, mzTab, XML |
CSV, mzTab, XML |
2019 |
yes |
GPL v2 |
Java |
U |
XCMS |
C |
MS | 1 | , MS | 2 |
Spectral Library + score |
yes |
mzML, mzXML, netCDF |
CSV |
2021 |
yes |
GPL v2 |
R, C |
T + U |
LipidCreator and Skyline |
C |
MS | 1 | , MS | 2 | , MSE/DIA |
Fragment/Spectral Library + score (8) |
yes (8) |
vendor, mzML (MS | 1 | + MS | 2 | ) |
XLSX, CSV, BLIB |
2021 |
yes |
MIT |
C# |
T + U |
LipidPioneer |
C |
MS | 1 | , MS | 2 |
Compound/ | m | / | z | Library (8) |
yes (8) |
XLSX |
XLSX |
2017 |
yes (9) |
NA |
VBA, Excel |
T + U |
LipidQA |
DI |
MS | 1 | , MS | 2 |
Spectral Library + score |
yes |
vendor (Thermo, Waters) |
CSV |
2007 |
NA (1) |
NA |
Visual C++ |
T + U |
LipoStar |
C, IM |
MS | 1 | , MS | 2 | , MSE/DIA |
Compound/Fragment library + rule-based validation |
yes |
vendor |
CSV |
2022 |
no |
Proprietary |
C# |
T + U |
LipoStarMSI |
DI, I |
MS | 1 | , MS | 2 |
Spectral Library + rule based |
yes |
vendor (Bruker, Waters), imzML |
CSV |
2020 |
no |
Proprietary |
C# |
T + U |
SmartPeak |
C |
MS | 1 | , MS | 2 |
Transitions + rule-based |
yes |
mzML, CSV |
mzTab, XML, CSV |
2022 |
yes |
MIT |
C++, Python |
T + U |
Smfinder |
C |
MS | 1 | , MS | 2 |
Spectral Library + score |
yes |
mzML, mzXML |
XLSX, TXT |
2020 |
yes (9) |
NA |
Python, R, C++ |
WResearche
rs categorized the tools by supported workflow (targeted, untargeted or both), sample handling (separation, e.g., chromatography, ion mobility, direct infusion, imaging), MS level, summarizing targeted, selected ions under MS
1, MS
2 for shotgun and DDA approaches and MSE/DIA for data-independent approaches, based on their own claims in their primary publications or documentation.
Concerning lipid identification,
wresearche
rs broadly distinguish between tools that use either a rule-based or a library-based identification approach.
Rule-based tools must describe at least precursor ion
m/z, MS
2 fragments and (relative) fragment intensity ranges for lipid class, species or subspecies identification. In order to reduce the chance for false-positive identifications, these approaches often also apply further validation rules, such as fragment signal intensity ratios that must fall within certain bounds. However, these rule-based approaches can be customized to also allow for identification on a more precise lipid structure level if the necessary data is available. In principle, these approaches are very flexible and allow for the query of spectra for certain patterns that are indicative of specific lipid species. This makes them applicable to targeted, as well as untargeted, analysis.
Library-based approaches use either in-silico generated MS
2 spectra for lipids derived from their structural representation or experimentally acquired and post-processed spectra. To assign a putative identity to measured lipid mass spectra, a variant of the dot product score or other related vector scores is often used
[41,42][3][4].
WResearche
rs further indicate whether tools support quantitative output, such as intensities, areas, relative or absolute quantities or if they only support qualitative lipid identification output. For these tools to be included in larger processing workflows, the supported data formats for input and output are crucial. In the mass spectrometry and lipidomics field specifically,
weresearchers can distinguish between text (human readable) and binary file formats. The latter are often the raw data vendor formats, but can also include local database files, such as the blib format for mass spectral libraries or the common sqlite database format. Within text-based formats,
weresearchers can distinguish structured ones that follow a specific schema for MS data, such as the Mascot Generic Format (MGF), NIST Mass Spectrum format (MSP), MS2
[43][5] or mzTab(-M) and semi-structured ones, such as CSV, JSON or XLSX, where the latter is a compressed XML format. XML-based formats are well-adapted to be machine readable and validatable and are used in the PSI format mzML, as well as its predecessors, mzXML
[44][6] and mzData
[45][7]. TXT formats are generally only weakly structured but remain human-readable.
Maintenance, accessibility and reusability are important factors in being able to create and maintain reproducible processing pipelines from openly available tools.
WResearche
rs therefore also captured the date of the last release for each tool with a granularity of one year and whether it is available under an explicit open-source license, and if so, under which one specifically. This is also an important aspect for the original authors of a tool, as sustainable development and maintenance of bioinformatics software through a lack of continued funding is still an issue. Open access to the software can help in building up a community around it, where maintenance and further development can be shared between different stakeholders.
WeResearchers did not specifically record whether a tool’s source code is available via a source code repository platform such as GitHub or GitLab, but generally recommend that for open-source software, since these platforms will make the source code available for the foreseeable future.
Lastly,
wresearche
rs list the programming languages that were used to develop the tool. This can have an impact on operating system platform independence and may make reuse of the software easier for certain user demographics, e.g., MS EXCEL and VBA macros may simplify usage by non-bioinformaticians but have clear limits to the Windows platform and limit integrability into non-UI driven workflows.
2.1. Targeted Workflow
LIMSA
[46,47][8][9] supports data from both LC separation, as well as direct infusion workflows. In a first step, vendor data needs to be converted to the NetCDF format using the authors proprietary but free of charge tool, SECD, which is then used to export MS data to LIMSA via EXCEL. LIMSA itself is implemented in C++ as an EXCEL add-in and provides peak finding, identification, isotopic correction and absolute quantification based on calibration lines and labeled internal standards. Unfortunately,
wresearche
rs were not able to find a publicly available version of the software.
LipidomeDB
[48,49][10][11] is a web application for the processing of direct infusion and differential ion mobility MS lipidomics data. It requires a user login but is otherwise free to use. LipidomeDB supports isotopic correction and absolute quantification via class-specific labeled lipid standards and linear calibration curves. Input data needs to be provided in XLSX format and can be exported after identification and quantification as XLSX and HTML.
LipidQuant
[50][12] is a tool for quantitative lipidomics in lipid class separation workflows, such as HILIC or SFC coupled to MS, based on EXCEL and Visual Basic for Applications (VBA). It supports input of
m/z and sample-wise quantity data in TXT or generally tabular formats from vendor software. It includes an extensible built-in database of lipid species, organized by lipid class, and performs type II isotopic correction and absolute quantification using class-specific, heavy labeled (deuterated) internal lipid standards. Output is available from the XLSX worksheet.
2.2. Untargeted Workflow
ALEX 123
[51][13] is an online database that provides comprehensive fragmentation information on 430,000 lipid molecules from 47 lipid classes across five different lipid categories. Output of ALEX 123 is provided in HTML format. In combination with LDA2, it was used for lipid and lipid fragment identification in LC-MS/MS data. Alternatively, ALEX
[52][14] can be used for lipid identification on a species level from high-resolution FTMS data. The source codes of ALEX and ALEX 123 are not publicly available.
Greazy
[53][15] is well-integrated with the ProteoWizard tool suite and supports both chromatography-MS as well as DI data. It generates a search space of phospholipids and theoretical MS2 spectra based on user input. Experimental MS2 spectra are searched against the phospholipids in the search space with adjustable precursor mass tolerance. The match score is computed based on a combination of hypergeometric distribution and intensity score, considering the number of observed fragments for each lipid. The lipid spectrum matches are filtered based on density estimation and the hits above the score threshold are reported in mzTab 1.0 format.
Lipid Data Analyzer 2 (LDA2)
[52,53][14][15] supports untargeted LC-MS/MS lipidomics workflows and is implemented in JAVA. It accepts the following input formats for MS data: raw, .d, wiff, chrom and mzXML. It requires additional quantitation files (XLSX) with lipid class/species to mass/adduct mass association and additional expected RTs for each experiment. In LDA2, custom platform and ionization energy-specific fragmentation rule sets for lipid class and scan species level fragment identification can be defined. Identification and quantification results are stored in XLSX, CSV, mzTab 1.0 and most recently, mzTab-M 2.0.
LipidBlast
[54,55,56][16][17][18] is a suite of XLSX/Visual Basic for Applications (VBA) macros that can generate in-silico tandem MS libraries for lipid identification with other tools, such as NIST’s MS Search application. Input formats are MSP, MGF and XLSX, while output can be generated in MGF and XLSX formats. It is not actively developed any longer, but its libraries have been integrated into MS-DIAL.
LipidDex
[57][19] is also implemented in JAVA. It uses in-silico fragmentation templates and lipid-optimized MS
2 spectral matching to identify and track lipid species in LC-MS/MS experiments. It can calculate peak purity and determine co-isolation and co-elution of isobaric lipids and is able to remove ionization artifacts. It reads data in MGF or mzXML formats and saves identification results in CSV tables.
LipidFinder
[58,59,60][20][21][22] is a Python tool and web application available from the LIPID MAPS website that supports untargeted identification of lipids in LC-MS data, using XCMS for initial feature finding and custom filter and post-processing steps specifically tailored to lipidomics. Input formats are those that are also supported by XCMS, but specifically CSV and JSON, to transfer feature data and configuration settings to the application. LipidFinder supports the generation of reports in PDF, XLSX and CSV formats.
LipidHunter
[61][23] identifications are based on (glycero-)phospholipidomics MS
2 spectra measured by RPLC-MS/MS or direct infusion methods, integrating with LIPID MAPS for bulk lipid search. It supports mzML as an input format from LC-MS/MS and data-dependent shotgun acquisitions. Input files need to be split into an MS
1-only file, covering survey scans for faster processing, and a complete file that contains MS
1 and MS
2 scans. LipidHunter extracts fragment ions based on a user-definable configuration and links MS
2 fragment information to parent ions that are identified against the LIPID MAPS database. It finally performs a lipid species assignment based on their product ions and additional rules. LipidHunter reports quantification and identification results in HTML, CSV and XLSX.
LipidIMMS Analyzer
[62,63][24][25] is a web application for lipid identification in chromatography ion mobility workflows. It uses an internal database of MS
1, CCS, RT and MS
2 information and applies a weighted composite scoring to assign the final identification. It accepts data in MSP and MGF formats and supports output in CSV and HTML.
LipidMatch
[64][26] supports LC-MS, imaging and direct infusion workflows based on an extensive in-silico MS
2 fragmentation library including 56 different lipid types. It uses a rule-based approach for lipid identification against the precursor and fragment
m/z values, including definable adducts, and it is implemented in R. DDA as well as DIA data are supported through peak picking with tools such as MZmine or XCMS. LipidMatch accepts input in CSV (feature tables) or MS
2 (MS/MS data) format and provides annotated and identified results down to the subspecies fatty acyl level. It exports identification results in CSV format. LipidMatch Flow converts vendor file formats with msConvert on the fly.
LipidMiner
[65][27] supports LC-MS/MS DDA data and uses the LIPID MAPS structure database as its library for lipid identification using a rule-based approach. It is implemented in Python and C# and provides input from Thermo raw files. Output is provided in XLSX and CSV formats.
LipidMS
[51][13] is an R package that supports the processing of high-resolution, DIA-MS data. Due to the missing direct relation between the precursor and fragments in DIA, the package applies a score to assess the co-elution of both for grouping, based on fragment and ion intensity rules that allow annotation on species, molecular subspecies (fatty acyl) and structural species (FA position) level. Input may be provided in mzXML or CSV. Output is available as R objects, which can be easily converted and exported into CSV and other tabular formats.
Lipid-Pro
[66][28] is another tool that supports DIA LC-MS/MS data. Implemented in C#, it uses a lipid compound and fragment library and applies matching rules to identify precursor fragment associations based on retention time-aligned, pre-processed data. Input can be provided in CSV format, while output is available as XLSX or TXT.
LipidXplorer
[67,68][29][30] supports DI-MS lipidomics workflows regardless of the lipid category, implemented in Python. It transfers filtered and averaged representative spectra (from all scans based on the measurement settings of the data) into a master scan. The master scan is then searched against the fragmentation rules per class and per mode as provided by query scripts written in Molecular Fragmentation Query Language (MFQL), which is inspired by the SQL database query language. The tool currently supports Thermo raw and mzML files as well as text file-based import (CSV for MS
1 and DTA for MS2, in v1.2.7) as input files and generates comma-separated output files. The output file can be programmed by MFQL and usually reports lipid species found with mass, chemical formula, identification error, lipid name, isobaric species, if any, along with precursor and fragment ion intensities per sample (CSV).
LiPydomics
[69][31] is a Python tool for HILIC ion mobility MS lipidomics data analysis. It uses a custom experimental database with
m/z and CCS values for 45 lipid classes and HILIC retention times for 23 lipid classes. CCS prediction and HILIC retention time prediction for lipids that are not contained in the experimental database are realized by applying machine learning to the experimental database reference values. Identification is performed using a rule-based approach on
m/z, RT and CCS values. LiPydomics accepts CSV files as input and provides results in XLSX format.
LIQUID
[70][32] supports identification of lipids from LC-MS/MS experiments with a customizable library and adaptable scoring model that includes quartiles of fragment intensities. The library covers over 30,000 lipid targets in nine distinct lipid categories, 29 lipid classes and 85 subclasses, sourced from LIPID MAPS and extended with additional lipids. It is implemented in C# and supports input in Thermo Fisher raw format and mzML. Processing results can be exported in CSV, mzTab or MSP formats.
LOBSTAHS
[71][33] is implemented in R for the identification of lipids, oxidized lipids and oxylipin biomarkers in LC-MS data. It uses XCMS and the R/Bioconductor package CAMERA
[72][34] for feature detection and aggregation and validates potential lipid features against an internal
m/z library of lipid species adducts using a rule-based approach based on adduct order of intensity. Input is therefore supported in all formats that XCMS supports. Output can be exported in XLSX or CSV formats.
For oxidized phospholipids, LPPTiger
[60][22] is an option for data-dependent LC-MS/MS data. It is implemented in Python and uses in-silico generated spectral libraries together with a composite score based on individual similarity, rank, fingerprint, isotope matching and specificity scores. It reads data in mzML, XLSX and TXT formats as input (MSP for the library format) and outputs as XLSX and HTML.
MassPix
[73][35] is an R library for the analysis of imaging-MS lipidomics data. It uses an MS
1 m/z library for rule-based identification. It reads imzML format as input and annotates deisotoped
m/z values against its internal generated library. Identified results can be exported in CSV format.
MS-DIAL 4
[74[36][37],
75], written in JAVA, supports chromatography, CE and ion mobility workflows. It applies a spectral library search approach, based on a MS fragment library of 177 lipid subclasses. MS-DIAL 4 performs peak picking, alignment annotation and quantification. Identification combines scoring and a rule-based approach that is guided by a decision tree and provides different levels of confidence. As input formats, multiple vendor formats and mzML are supported, while outputs can be written in CSV, XLSX and mzTab-M. MS-DIAL also supports retention time prediction and offers comprehensive visualizations.
MZmine 2
[76][38] is a modular software for untargeted, chromatography-based metabolomics, with support for lipid species identification using spectral libraries and rules for annotation. It is implemented in JAVA and offers to read input from a variety of vendor formats as well as from open formats as input and it is also able to export identification and intensity data in common spreadsheet and tabular formats and supports mzTab for reading and writing. The upcoming MZmine 3 will also support mzTab-M.
XCMS
[77,78][39][40] is a generic R/Bioconductor library for mass spectrometry feature finding and grouping and has no dedicated support for lipid identification. It uses a spectral library-based approach for feature identification, but other packages may provide other functionality more tailored for lipids. XCMS supports LC-MS/MS data in mzML, mzXML and netCDF formats and outputs feature tables in CSV, XLSX or other formats supported by the R ecosystem.
2.3. Targeted and Untargeted Workflow
The final batch of tools support the analysis of targeted and semi-targeted or untargeted lipidomics data.
LipidCreator
[79[41][42],
80], together with Skyline
[81][43], is primarily designed for targeted lipidomics analysis, but through Skyline’s support for DIA analysis, can also be applied for untargeted workflows. LipidCreator is used to create transition lists and spectral libraries for more than 60 lipid classes, either using predefined libraries for common species and tissues or by manual selection of lipid classes, head groups and fatty acyl parameters. Transitions and a spectral library derived from the in-silico transition list can be transferred to Skyline to be used with its peak/transition detection and integration and its spectral matching features. All major vendor formats are supported, as well as mzML for input. Results can be exported in XLSX and CSV formats, while spectral libraries are exported in the open BLIB format.
LipidPioneer
[82][44] is an EXCEL template implemented in VBA supporting more than 60 lipid classes, including oxidized ones. It allows the generation of custom lipid inclusion lists based on sum formulas of adduct masses for use in targeted and untargeted workflows. These can then be used by other software for lipid identification, such as MZmine, MS-DIAL or Greazy, or for Quality Assurance (QA) and Quality Control (QC) applications. LipidPioneer supports export in any format supported by EXCEL, e.g., CSV or EXCEL.
LipidQA
[83][45] supports both targeted and untargeted workflows for DI-MS. It is implemented in Visual C++ and uses a fragment ion and lipid chemical formula database to perform spectral matching for identification. Absolute quantitation with calibration curves is also supported. LipidQA can read data in Thermo and Waters vendor formats and provides its results in CSV format.
LipoStar
[84][46], implemented in C#, supports data from chromatographic separation and ion mobility for DDA and DIA workflows. It uses a compound and fragment library and rule-based validation for the identification of lipids. LipoStar reads vendor MS data and supports the exporting of results in the CSV format.
LipoStarMSI
[85][47] is LipoStar’s sibling software for direct infusion and imaging MS lipidomics. It uses a spectral library and rule-based approach for lipid identification. LipoStarMSI is also implemented in C# and can read vendor formats of Bruker and Waters as well as the open imzML format. Output is exported in CSV format.
SmartPeak
[86][48] uses OpenMS
[87][49] at its core and supports absolute quantitation in targeted and semi–targeted workflows. It is implemented mainly in C++ and implements MRM-specific peak integration and feature selection on top of established OpenMS methods. SmartPeak’s primary input format is mzML, while transitions, parameters and sample sequence information are provided in CSV format. Results can be exported in mzTab, XML and CSV formats.
Smfinder
[88][50] has parts that are implemented in Python and some parts that are implemented in R. It supports targeted, untargeted and 13C labeling workflows. Lipid identification is performed based on plausible sum formulas first, with subsequent validation using a spectral library. The untargeted workflow uses XCMS for feature detection. Smfinder supports mzML and mzXML as input data formats. Results can be exported in XLSX and TXT formats.
Out of the 31 tools for lipid identification
wresearche
rs reviewed, 6 of 31 (>19%) did not provide a release version that could help to ensure reproducibility when authors want to compare their software to those of others. Eight of 31 tools (>25%) had no explicit license defined. Just as many, but not necessarily the same ones, did not provide the source code in an openly accessible way.