Please use this identifier to cite or link to this item: http://dspace.dtu.ac.in:8080/jspui/handle/repository/21784
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSHARMA, DEVANSHI-
dc.date.accessioned2025-07-08T06:13:05Z-
dc.date.available2025-07-08T06:13:05Z-
dc.date.issued2025-05-
dc.identifier.urihttp://dspace.dtu.ac.in:8080/jspui/handle/repository/21784-
dc.description.abstractThe increasing complexity of RNA-seq data requires analysis pipelines that are robust, scalable, and interpretable. LYNX-RNA (Language-augmented Yield for Nextflow based RNA eXpression analysis) is a modular, Nextflow-based workflow that delivers end-to-end RNA-seq analysis—from raw FASTQ files to biological insights—with automation and reproducibility. LYNX-RNA integrates standard tools for quality control, alignment, quantification, and differential gene expression (DGE), along with advanced modules for WGCNA, PPI network modeling, and GO/KEGG enrichment. A key feature is its built-in machine learning module (Random Forest and XGBoost) for predictive biomarker discovery, and an LLM-powered reporting system that generates natural language summaries of results. To identify DEGs, treated ITP patient data (GSE112278) was compared against external healthy controls (GSE251778) using Welch’s t-test and FDR correction. These DEGs were used to train a classifier that achieved a ROC AUC of 0.937, demonstrating high predictive accuracy. Notably, top predicted DEGs such as EPB42, TNS1, and HAGH overlapped with WGCNA derived hub genes, reinforcing biological relevance. The pipeline supports deployment in low-resource environments (≤24 GB RAM), is compatible with Conda, Docker, and HPC systems, and includes a Python-based CLI for user accessibility. We applied LYNX-RNA to a longitudinal ITP dataset spanning control, pre-treatment, and post treatment stages, uncovering dynamic gene signatures and potential immune metabolic biomarkers. LYNX-RNA provides a flexible, automation-ready solution for transcriptome analysis, well-suited for biomarker discovery and translational immunology. In summary, LYNX-RNA bridges key gaps in usability, scalability, and interpretability in transcriptomic workflows. It serves as a versatile, automation-ready platform for both academic research and translational applications in systems immunology, precision medicine, and biomarker discovery.en_US
dc.language.isoenen_US
dc.relation.ispartofseriesTD-7994;-
dc.subjectRNA-SEQ PIPELINEen_US
dc.subjectLYNX-RNAen_US
dc.subjectBIOMARKER DISCOVERYen_US
dc.subjectIMMUNE THROMBOCYTOPENIA (ITP)en_US
dc.subjectGENE EXPRESSION ANALYSISen_US
dc.subjectLARGE LANGUAGE MODEL (LLM)en_US
dc.subjectWORKFLOW AUTOMATIONen_US
dc.subjectTRANSCRIPTOMICSen_US
dc.subjectNATURAL LANGUAGE REPORTINGen_US
dc.subjectWGCNAen_US
dc.subjectXGBOOSTen_US
dc.titleLYNX-RNA: A NEXTFLOW-BASED MODULAR RNA-SEQ AND MACHINE LEARNING PIPELINE FOR BIOMARKER DISCOVERY WITH LLM-SUMMARIZED REPORT GENERATION IN IMMUNE THROMBOCYTOPENIAen_US
dc.typeThesisen_US
Appears in Collections:M.E./M.Tech. Bio Tech

Files in This Item:
File Description SizeFormat 
DEVANSHI SHARMA M.Tech.pdf2.65 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.