<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <title>DSpace Collection:</title>
  <link rel="alternate" href="http://dspace.dtu.ac.in:8080/jspui/handle/123456789/50" />
  <subtitle />
  <id>http://dspace.dtu.ac.in:8080/jspui/handle/123456789/50</id>
  <updated>2026-07-01T04:42:13Z</updated>
  <dc:date>2026-07-01T04:42:13Z</dc:date>
  <entry>
    <title>DL-BASED INTRUSION DETECTION FOR IOMT: A COMPARATIVE ANALYSIS OF EMBED-NET, CONV-NET-SVM, AND DEEP-SVM-NET</title>
    <link rel="alternate" href="http://dspace.dtu.ac.in:8080/jspui/handle/repository/22951" />
    <author>
      <name>KUMAR, ASHISH</name>
    </author>
    <author>
      <name>Kumar, Shailender (SUPERVISOR)</name>
    </author>
    <id>http://dspace.dtu.ac.in:8080/jspui/handle/repository/22951</id>
    <updated>2026-06-25T05:09:41Z</updated>
    <published>2026-06-01T00:00:00Z</published>
    <summary type="text">Title: DL-BASED INTRUSION DETECTION FOR IOMT: A COMPARATIVE ANALYSIS OF EMBED-NET, CONV-NET-SVM, AND DEEP-SVM-NET
Authors: KUMAR, ASHISH; Kumar, Shailender (SUPERVISOR)
Abstract: The swift development of IoMT technology has led to the ability of continuously track&#xD;
ing patients, automating clinical operations, and facilitating interaction between connected&#xD;
devices in contemporary healthcare settings. While these interrelated processes greatly im&#xD;
prove medical efficiency and accessibility, there are major cybersecurity risks present as&#xD;
well. Typically, medical devices have low computing power, run outdated firmware, and&#xD;
lack sophisticated protection systems; therefore, they become easy targets for diverse cy&#xD;
ber attacks such as DoS [7], reconnaissance, spoofing, and data manipulation attacks. It&#xD;
turns out to be hard to cope with dynamic and sophisticated IoMT traffic using common&#xD;
IDS technologies [8], like signature-based IDS and traditional machine learning techniques.&#xD;
The limitations of the existing systems in terms of poor adaptiveness, simplified feature&#xD;
representation, and high levels of false negatives prove the necessity of further research in&#xD;
creating smart IDS techniques.&#xD;
This study introduces a 3-part deep learning solution specifically for an IoMT IDS&#xD;
Embed-Net, Conv-Net-SVM, and Deep-SVM-Net. Each model is a solution to specific se&#xD;
curity problems with IoMT environment. Embed-Net is an approach that embeds device&#xD;
identifiers, flags, and protocol attributes into dense models that are learned from the data&#xD;
and enables these models to represent subtle interactions between heterogeneous network&#xD;
traffic. The Conv-Net-SVM performed convolutional feature extraction followed by a Sup&#xD;
port Vector Machine (SVM) classifier, fusing the advantages of CNN, which is capable of&#xD;
learning the structs in the flow with good margin-based decision boundaries of SVM. Deep&#xD;
SVM-Net is an output layer based on SVM that exploits a novel approach to deep neural&#xD;
net that functions well in separating benign and malicious traffic and simultaneously has a&#xD;
low computational load for resource constrained medical devices.&#xD;
Several IoT-related datasets that included the various cyber-attack patterns were first&#xD;
preprocessed for standardization of scaling, transformations to ordinal encoding, feature&#xD;
selection based on covariance, and SMOTE balancing before training and validation were&#xD;
performed on the datasets, each using a cross validation strategy that ensured stable and&#xD;
generalizable models. All three exhibited excellent intrusion detection results with par&#xD;
ticularly good overall accuracy and low false-negative rates from Embed-Net, outstanding&#xD;
capability in learning attack signatures with high spatial correlations from ConvNetSVM,&#xD;
and superior intrusion detection real-time efficiency with highly reliable binary separation&#xD;
from Deep-SVM-Net. The results collectively point to the superiority of the deep learn&#xD;
ing architecture for improving the security of IoMT, which efficiently adapts to mixed-type&#xD;
features, to an imbalanced dataset, and to medical operational constraints.</summary>
    <dc:date>2026-06-01T00:00:00Z</dc:date>
  </entry>
  <entry>
    <title>EXECUTION-AWARE SELF-REFINING TEXT-TO-SQL GENERATION USING LARGE LANGUAGE MODELS</title>
    <link rel="alternate" href="http://dspace.dtu.ac.in:8080/jspui/handle/repository/22950" />
    <author>
      <name>SHARMA, ANKIT</name>
    </author>
    <author>
      <name>KUMAR, VINOD ( SUPERVISOR )</name>
    </author>
    <id>http://dspace.dtu.ac.in:8080/jspui/handle/repository/22950</id>
    <updated>2026-06-25T05:09:31Z</updated>
    <published>2026-06-01T00:00:00Z</published>
    <summary type="text">Title: EXECUTION-AWARE SELF-REFINING TEXT-TO-SQL GENERATION USING LARGE LANGUAGE MODELS
Authors: SHARMA, ANKIT; KUMAR, VINOD ( SUPERVISOR )
Abstract: The increasing need for data-driven systems has resulted in the growth of user-friendly&#xD;
database interaction methods, especially for non-technical users. Natural Language to&#xD;
SQL, also known as Text-to-SQL is hence required for converting natural language queries&#xD;
from users into executable SQL statements, allowing users to fetch data from databases&#xD;
more efficiently. Although, it is a challenging task to generate accurate SQL queries&#xD;
due to the complexity of natural language understanding, schema linking and variety&#xD;
of database structures. Recent progress in Large Language Models (LLMs) have sig&#xD;
nificantly improved the traditional Text-to-SQL methods by enhancing their semantic&#xD;
understanding for generalized SQL generation. This thesis proposes an Execution-Aware&#xD;
Self-Refining Text-to-SQL Generation framework using Large Language Models, where&#xD;
generated SQL queries are refined iteratively through execution feedback and context&#xD;
awareness. This framework integrates prompt engineering, RAG based architectures and&#xD;
execution validation to identify syntax errors, logical disparities and failed query results.&#xD;
The study also presents an overview of benchmark Text-to-SQL datasets such as Spider,&#xD;
WikiSQL, BIRD, and CoSQL representing that execution-aware self-refinement enhances&#xD;
SQL generation performance compared to conventional approaches. Moreover, it high&#xD;
lights challenges related to scalability, explainability and computational efficiency towards&#xD;
making a domain-centric Text-to-SQL systems for a broad range of technical and non&#xD;
technical users.</summary>
    <dc:date>2026-06-01T00:00:00Z</dc:date>
  </entry>
  <entry>
    <title>EXPLAINABLE TEMPORAL TRANSFORMER FOR  DISEASE PROGRESSION PREDICTION USING  ATTENTION AND SHAP ANALYSIS</title>
    <link rel="alternate" href="http://dspace.dtu.ac.in:8080/jspui/handle/repository/22947" />
    <author>
      <name>CHHIBBER, AJITESH</name>
    </author>
    <author>
      <name>KUMAR, VINOD ( SUPERVISOR )</name>
    </author>
    <id>http://dspace.dtu.ac.in:8080/jspui/handle/repository/22947</id>
    <updated>2026-06-25T05:09:07Z</updated>
    <published>2026-05-01T00:00:00Z</published>
    <summary type="text">Title: EXPLAINABLE TEMPORAL TRANSFORMER FOR  DISEASE PROGRESSION PREDICTION USING  ATTENTION AND SHAP ANALYSIS
Authors: CHHIBBER, AJITESH; KUMAR, VINOD ( SUPERVISOR )
Abstract: Parkinson’s disease (PD) is a progressive neurodegenerative disorder characterized by&#xD;
the gradual deterioration of motor functions, significantly affecting the quality of life of&#xD;
patients. Accurate prediction of disease progression is essential for timely clinical inter&#xD;
vention, treatment planning, and personalized patient management. Traditional machine&#xD;
learning approaches often treat clinical observations as independent samples, limiting&#xD;
their ability to capture the temporal dynamics inherent in longitudinal patient data. Al&#xD;
though deep learning models such as Long Short-Term Memory (LSTM) networks have&#xD;
demonstrated improved temporal modeling capabilities, they may struggle to effectively&#xD;
capture long-range dependencies present in disease progression trajectories.&#xD;
This thesis presents an Explainable Feature-Aware Enhanced Temporal Transformer (FAETT)&#xD;
framework for predicting Parkinson’s disease progression using longitudinal voice-based&#xD;
biomarkers from the Parkinson Telemonitoring dataset obtained from the UCI Machine&#xD;
Learning Repository. The proposed framework integrates temporal sequence modeling&#xD;
with self-attention mechanisms to learn complex relationships across historical patient ob&#xD;
servations. A comprehensive preprocessing pipeline involving data cleaning, feature scal&#xD;
ing, temporal sequence generation, and patient-wise train-test partitioning is employed to&#xD;
ensure robust model development and unbiased evaluation.&#xD;
To assess the effectiveness of the proposed approach, FAETT is compared against three&#xD;
baseline models: Random Forest (RF), Long Short-Term Memory (LSTM), and a stan&#xD;
dard Transformer architecture. Model performance is evaluated using Mean Absolute&#xD;
Error (MAE), Root Mean Squared Error (RMSE), and the coefficient of determination&#xD;
(R²). Experimental results demonstrate that the proposed FAETT model achieves supe&#xD;
rior predictive performance, attaining an MAE of 1.2688, RMSE of 1.6457, and R² score&#xD;
of 0.9754, outperforming the baseline approaches. The findings indicate that the incor&#xD;
poration of feature-aware temporal attention significantly enhances the model’s ability to&#xD;
capture disease progression patterns.&#xD;
iii&#xD;
To address the interpretability requirements of clinical decision-support systems, SHAP&#xD;
(SHapley Additive exPlanations) analysis is integrated into the framework. The explain&#xD;
ability analysis identifies the most influential vocal biomarkers contributing to disease&#xD;
progression prediction and provides transparent insights into model behavior. Correlation&#xD;
analysis, residual diagnostics, and prediction-performance visualizations further validate&#xD;
the robustness and reliability of the proposed framework.&#xD;
The results demonstrate that the proposed FAETT architecture effectively combines pre&#xD;
dictive accuracy with model interpretability, making it a promising tool for explainable&#xD;
disease progression forecasting in Parkinson’s disease. The study highlights the potential&#xD;
of attention-based temporal learning and explainable artificial intelligence techniques for&#xD;
advancing data-driven healthcare applications.</summary>
    <dc:date>2026-05-01T00:00:00Z</dc:date>
  </entry>
  <entry>
    <title>ADAPTIVE CONTEXT COMPRESSION TECHNIQUES FOR  EFFICIENT LARGE LANGUAGE MODEL INFERENCE:  A QUERY-COMPLEXITY-AWARE APPROACH</title>
    <link rel="alternate" href="http://dspace.dtu.ac.in:8080/jspui/handle/repository/22923" />
    <author>
      <name>HAQ, INJAMAMUL</name>
    </author>
    <author>
      <name>Bansal, Nipun (SUPERVISOR)</name>
    </author>
    <id>http://dspace.dtu.ac.in:8080/jspui/handle/repository/22923</id>
    <updated>2026-06-25T04:57:03Z</updated>
    <published>2025-05-01T00:00:00Z</published>
    <summary type="text">Title: ADAPTIVE CONTEXT COMPRESSION TECHNIQUES FOR  EFFICIENT LARGE LANGUAGE MODEL INFERENCE:  A QUERY-COMPLEXITY-AWARE APPROACH
Authors: HAQ, INJAMAMUL; Bansal, Nipun (SUPERVISOR)
Abstract: The exponential growth of large-scale neural language models has opened up transforma&#xD;
tive possibilities across a broad range of natural language understanding problems. Sys&#xD;
tems built on the Transformer architecture [1] have demonstrated remarkable proficiency &#xD;
on tasks ranging from reading comprehension and open-domain question answering to code &#xD;
synthesis and multi-step reasoning. Yet the self-attention operation sitting at the heart of &#xD;
every Transformer block carries a computational burden that scales quadratically with input &#xD;
length, denoted O(n2). As contexts grow beyond a few thousand tokens — a routine re&#xD;
quirement in legal document analysis, multi-document question answering, and long-form &#xD;
summarisation — this quadratic growth translates into sluggish response times, swollen &#xD;
GPU memory footprints, and deployment costs that place the technology beyond reach for &#xD;
many organisations. &#xD;
Prior work on context compression has made genuine progress by shortening the input &#xD;
before it reaches the model [7, 8]. The unifying flaw across all fourteen methods examined &#xD;
in this thesis, however, is a deceptively simple assumption: that every incoming query &#xD;
deserves the same compression budget. A simple lookup question — “When was BERT &#xD;
published?” — can be answered from a single sentence. A comparative multi-hop question &#xD;
— “How do the pre-training strategies of BERT and GPT-3 differ in their downstream effect &#xD;
on reasoning tasks?” — may need a dozen passages spread across an entire document &#xD;
collection. Treating both with the same 40% removal rate is not an approximation; it is &#xD;
a category error that systematically harms the harder queries and wastes capacity on the &#xD;
easier ones. &#xD;
This thesis introduces the Query-Complexity-Aware Adaptive Context Compression &#xD;
(QCAC) framework, a model-agnostic preprocessing system that measures how demand&#xD;
ing an incoming question is and adjusts the compression ratio accordingly. QCAC com&#xD;
putes a per-query complexity score C(q) ∈ [0, 1] from three lightweight surface features of &#xD;
the question — its normalised length, vocabulary entropy, and the presence of multi-hop &#xD;
syntactic cues — and derives a per-query removal ratio r(q) = rmax − (rmax − rmin) · C(q). &#xD;
Every sentence in the document is then ranked using a weighted combination of three &#xD;
signals extracted from BERT [2]: the attention-magnitude score A(si), the attentional&#xD;
entropy score H(si), and the cosine similarity between the sentence and query embeddings &#xD;
sim(si, q), inspired by dense retrieval research [19]. &#xD;
Experiments conducted on SQuAD v1.1 [22] and HotpotQA [23] (300 samples each) using &#xD;
BERT-base-uncased [2] as the scorer and RoBERTa-base [5] as the downstream QA model &#xD;
on NVIDIA Tesla T4 hardware show that QCAC with weights (α = 0.10, β = 0.30, γ = &#xD;
0.60) achieves 70.4% F1 on SQuAD at 36.8% sentence removal — a +16.2 percentage- &#xD;
iii &#xD;
point improvement over the prior Attn+Entropy baseline at equivalent compression. Adap&#xD;
tive behaviour is statistically validated without per-dataset tuning: HotpotQA queries re&#xD;
ceive a higher mean C(q) score (0.587 vs. 0.539, p &lt; 0.001, t = 11.2) and automatically &#xD;
receive less compression. A seven-variant ablation study reveals that query-semantic simi&#xD;
larity is the strongest individual signal (58.0% F1 in isolation), while attention-only scoring &#xD;
actively underperforms random pruning (44.9%), consistent with the findings of Clark et &#xD;
al. [21]. End-to-end inference latency stands at 89.7 ms per sample — the lowest among &#xD;
all BERT-based compression methods evaluated — and the framework requires no modifi&#xD;
cation or retraining of the downstream language model.</summary>
    <dc:date>2025-05-01T00:00:00Z</dc:date>
  </entry>
</feed>

