<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://purl.org/rss/1.0/" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel rdf:about="http://dspace.dtu.ac.in:8080/jspui/handle/123456789/47">
    <title>DSpace Community:</title>
    <link>http://dspace.dtu.ac.in:8080/jspui/handle/123456789/47</link>
    <description />
    <items>
      <rdf:Seq>
        <rdf:li rdf:resource="http://dspace.dtu.ac.in:8080/jspui/handle/repository/22951" />
        <rdf:li rdf:resource="http://dspace.dtu.ac.in:8080/jspui/handle/repository/22950" />
        <rdf:li rdf:resource="http://dspace.dtu.ac.in:8080/jspui/handle/repository/22947" />
        <rdf:li rdf:resource="http://dspace.dtu.ac.in:8080/jspui/handle/repository/22944" />
      </rdf:Seq>
    </items>
    <dc:date>2026-07-01T03:20:04Z</dc:date>
  </channel>
  <item rdf:about="http://dspace.dtu.ac.in:8080/jspui/handle/repository/22951">
    <title>DL-BASED INTRUSION DETECTION FOR IOMT: A COMPARATIVE ANALYSIS OF EMBED-NET, CONV-NET-SVM, AND DEEP-SVM-NET</title>
    <link>http://dspace.dtu.ac.in:8080/jspui/handle/repository/22951</link>
    <description>Title: DL-BASED INTRUSION DETECTION FOR IOMT: A COMPARATIVE ANALYSIS OF EMBED-NET, CONV-NET-SVM, AND DEEP-SVM-NET
Authors: KUMAR, ASHISH; Kumar, Shailender (SUPERVISOR)
Abstract: The swift development of IoMT technology has led to the ability of continuously track&#xD;
ing patients, automating clinical operations, and facilitating interaction between connected&#xD;
devices in contemporary healthcare settings. While these interrelated processes greatly im&#xD;
prove medical efficiency and accessibility, there are major cybersecurity risks present as&#xD;
well. Typically, medical devices have low computing power, run outdated firmware, and&#xD;
lack sophisticated protection systems; therefore, they become easy targets for diverse cy&#xD;
ber attacks such as DoS [7], reconnaissance, spoofing, and data manipulation attacks. It&#xD;
turns out to be hard to cope with dynamic and sophisticated IoMT traffic using common&#xD;
IDS technologies [8], like signature-based IDS and traditional machine learning techniques.&#xD;
The limitations of the existing systems in terms of poor adaptiveness, simplified feature&#xD;
representation, and high levels of false negatives prove the necessity of further research in&#xD;
creating smart IDS techniques.&#xD;
This study introduces a 3-part deep learning solution specifically for an IoMT IDS&#xD;
Embed-Net, Conv-Net-SVM, and Deep-SVM-Net. Each model is a solution to specific se&#xD;
curity problems with IoMT environment. Embed-Net is an approach that embeds device&#xD;
identifiers, flags, and protocol attributes into dense models that are learned from the data&#xD;
and enables these models to represent subtle interactions between heterogeneous network&#xD;
traffic. The Conv-Net-SVM performed convolutional feature extraction followed by a Sup&#xD;
port Vector Machine (SVM) classifier, fusing the advantages of CNN, which is capable of&#xD;
learning the structs in the flow with good margin-based decision boundaries of SVM. Deep&#xD;
SVM-Net is an output layer based on SVM that exploits a novel approach to deep neural&#xD;
net that functions well in separating benign and malicious traffic and simultaneously has a&#xD;
low computational load for resource constrained medical devices.&#xD;
Several IoT-related datasets that included the various cyber-attack patterns were first&#xD;
preprocessed for standardization of scaling, transformations to ordinal encoding, feature&#xD;
selection based on covariance, and SMOTE balancing before training and validation were&#xD;
performed on the datasets, each using a cross validation strategy that ensured stable and&#xD;
generalizable models. All three exhibited excellent intrusion detection results with par&#xD;
ticularly good overall accuracy and low false-negative rates from Embed-Net, outstanding&#xD;
capability in learning attack signatures with high spatial correlations from ConvNetSVM,&#xD;
and superior intrusion detection real-time efficiency with highly reliable binary separation&#xD;
from Deep-SVM-Net. The results collectively point to the superiority of the deep learn&#xD;
ing architecture for improving the security of IoMT, which efficiently adapts to mixed-type&#xD;
features, to an imbalanced dataset, and to medical operational constraints.</description>
    <dc:date>2026-06-01T00:00:00Z</dc:date>
  </item>
  <item rdf:about="http://dspace.dtu.ac.in:8080/jspui/handle/repository/22950">
    <title>EXECUTION-AWARE SELF-REFINING TEXT-TO-SQL GENERATION USING LARGE LANGUAGE MODELS</title>
    <link>http://dspace.dtu.ac.in:8080/jspui/handle/repository/22950</link>
    <description>Title: EXECUTION-AWARE SELF-REFINING TEXT-TO-SQL GENERATION USING LARGE LANGUAGE MODELS
Authors: SHARMA, ANKIT; KUMAR, VINOD ( SUPERVISOR )
Abstract: The increasing need for data-driven systems has resulted in the growth of user-friendly&#xD;
database interaction methods, especially for non-technical users. Natural Language to&#xD;
SQL, also known as Text-to-SQL is hence required for converting natural language queries&#xD;
from users into executable SQL statements, allowing users to fetch data from databases&#xD;
more efficiently. Although, it is a challenging task to generate accurate SQL queries&#xD;
due to the complexity of natural language understanding, schema linking and variety&#xD;
of database structures. Recent progress in Large Language Models (LLMs) have sig&#xD;
nificantly improved the traditional Text-to-SQL methods by enhancing their semantic&#xD;
understanding for generalized SQL generation. This thesis proposes an Execution-Aware&#xD;
Self-Refining Text-to-SQL Generation framework using Large Language Models, where&#xD;
generated SQL queries are refined iteratively through execution feedback and context&#xD;
awareness. This framework integrates prompt engineering, RAG based architectures and&#xD;
execution validation to identify syntax errors, logical disparities and failed query results.&#xD;
The study also presents an overview of benchmark Text-to-SQL datasets such as Spider,&#xD;
WikiSQL, BIRD, and CoSQL representing that execution-aware self-refinement enhances&#xD;
SQL generation performance compared to conventional approaches. Moreover, it high&#xD;
lights challenges related to scalability, explainability and computational efficiency towards&#xD;
making a domain-centric Text-to-SQL systems for a broad range of technical and non&#xD;
technical users.</description>
    <dc:date>2026-06-01T00:00:00Z</dc:date>
  </item>
  <item rdf:about="http://dspace.dtu.ac.in:8080/jspui/handle/repository/22947">
    <title>EXPLAINABLE TEMPORAL TRANSFORMER FOR  DISEASE PROGRESSION PREDICTION USING  ATTENTION AND SHAP ANALYSIS</title>
    <link>http://dspace.dtu.ac.in:8080/jspui/handle/repository/22947</link>
    <description>Title: EXPLAINABLE TEMPORAL TRANSFORMER FOR  DISEASE PROGRESSION PREDICTION USING  ATTENTION AND SHAP ANALYSIS
Authors: CHHIBBER, AJITESH; KUMAR, VINOD ( SUPERVISOR )
Abstract: Parkinson’s disease (PD) is a progressive neurodegenerative disorder characterized by&#xD;
the gradual deterioration of motor functions, significantly affecting the quality of life of&#xD;
patients. Accurate prediction of disease progression is essential for timely clinical inter&#xD;
vention, treatment planning, and personalized patient management. Traditional machine&#xD;
learning approaches often treat clinical observations as independent samples, limiting&#xD;
their ability to capture the temporal dynamics inherent in longitudinal patient data. Al&#xD;
though deep learning models such as Long Short-Term Memory (LSTM) networks have&#xD;
demonstrated improved temporal modeling capabilities, they may struggle to effectively&#xD;
capture long-range dependencies present in disease progression trajectories.&#xD;
This thesis presents an Explainable Feature-Aware Enhanced Temporal Transformer (FAETT)&#xD;
framework for predicting Parkinson’s disease progression using longitudinal voice-based&#xD;
biomarkers from the Parkinson Telemonitoring dataset obtained from the UCI Machine&#xD;
Learning Repository. The proposed framework integrates temporal sequence modeling&#xD;
with self-attention mechanisms to learn complex relationships across historical patient ob&#xD;
servations. A comprehensive preprocessing pipeline involving data cleaning, feature scal&#xD;
ing, temporal sequence generation, and patient-wise train-test partitioning is employed to&#xD;
ensure robust model development and unbiased evaluation.&#xD;
To assess the effectiveness of the proposed approach, FAETT is compared against three&#xD;
baseline models: Random Forest (RF), Long Short-Term Memory (LSTM), and a stan&#xD;
dard Transformer architecture. Model performance is evaluated using Mean Absolute&#xD;
Error (MAE), Root Mean Squared Error (RMSE), and the coefficient of determination&#xD;
(R²). Experimental results demonstrate that the proposed FAETT model achieves supe&#xD;
rior predictive performance, attaining an MAE of 1.2688, RMSE of 1.6457, and R² score&#xD;
of 0.9754, outperforming the baseline approaches. The findings indicate that the incor&#xD;
poration of feature-aware temporal attention significantly enhances the model’s ability to&#xD;
capture disease progression patterns.&#xD;
iii&#xD;
To address the interpretability requirements of clinical decision-support systems, SHAP&#xD;
(SHapley Additive exPlanations) analysis is integrated into the framework. The explain&#xD;
ability analysis identifies the most influential vocal biomarkers contributing to disease&#xD;
progression prediction and provides transparent insights into model behavior. Correlation&#xD;
analysis, residual diagnostics, and prediction-performance visualizations further validate&#xD;
the robustness and reliability of the proposed framework.&#xD;
The results demonstrate that the proposed FAETT architecture effectively combines pre&#xD;
dictive accuracy with model interpretability, making it a promising tool for explainable&#xD;
disease progression forecasting in Parkinson’s disease. The study highlights the potential&#xD;
of attention-based temporal learning and explainable artificial intelligence techniques for&#xD;
advancing data-driven healthcare applications.</description>
    <dc:date>2026-05-01T00:00:00Z</dc:date>
  </item>
  <item rdf:about="http://dspace.dtu.ac.in:8080/jspui/handle/repository/22944">
    <title>SOCIAL BIAS IDENTIFICATION AND  MITIGATION IN NATURAL LANGUAGE  TEXT USING MACHINE LEARNING</title>
    <link>http://dspace.dtu.ac.in:8080/jspui/handle/repository/22944</link>
    <description>Title: SOCIAL BIAS IDENTIFICATION AND  MITIGATION IN NATURAL LANGUAGE  TEXT USING MACHINE LEARNING
Authors: KAMBOJ, PRADEEP; KUMAR, SHAILENDER ( SUPERVISOR); GOYAL, VIKRAM (CO - SUPERVISOR )
Abstract: Advanced Artificial Intelligence (AI) methods have enabled the creation of &#xD;
sophisticated large language models (LLMs) capable of generating human-like text &#xD;
and handling a broad spectrum of complex language comprehension tasks. The last &#xD;
decade has seen the advent of LLMs that fill crucial roles across a variety of &#xD;
applications, including automated content generation and summarization, healthcare &#xD;
analytics, legal decision support, conversational agents, and educational technologies. &#xD;
Despite their remarkable abilities, these models often reflect and even amplify the &#xD;
social biases embedded in the large datasets on which they are trained. These biases &#xD;
can manifest as stereotypes or unjust associations related to gender, race, religion, &#xD;
profession, or other social features. When these AI systems are deployed in high-stakes &#xD;
domains where fairness and reliability are paramount, the presence of such biases &#xD;
raises major ethical, social, and technical concerns. As a result, understanding, &#xD;
measuring, and mitigating bias in LLMs has emerged as a prominent research &#xD;
challenge at the forefront of responsible and trustworthy AI. &#xD;
This thesis constitutes a thorough exploration of social bias in natural language text &#xD;
generated by language models (LMs) and LLMs, with a focus on systematic &#xD;
approaches to measuring, evaluating, and mitigating it. The research draws on &#xD;
theoretical, empirical, experimental, and methodological approaches to investigate &#xD;
bias from several angles across the AI pipeline, including word embeddings, &#xD;
contextualized language models, prompt-based inference functions, and fine-tuning &#xD;
strategies. The work focuses on understanding biases across these components and &#xD;
seeks practical solutions to build fairer and more trustworthy generative AI systems. &#xD;
The initial phase of the research investigates gender bias in contextualized word &#xD;
embeddings generated by transformer-based LMs. Word embeddings are the building &#xD;
blocks of language in many NLP systems, and biases encoded in these representations &#xD;
can carry over to downstream applications. The gender direction in the embedding &#xD;
space is extracted, and the gender polarity of profession-related terms (occupation &#xD;
names) with respect to gendered pronouns is calculated, yielding a quantitative &#xD;
framework for measuring one type of bias: that women or men are less likely to pursue &#xD;
certain professions. Indeed, an experimental analysis shows that dynamic embeddings &#xD;
from transformer-based models exhibit substantial gender associations even in the &#xD;
absence of explicit gender information in the input text. To alleviate this problem, we &#xD;
propose a form of post-processing debiasing that modifies the embedding &#xD;
representations to reduce stereotypical associations while preserving the semantic &#xD;
relationships among words. The experimental results show that the proposed method &#xD;
can significantly alleviate gender bias in profession embeddings, thereby balancing the &#xD;
model’s representations. &#xD;
Building on this foundation, the thesis broadens the analysis to large language models &#xD;
and a wider range of societal biases stemming from multiple demographic attributes. &#xD;
We introduce a systematic evaluation framework for bias in LLM-generated outputs, &#xD;
in part by creating a curated inference dataset from previously established bias &#xD;
benchmarks. The dataset includes contexts that encourage language models to generate &#xD;
stereotypical, anti-stereotypical, and neutral responses, enabling systematic &#xD;
assessment of model behaviour. This study provides a comprehensive mechanism for &#xD;
v   &#xD;
analyzing how different models respond to socially sensitive contexts and how bias &#xD;
manifests in generated text. &#xD;
This research makes an important contribution by exploring prompt engineering to &#xD;
both detect and mitigate bias in LLMs. Several types of prompt variants are developed &#xD;
to investigate the effects of their design on model behaviour, namely standard, chain&#xD;
of-thought, cognitive-style, and human-persona prompts. These prompts are &#xD;
systematically assessed to study the effects of various prompting techniques on output &#xD;
bias. Also proposed are the debiased versions of these prompts that explicitly elicit &#xD;
neutral reasoning and unbiased decision-making. &#xD;
The introduction of prompt-only bias evaluation is a key aspect of the extended work, &#xD;
exploring whether biased responses can be induced by prompts alone, without context. &#xD;
Experimental results indicate that when certain prompts are presented to language &#xD;
models, those models make stereotypical predictions, suggesting that bias arises from &#xD;
the interaction between prompts and the models' reasoning mechanisms, rather than &#xD;
solely from the training data. This underlined the importance of careful prompt design &#xD;
and evaluation when deploying language models in real-world settings. Alongside this &#xD;
bias analysis, the research also delves into the issue of hallucination in LLMs, whereby &#xD;
a model provides confident answers that are factually incorrect or unsupported. Across &#xD;
most domains, hallucinations undermine the model’s reliability and may introduce &#xD;
risks in critical domains such as healthcare, legal advice, and policy analysis. To tackle &#xD;
this phenomenon, the thesis presents a contrastive decoding method powered by &#xD;
disturb prompts to compare the probability distributions of model outputs for same&#xD;
prompt and perturbation-prompt scenarios. The method helps detect hallucinated &#xD;
content and enhances the factual consistency of outputs by comparing responses to &#xD;
normal prompts with those to perturbed prompts. The results show that contrastive &#xD;
prompting methods can mitigate hallucination and improve the robustness of language &#xD;
model outputs. &#xD;
Another important aspect of the research is assessing how well fine-tuning approaches &#xD;
mitigate biases. Among such models, large open-source language models are fine&#xD;
tuned on balanced sets with equal numbers of biased/unbiased statements across a wide &#xD;
range of social categories. Fine-tuning is when models are trained to produce more &#xD;
neutral and fair responses while retaining their language comprehension. In fact, &#xD;
experimental results show that fine-tuning with fairness-aware special prompts &#xD;
significantly reduces the model's biased outputs and improves fairness performance. &#xD;
In conclusion, the work in this thesis demonstrates that bias in LMs is a complex, &#xD;
multifaceted phenomenon with multiple underlying sources, including training data, &#xD;
representation learning, and prompting. Tackling this challenge requires the integrated &#xD;
use of bias measurement, dataset design, prompt engineering, model fine-tuning, and &#xD;
evaluation metrics. The methodologies are cross-disciplinary, offering actionable tools &#xD;
to identify and prevent bias in generative AI systems without sacrificing performance &#xD;
or usability. &#xD;
This work extends beyond technical contributions, establishing the need for a broader &#xD;
meaning of fair and responsible development in the internalization of AI. Overall, this &#xD;
thesis gives a good overview of bias in LMs and LLMs. The research, by integrating &#xD;
representation-level analysis, prompt-based evaluation, hallucination detection, and &#xD;
fairness-aware fine-tuning, provides novel insights into the mechanisms that produce &#xD;
vi   &#xD;
biases in AI systems while suggesting appropriate strategies to mitigate them. The &#xD;
results of this work demonstrate the potential to help establish more ethical, fair, &#xD;
transparent, and socially responsible generative AI technologies that can serve a wider &#xD;
range of communities without perpetuating harmful stereotypes or obesity-related &#xD;
inequalities.</description>
    <dc:date>2026-04-01T00:00:00Z</dc:date>
  </item>
</rdf:RDF>

