Please use this identifier to cite or link to this item: http://dspace.dtu.ac.in:8080/jspui/handle/repository/22044
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSHUKLA, PRADYUMN-
dc.date.accessioned2025-08-01T06:07:34Z-
dc.date.available2025-08-01T06:07:34Z-
dc.date.issued2025-05-
dc.identifier.urihttp://dspace.dtu.ac.in:8080/jspui/handle/repository/22044-
dc.description.abstractThe increasing reliance on data-driven decision-making has brought intuitive database access into limelight, particularly for inexperienced users. Text-to-SQL technologies bridge this shortcoming by converting natural language queries to SQL queries and thereby render database interaction more intuitive. Large Language Models have also influenced the Text2SQL system paradigm towards predicting correct and context-aware SQL. This survey maps the historical development of Text2SQL approaches from rule-based systems to LLM- based neural models. Extensive efforts have gone into using prompt engineering, schema alignment methods, and domain fine-tuning to ensure higher accuracy and generality. The models now exhibit significant progress in understanding complex queries as well as precise SQL code generation through emergent Large Language Model capabilities. The early systems had extremely strong template-based or rule-based mechanisms, whereas generation these days is extremely advanced neural systems brimming with domain knowledge and highly specialized embeddings. Although LLMs, particularly GPT and BERT, have really set the bar high for query interpretability and execution accuracy, there are significant challenges regarding meeting the needs of domain specificity, intricate queries, and scalability across heterogeneous schemas. It also pointed out how the RAG generation mechanism has been integrated and called for a paradigm shift towards adopting TAG for richer schema interaction. Future directions involve developing explainable models, fine-tuning multiturn conversational capabilities, and optimizing computational efficiency toward robustness and ease of use for Text2SQL systems. By addressing the gaps, the study lays the foundations for innovations in database querying, using LLMs to redefine accessibility and usability for a wide range of users.en_US
dc.language.isoenen_US
dc.relation.ispartofseriesTD-8123;-
dc.subjectTEXT-TO-SQLen_US
dc.subjectLARGE LANGUAGE MODELS (LLMS)en_US
dc.subjectRETRIEVAL AUGMENTED GENERATION (RAG)en_US
dc.subjectPROMPT ENGINEERINGen_US
dc.subjectQUERY INTERPRETABILITYen_US
dc.subjectCROSS-DOMAIN GENERALIZATIONen_US
dc.subjectDATA-DRIVEN DECISION-MAKINGen_US
dc.subjectCOMPLEX QUERIESen_US
dc.subjectTAGen_US
dc.titleANALYSIS AND DEVELOPMENT OF TEXT-TO-SQL TRANSLATION SYSTEM USING LARGE LANGUAGE MODELS (LLMs)en_US
dc.typeThesisen_US
Appears in Collections:MTech Data Science

Files in This Item:
File Description SizeFormat 
Pradyumn Shukla M.Tech.pdf2.59 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.