Institutional-Repository, University of Moratuwa.  

Text-to-SQL generation using schema item classifier and encoder-decoder architecture

dc.contributor.advisor Uthayasanker, T
dc.contributor.author Rushdy, MSA
dc.date.accessioned 2025-02-03T08:44:53Z
dc.date.available 2025-02-03T08:44:53Z
dc.date.issued 2023
dc.identifier.citation Rushdy, M.S.A. (2023). Text-to-SQL generation using schema item classifier and encoder-decoder architecture [Master's thesis, University of Moratuwa]. Institutional Repository University of Moratuwa. http://dl.lib.uom.lk/handle/123/23424
dc.identifier.uri http://dl.lib.uom.lk/handle/123/23424
dc.description.abstract The objective of the text-to-SQL task is to convert natural language queries into SQL queries. Large text-to-SQL datasets that span multiple domains, such as Spider, introduce the challenge of generalizing effectively to unseen data. Existing semantic parsing models have struggled to achieve notable performance improvements on these cross-domain datasets. As a result, recent work has focused on leveraging pre-trained language models to improve generalization and performance in text-to-SQL tasks. I propose an approach that evaluates and uses a Seq2Seq model, giving the most relevant schema items as input to the encoder and generating accurate and valid cross-domain SQL queries with the decoder by understanding the skeleton of the target SQL query. The proposed approach is evaluated on the Spider dataset, a well-known dataset for the text-to-SQL task, and obtains promising results: Exact Match accuracy and Execution accuracy are boosted to 72.7% and 80.2% respectively compared to the other best related approaches. Keywords: Text-to-SQL, Seq2Seq model, BERT, RoBERTa, T5-Base en_US
dc.language.iso en en_US
dc.subject TEXT-TO-SQL
dc.subject SEQ2SEQ MODEL
dc.subject BERT
dc.subject T5-BASE
dc.subject ROBERTA
dc.subject COMPUTER SCIENCE & ENGINEERING – Dissertation
dc.subject COMPUTER SCIENCE- Dissertation
dc.subject MSc in Computer Science
dc.title Text-to-SQL generation using schema item classifier and encoder-decoder architecture en_US
dc.type Thesis-Abstract en_US
dc.identifier.faculty Engineering en_US
dc.identifier.degree MSc in Computer Science en_US
dc.identifier.department Department of Computer Science & Engineering en_US
dc.date.accept 2023
dc.identifier.accno TH5304 en_US
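
The abstract describes giving only the most relevant schema items to the encoder together with the question. The sketch below illustrates that idea in Python under several assumptions that are not taken from the thesis: the relevance scores are hard-coded stand-ins for a schema item classifier's output, the function name `build_encoder_input`, the `top_tables` and `top_columns` cut-offs, and the `" | "` serialization format are all hypothetical, and ranking tables by their best column score is a simplification chosen for brevity.

```python
from typing import Dict, List


def build_encoder_input(
    question: str,
    column_scores: Dict[str, Dict[str, float]],
    top_tables: int = 2,
    top_columns: int = 3,
) -> str:
    """Serialize the question with only the highest-scoring schema items.

    column_scores maps table name -> {column name -> relevance score}.
    The scores stand in for a schema item classifier (hypothetical values).
    """
    # Rank tables by their best column score (a simplifying assumption).
    ranked_tables = sorted(
        column_scores,
        key=lambda t: max(column_scores[t].values()),
        reverse=True,
    )[:top_tables]

    parts: List[str] = [question]
    for table in ranked_tables:
        cols = column_scores[table]
        # Keep only the most relevant columns of each retained table.
        kept = sorted(cols, key=cols.get, reverse=True)[:top_columns]
        parts.append(f"{table} : {' , '.join(kept)}")
    return " | ".join(parts)


# Hypothetical scores for a small three-table schema.
example_scores = {
    "singer": {"name": 0.95, "age": 0.10, "country": 0.80},
    "concert": {"concert_name": 0.05, "year": 0.02},
    "stadium": {"capacity": 0.01},
}
encoder_input = build_encoder_input(
    "What are the names of singers from France?",
    example_scores,
    top_tables=1,
    top_columns=2,
)
print(encoder_input)
```

A string built this way would be tokenized and passed to the encoder of a Seq2Seq model such as T5-Base, leaving the decoder to produce the SQL query. Filtering before encoding keeps the input short and removes schema items that are unlikely to appear in the target query.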

