Fine-tuning LLaMA 3.2 1B for SQL Generation

This project is about fine-tuning a small LLaMA model (1B) to generate SQL queries from natural language. I'm using a dataset that contains examples of how people ask questions and how those get translated into SQL.

What I'm Doing

I'm starting with a pre-trained LLaMA 3.2 1B model.
I use a dataset called synthetic_text_to_sql-ShareGPT which has examples of prompts and the corresponding SQL queries.
Dataset URL: https://huggingface.co/datasets/mlabonne/synthetic_text_to_sql-ShareGPT
I fine-tune the model using Unsloth libary with LoRA Adapters. This allows me to train only parts of the model, which makes it much faster and memory-efficient.

Evaluation Process

The evaluation pipeline (see Evaluate_LLM.ipynb) works as follows:

Question Generation: 10 SQL questions are generated using Groq’s Gemma 2-9b-it model.
Model Answering: Both the original and fine-tuned LLaMA models answer all 10 questions.
Scoring: Each answer is evaluated and scored (1–10) by Groq’s Gemma 2-9b-it model.
Results: The average scores and feedback for both models are summarized and saved.

Note: I usually use Gemini for evaluation, but yesterday Gemini was slow for some reason, so I used Groq instead. Groq is faster, but the questions and evaluation quality are not as good as Gemini. Even so, the fine-tuned model still performed very well.

Why I’m Doing This

I want to build a model that can understand plain English and generate accurate SQL queries. This can be useful for tools where people want to ask questions about their data without writing SQL themselves.

I’m also doing this for fun and for learning—it’s an investment in my future skills.

Where to Find the Model & Notebooks

You can find the fine-tuned model, including the .gguf file format for easy local use, on my Hugging Face repository:

👉 https://huggingface.co/Adhishtanaka/llama_3.2_1b_SQL/tree/main

You can also download and try the model directly using Ollama:

👉 https://ollama.com/adhishtanaka/llama_3.2_1b-SQL

To run it with Ollama, use:

ollama run adhishtanaka/llama_3.2_1b-SQL

You can find the Jupyter Notebook files used in this project directly in this repository:

Evaluate_LLM.ipynb: The evaluation pipeline for the fine-tuned model.
Llama3.2_1B-SQL.ipynb: The main notebook for fine-tuning and experimentation.

👉 Browse these files in the GitHub repository for full code and documentation.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
screenshots		screenshots
Evaluate_LLM.ipynb		Evaluate_LLM.ipynb
LICENSE		LICENSE
Llama3.2_1B-SQL.ipynb		Llama3.2_1B-SQL.ipynb
model_comparison_detailed_20250708_114354.csv		model_comparison_detailed_20250708_114354.csv
model_comparison_summary_20250708_114354.csv		model_comparison_summary_20250708_114354.csv
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Fine-tuning LLaMA 3.2 1B for SQL Generation

What I'm Doing

Evaluation Process

Why I’m Doing This

Where to Find the Model & Notebooks

About

Uh oh!

Releases

Languages

License

Adhishtanaka/llama3.2_1.b-SQL

Folders and files

Latest commit

History

Repository files navigation

Fine-tuning LLaMA 3.2 1B for SQL Generation

What I'm Doing

Evaluation Process

Why I’m Doing This

Where to Find the Model & Notebooks

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Languages