This repository is a personal collection of experiments and notebooks exploring language and vision architectures (GPT-2, BERT, ViT/DeiT, LSTM/RNN) along with a small personal chatbot project. Notebooks are intended to be readable, runnable, and educational.
- Notebooks (top-level):
bert.ipynbGPT2 .ipynb(note: filename contains a space)lstm.ipynbrnn.ipynbVision_transformer.ipynbDeiT.ipynbnano_vlm.ipynb
- Data:
data/MNIST/raw/— raw IDX MNIST files for quick experiments
- Personal chatbot:
personal_chatbot/GPT2_project.ipynbpersonal_chatbot/instruction-data.json
- Misc:
attn_is_all_u_need.png
-
Create and activate a virtual environment (recommended):
python -m venv .venv source .venv/bin/activate
-
Install common packages used across the notebooks (example):
pip install --upgrade pip pip install jupyterlab notebook numpy pandas matplotlib torch torchvision transformers datasets tqdm
-
Launch Jupyter and open the notebook you want to run:
jupyter lab
Notes:
- Many notebooks are exploratory; run them cell-by-cell and read the inline commentary.
- For training or fine-tuning, a GPU (CUDA) will significantly speed up runs. Notebooks will run on CPU but more slowly.
Open personal_chatbot/GPT2_project.ipynb for a guided example of using a GPT-2 model for a small chatbot, including tokenizer setup and fine-tuning data format (instruction-data.json).