This repository contains the code for our paper:

**Understanding Linear Probing then Fine-tuning Language Models from NTK Perspective**
Akiyoshi Tomihari and Issei Sato (arXiv / OpenReview)
The main dependencies are:

- Python 3.9 or higher
- `adapter-transformers == 2.2.0`
- `torch == 1.12.1`

Please refer to the `pyproject.toml` file for more details.
To set up and run the project, follow these steps:

```shell
# Install torch and torchvision
pushd wheels
bash download.sh
popd

# Configure Poetry to create virtual environments within the project directory
poetry config virtualenvs.in-project true

# Set the local Python version using pyenv
pyenv local 3.9.18

# Install dependencies and activate the virtual environment
poetry install
poetry shell
```

The data files for GLUE and SuperGLUE will be downloaded automatically.
To run the OOD and PubMed experiments, you need to download the following data files:

- Amazon:
  - Files: `train.tsv`, `test.tsv`
  - Location: `tasks/OOD_NLP/datasets/process/SentimentAnalysis/amazon`
- DynaSent, SemEval, SST-5:
  - Files: `test.tsv` for each dataset
  - Locations:
    - `tasks/OOD_NLP/datasets/process/SentimentAnalysis/dynasent`
    - `tasks/OOD_NLP/datasets/process/SentimentAnalysis/semeval`
    - `tasks/OOD_NLP/datasets/process/SentimentAnalysis/sst5`
- PubMed 20k RCT:
  - Files: `train.txt`, `dev.txt`, `test.txt`
  - Location: `tasks/pubmed/pubmed-rct/PubMed_20k_RCT_numbers_replaced_with_at_sign`
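Once the files above are in place, they can be read with a few lines of standard-library Python. This is a minimal sketch only: the two-column `sentence<TAB>label` layout is an assumption for illustration, not a documented spec of these datasets, so check the actual files after downloading them.

```python
# Hedged sketch: read (sentence, label) pairs from a tab-separated file.
# The column layout is assumed, not taken from the datasets' documentation.
import csv
import io


def read_sentiment_tsv(fp):
    """Yield (sentence, label) pairs from a tab-separated file object."""
    for row in csv.reader(fp, delimiter="\t"):
        if len(row) >= 2:  # skip blank or malformed lines
            yield row[0], row[1]


# Usage with an inline sample standing in for e.g. amazon/test.tsv:
sample = "great battery life\t1\nbroke after a week\t0\n"
pairs = list(read_sentiment_tsv(io.StringIO(sample)))
# pairs == [("great battery life", "1"), ("broke after a week", "0")]
```

For the real files, replace the `StringIO` sample with `open(path, newline="")`.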
Please refer to the data sources and our code for details.
We use the following resources and libraries:

- Base code structure: PETuning
- Computing the NTK matrix and linear probing: LM-Kernel-FT
- LoRA method: LoRA
- Libraries for implementation: Hugging Face Transformers, Adapter-Hub
- Datasets: OOD_NLP, PubMed 200k RCT dataset
```bibtex
@inproceedings{
  tomihari2024understanding,
  title={Understanding Linear Probing then Fine-tuning Language Models from NTK Perspective},
  author={Akiyoshi Tomihari and Issei Sato},
  booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
  year={2024},
  url={https://openreview.net/forum?id=1v4gKsyGfe}
}
```