SEMIKONG Model

SEMIKONG is a semiconductor-focused language model project. This model/ subtree contains the current model assets, training code, inference scripts, configs, references, and model-specific documentation.

The top-level repository is being organized around two major areas:

  • model/ for the semiconductor model
  • ontology/ for the semiconductor ontology and knowledge-graph assets

Principals

Quick Links

Papers

@article{semikong2024,
  title={SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model},
  author={Nguyen, Christopher and Nguyen, William and Suzuki, Atsushi and Oku, Daisuke and Phan, Hong An and Dinh, Sang and Nguyen, Zooey and Ha, Anh and Raghavan, Shruti and Vo, Huy and Nguyen, Thang and Nguyen, Lan and Hirayama, Yoshikuni},
  journal={arXiv preprint arXiv:2411.13802},
  year={2024}
}

Start Here

How To Use

From the repository root:

make -C model install
make -C model train
make -C model infer

If you need to change paths or parameters first, edit:

Documentation

License

The code and repository contents in this project are distributed under the MIT License. Model weights, datasets, and third-party assets may carry separate upstream license terms.