SEMIKONG is a semiconductor-focused language model project. This model/ subtree contains the current model assets, training code, inference scripts, configs, references, and model-specific documentation.
The top-level repository is being organized around two major areas:
model/for the semiconductor modelontology/for the semiconductor ontology and knowledge-graph assets
- Christopher Nguyen (
ctn@aitomatic.com) - William Nguyen (
william@aitomatic.com)
- Dataset and benchmarks: https://drive.google.com/drive/u/0/folders/1IjuVyP35-xBEe_i_KkG9MnE-4o7Eb7tq
- Public model weights:
- Base 70B: https://huggingface.co/pentagoniac/SEMIKONG-70B
- Quantized 8B GPTQ: https://huggingface.co/pentagoniac/SEMIKONG-8b-GPTQ
- Quantized 8B instruct GPTQ: https://huggingface.co/sitloboi2012/SEMIKONG-8B-Instruct-GPTQ
- Instruct chat API: launch with
python -m vllm.entrypoints.openai.api_server ...as shown in USAGE.md - Paper: https://arxiv.org/abs/2411.13802
- SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model Christopher Nguyen, William Nguyen, Atsushi Suzuki, Daisuke Oku, Hong An Phan, Sang Dinh, Zooey Nguyen, Anh Ha, Shruti Raghavan, Huy Vo, Thang Nguyen, Lan Nguyen, and Yoshikuni Hirayama. arXiv:2411.13802, 2024.
@article{semikong2024,
title={SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model},
author={Nguyen, Christopher and Nguyen, William and Suzuki, Atsushi and Oku, Daisuke and Phan, Hong An and Dinh, Sang and Nguyen, Zooey and Ha, Anh and Raghavan, Shruti and Vo, Huy and Nguyen, Thang and Nguyen, Lan and Hirayama, Yoshikuni},
journal={arXiv preprint arXiv:2411.13802},
year={2024}
}- Setup and environment: INSTALL.md
- Usage and serving: USAGE.md
- Commands: Makefile
- Training config: configs/training-config.yaml
- Inference config: configs/inference-config.yaml
From the repository root:
make -C model install
make -C model train
make -C model inferIf you need to change paths or parameters first, edit:
- Project overview and model summary: docs/overview.md
- Ecosystem, deployment, and references: docs/ecosystem.md
- Governance, contributions, disclaimer, and license notes: docs/governance.md
The code and repository contents in this project are distributed under the MIT License. Model weights, datasets, and third-party assets may carry separate upstream license terms.
