Skip to content

atif85-git/hf-agents

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 

Repository files navigation

hf-agents : hf CLI extension to detect the best model/quant for a user's hardware and then spins up a local coding agent

hf-agents is a HF CLI extension that uses llmfit to detect the user's hardware and recommend models they can actually run, then spins up a local llama.cpp server with the most suited model and launches a coding agent with Pi.

Go from "what can my machine run?" to "running a local coding agent" in one command.

Requirements & References

Installation

curl -LsSf https://hf.co/cli/install.sh | bash
hf extensions install hf-agents

Usage

Hardware detection & model recommendations

Passes through to llmfit — all llmfit subcommands work:

hf agents fit recommend -n 5      # top 5 models for your hardware
hf agents fit system              # show detected hardware
hf agents fit search "qwen"       # search models
hf agents fit recommend --use-case coding --min-fit good

Run a local coding agent

Detects your hardware, lets you pick a model, starts llama-server, and launches Pi:

hf agents run pi                  # interactive model selection + coding agent
hf agents run pi --print "hello"  # non-interactive mode, forward args to Pi

If llama-server is already running on the target port, it reuses it and skips model selection.

Environment Variables

Variable Default Purpose
LLAMA_SERVER_PORT 8080 Port for llama-server
HF_TOKEN (from hf CLI) For downloading gated models

About

HF CLI extension to run local coding agent powered by llmfit and llama.cpp

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Shell 100.0%