cpu-first llm inference engine in rust. gguf loader, gpt-2
transformer with kv cache, zero framework dependencies.
~2,200 lines.
rust, inference, gguf, cpu
minimal x86_64 kernel in rust. gdt, idt, paging, heap with
3 allocation strategies, cooperative async executor,
ramdisk, and a shell with 10 commands. runs in qemu.
rust, kernel, x86_64, no_std