👋 Hello world.
I'm Antonello Fratepietro, a cloud engineer and developer passionate about building scalable Cloud and AI infrastructure. I enjoy working with modern technologies to create efficient, production-ready solutions.
-
Cognitora inference - Open-source, datacenter-scale LLM inference stack: orchestration above vLLM, SGLang, TensorRT-LLM, and llama.cpp with KV-aware routing, prefill/decode disaggregation, multi-tier KV cache, and static Rust binaries for bare metal, Kubernetes, or cloud.
-
s0-cli - LLM-driven CLI agent that finds security vulnerabilities and AI-slop patterns (stub auth, hallucinated imports, dummy crypto, prompt-injection sinks). Hybrid of classic scanners (semgrep, bandit, gitleaks, trivy) + LLM triage, with a self-optimizing meta-harness loop.
-
VittoriaDB - Zero-configuration embedded vector database with HNSW indexing, ACID storage, and REST API. Single Go binary for local AI development.
-
DistX - High-performance vector database written in Rust. Features HNSW indexing with SIMD optimizations, Qdrant-compatible REST API, and gRPC support.
-
PeerClaw - Decentralized P2P AI agent network. One static binary. BitTorrent meets AI inference: peers share compute, agents earn & spend tokens.
- Website: fratepietro.com antonello.dev
- GitHub: @antonellof
- Credly: My Certifications
- Databricks: My Credentials
- X.com: ☁️ Hack the Cloud
🕹️ Learned to code on a Commodore, and I've basically been hitting RUN ever since.





