On‑Demand Head of AI. Proven AI Leadership. Transformative Results.

Agentic AI — Tool‑Integrated Reasoning (TIR)

[Diagram: TIR • Tool‑Integrated Reasoning. LLM (Reason) → Python Tool → LLM+ (Update). Example tool call: import math; math.factorial(6) => 720]
The LLM routes sub‑tasks to Python, integrates the results, and continues reasoning.
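The loop described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a production implementation: `stub_model` is a hypothetical stand-in for a real LLM call, and the `# tool: python` prefix and the `=> ` result marker are conventions chosen here to mirror the diagram.

```python
import math
from typing import Callable, List

TOOL_PREFIX = "# tool: python"  # assumed marker for a tool request
RESULT_PREFIX = "=> "           # assumed marker for a tool result


def run_python_tool(expression: str) -> str:
    """Evaluate one expression with only the math module in scope.

    Builtins are disabled to limit what the expression can reach.
    This is not a real sandbox; isolate untrusted code properly.
    """
    return str(eval(expression, {"__builtins__": {}, "math": math}))


def stub_model(transcript: List[str]) -> str:
    """Hypothetical stand-in for an LLM: request the tool once, then answer."""
    results = [line for line in transcript if line.startswith(RESULT_PREFIX)]
    if not results:
        return TOOL_PREFIX + "\nmath.factorial(6)"
    return "6! = " + results[-1][len(RESULT_PREFIX):]


def tir_loop(question: str, model: Callable[[List[str]], str],
             max_steps: int = 4) -> str:
    """Alternate between model reasoning and tool execution."""
    transcript = [question]
    for _ in range(max_steps):
        reply = model(transcript)
        if not reply.startswith(TOOL_PREFIX):
            return reply  # the model produced a final answer
        # Route the sub-task to Python and feed the result back.
        expression = reply[len(TOOL_PREFIX):].strip()
        transcript.append(reply)
        transcript.append(RESULT_PREFIX + run_python_tool(expression))
    raise RuntimeError("no final answer within max_steps")


answer = tir_loop("What is 6 factorial?", stub_model)
print(answer)
```

The `max_steps` cap is a design choice in this sketch: it keeps a model that never stops requesting tools from looping forever.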

As your on‑demand Head of AI and Chief Data Scientist, I partner with post‑seed and growth teams to set strategy and roadmaps, make the first AI hires, evaluate vendors, and ship real systems—from prototype to production. We design and run LLM/RAG architectures, build agentic workflows and tools, and implement evaluation, observability, safety guardrails, and end‑to‑end MLOps.

We bring 20 years of hands‑on AI and systems engineering, from Fortune 500 programs to award‑winning open source, to post‑seed and growth teams, with a velocity‑focused approach that ships functioning prototypes.

About Shlomo Kashani

Shlomo Kashani

Founder, QNeura.ai

On‑Demand Head of AI • Chief Data Scientist

Shlomo Kashani is an AIMO‑2 Gold Medalist, published author (Deep Learning Interviews, GitHub), and founder of QNeura.ai. He leads strategy and hands‑on delivery for LLM/RAG systems, agentic AI, and MLOps.

An interdisciplinary technologist, he weaves advanced AI research with DSS‑informed philosophical inquiry and operates as a Chief Scientist, fusing scientific precision with ethical and cultural insight.

Hands‑on with agentic frameworks and orchestrators, LLMs/VLMs (Anthropic, OpenAI, DeepSeek), and the full stack from pre‑training and fine‑tuning to LoRA, multi‑GPU inference, and deployments on HuggingFace and vLLM.

Education: DSS Strategic Studies (MSU), M.Sc. Quantum Physics/Computing (Johns Hopkins), M.Sc. DSP (Queen Mary), B.Sc. Engineering (Ben‑Gurion). Open‑source contributions include QuantumLLMInstruct, metalQwen3, vLLM‑5090, and osxQ.


Services

We partner end‑to‑end—from strategy and roadmaps to prototypes and production—to implement GenAI and quantum capabilities that move the needle.

Under the hood that means Python/PyTorch, C++/CMake, CUDA, ONNX, TensorRT, vLLM/llama.cpp, AWS/GCP, and edge‑optimized inference. I maintain open‑source projects including QuantumLLMInstruct, metalQwen3, vLLM‑5090, and QonFusion. My background spans DSS Strategic Studies (Missouri State University), Johns Hopkins (M.Sc. Quantum Physics/Computing), Queen Mary (M.Sc. DSP), and Ben‑Gurion (B.Sc. Engineering); I’m an AIMO‑2 Gold Medalist and author of Deep Learning Interviews (GitHub).

AI Strategy Consulting

Strategic guidance on AI implementation, technology selection, and organizational transformation for quantum-ready enterprises.

Quantum ML Development

Custom quantum machine learning solutions, algorithm development, and hybrid classical-quantum system design.

Technology Integration

Seamless integration of quantum-enhanced AI capabilities into existing business workflows and technical infrastructure.

Get in touch

Ready to accelerate your AI roadmap or discuss fractional leadership? Reach out and we’ll help you get started.

QNeura.ai

osxQ — Apple Silicon Quantum Simulator