Welcome to the AI Revolution
The field of Artificial Intelligence and Large Language Models is transforming every industry
Reddit Starter Kit
Featured Learning Resources
Attention Is All You Need
The architecture that changed AI
Introduces self‑attention, multi‑head mechanisms, positional encodings, and the encoder‑decoder stack that underpins modern LLMs.
BERT
Bidirectional Encoder Representations
Masked language modeling and next‑sentence prediction for strong bidirectional representations across core NLP tasks.
CLIP
Contrastive Language–Image Pre‑training
Shared image‑text representations enabling zero‑shot classification and retrieval; a foundation for modern VLMs.
LoRA
Low‑Rank Adaptation of LLMs
Parameter‑efficient fine‑tuning via low‑rank matrix updates — reduces trainable parameters and memory while preserving performance.
Latent Diffusion Models
High‑Resolution Image Synthesis
Text‑to‑image generation using diffusion in a compressed latent space — efficient, high‑quality synthesis enabling SD use cases.
wav2vec 2.0
Self‑Supervised Speech Representations
Contrastive learning on raw audio for downstream ASR with minimal labeled data; strong STT with fine‑tuning.
Whisper
Robust ASR via Web‑Scale Supervision
Large‑scale multilingual speech recognition with strong zero‑shot transcription and translation robustness.
Conformer
Convolution‑Augmented Transformer
Combines attention and convolution for local features and global context; a strong ASR backbone.
WaveNet
Generative Raw‑Waveform Audio
Autoregressive raw‑audio synthesis that launched the neural vocoder era for natural TTS.
Tacotron 2
Mel Spectrograms + Vocoder
End‑to‑end TTS with mel‑spectrogram prediction and WaveNet vocoder; set a new quality bar.
VITS
Variational + Adversarial TTS
Unified acoustic and vocoder network with CVAE and GAN training for natural, end‑to‑end speech.
SV2TTS
Speaker‑Encoder + TTS Transfer
Few‑shot voice cloning via transfer learning from speaker verification to multi‑speaker TTS.
AutoVC
Zero‑Shot Voice Style Transfer
Content/speaker disentanglement for zero‑shot voice conversion using only autoencoder losses.
Real‑Time Voice Cloning
Open Toolkit (SV2TTS‑based)
Practical pipeline for real‑time voice cloning: speaker‑encoder, synthesizer, vocoder.
Codex CLI
Terminal‑Based Agentic Coding
Plan, patch, and validate in a single CLI workflow — task planning, apply_patch edits, test/lint hooks, and iterative delivery.
Claude Code CLI
Local + Cloud Coding Flows
Fast, interactive coding sessions with tool use, refactors, and file‑aware edits — works alongside local and hosted LLMs.
OpenCode
Local‑First Agentic IDE
Local coding agent with OpenAI‑compatible endpoints — private inference, routing between local/cloud models, and CLI/IDE bridges.
Courses & Reading Lists (Agents & LLMs)
Stanford CS329A
Self‑Improving AI Agents
Stanford course focused on agentic AI and self‑improvement, with lectures and curated readings on reasoning, tools, and planning.
Berkeley CS294/194‑196
Agentic AI — Fall 2025
UC Berkeley course on agentic AI: lectures, guest talks, and assignments covering frameworks, memory, evaluation, and long‑horizon tasks.
UCSD CSE‑291A
Systems for LLMs & AI Agents
Reading list spanning LLM infrastructure, agents, retrieval, and evaluation — with links to papers and slides.
Agentic AI — Lecture 1
UC Berkeley RDI (Slides)
Introductory slide deck outlining the course overview, agent architectures, and research directions in agentic AI.
Beyond the Basics
Once you've mastered the fundamentals, explore advanced topics that represent the cutting edge of AI research and practical applications.
Chain-of-Thought Reasoning
Advanced prompting techniques that enable LLMs to break down complex problems into step-by-step reasoning processes.
Explore PaperRetrieval-Augmented Generation
Combine the power of large language models with external knowledge bases for more accurate and up-to-date responses.
Explore PaperConstitutional AI
Methods for training AI systems to be helpful, harmless, and honest through constitutional principles and self-correction.
Explore PaperTool-Using Agents
AI systems that can interact with external tools and APIs to extend their capabilities beyond text generation.
Explore PaperNeed Guidance?
Looking for personalized AI learning guidance or strategic consulting on implementing these technologies in your organization?