Wen-Chuang Chou

Logo

Passionate about AI with roots in theoretical neuroscience,
I explore LLMs, AI agent, and human mobility—driven by a vision of accessible, everyday AI for all.

View My GitHub Profile

My AI Portfolio

Engineering advanced AI systems—from autonomous multi-agent systems and scaling reasoning-focused LLMs on multi-node GPU clusters to performance profiling and distilling DeepSeek R1.


AI Agent

Building intelligent AI agents that dynamically reason, retrieve, and self-correct—from Agentic RAG with colocated vLLM inference to tool-augmented reasoning on the GAIA benchmark.

Key projects:

Explore all AI Agent projects →


LLM Benchmarking and Profiling

Systematic performance analysis of Transformer architectures—benchmarking FP32 vs. BF16 mixed precision and profiling compute- vs. memory-bound operations in self-attention.

Key projects:

Explore all Benchmarking projects →


LLM Distillation & Fine-Tuning

Advanced post-training and fine-tuning across large language models—from distilling DeepSeek R1 on multi-node HPC to instruction-tuning Llama 3.

Radar plot

Key projects:

Explore all LLM Distillation & Fine-Tuning projects →


Generative AI & Applied Machine Learning

Developing and fine-tuning generative models for image synthesis, as well as applying advanced deep learning architectures to real-world predictive modeling and audio processing tasks.

LoRA Output   Anime Face

Key projects:

Explore all Generative AI & Applied Machine Learning projects →