Skip to main content

YuanBo Li

GenAI Specialist Solution Architect @ AWS

Focused on LLM inference optimization, AI application platforms, and retrieval-augmented generation

Email: ybalbert@amazon.com

WeChat: AI猿智慧 (1,209 followers)

Bilibili: 前滩猿神 (349 followers)

Research

LLM Inference 2025 – present

Best practices for deploying open-source LLMs (DeepSeek, etc.) on AWS. Performance benchmarking of inference engines (SGLang) across GPU instance types including next-gen B200.

GenAI Platform - Dify on AWS 2024 – 2025

Dify Top Contributor. Building the AWS ecosystem within Dify, making Bedrock the best-supported model provider. Enabling 5 categories of SageMaker-deployed GenAI models to integrate with Dify.

Book Translation

  • Generative AI on AWS (Chinese edition), Douban 8.8/10, Co-translator, 2024 Influential Translator Award