As large language models move into production, inference performance is increasingly defined by systems-level decisions, not model architecture or prompts.
This event explores the infrastructure and low-level engineering challenges behind efficient LLM inference, including KV cache movement, memory bandwidth, cache efficiency, distributed execution, and long-context optimization.
Across three technical talks, we’ll cover disaggregated inference on modern cloud hardware, data-oriented design for high-performance inference engines, and structural sparsity techniques for KV cache compression.
The event is designed for engineers and researchers working on LLM infrastructure, inference engines, and ML systems, and concludes with networking, food, and drinks.
Agenda
18:00 Doors open
18:30 - 18:50 Disaggregated Inference with EFA and NIXL (Toshinobu Akazawa - Solutions Architect, AWS)
18:50 - 19:10 High-Performance Inference Execution, Caching, and Systems using Data-Oriented Design (Julie...
Platform: luma