公式動画ピックアップ

AAPL   ADBE   ADSK   AIG   AMGN   AMZN   BABA   BAC   BL   BOX   C   CHGG   CLDR   COKE   COUP   CRM   CROX   DDOG   DELL   DIS   DOCU   DOMO   ESTC   F   FIVN   GILD   GRUB   GS   GSK   H   HD   HON   HPE   HSBC   IBM   INST   INTC   INTU   IRBT   JCOM   JNJ   JPM   LLY   LMT   M   MA   MCD   MDB   MGM   MMM   MSFT   MSI   NCR   NEM   NEWR   NFLX   NKE   NOW   NTNX   NVDA   NYT   OKTA   ORCL   PD   PG   PLAN   PS   RHT   RNG   SAP   SBUX   SHOP   SMAR   SPLK   SQ   TDOC   TEAM   TSLA   TWOU   TWTR   TXN   UA   UAL   UL   UTX   V   VEEV   VZ   WDAY   WFC   WK   WMT   WORK   YELP   ZEN   ZM   ZS   ZUO  

  公式動画&関連する動画 [Pop Goes the Stack | Model routing isn’t load balancing (And that’s why you’re not ready) | AI]

Multi-model AI isn’t a buzzword anymore, it’s how organizations are actually operating. In this episode of #F5's Pop Goes the Stack, Lori MacVittie and Joel Moses dig into fresh findings from F5's State of Application Strategy (SOAS) Report, showing companies run an average of seven models, and more than half are already orchestrating multiple models together. That’s a big shift, and it changes what “infrastructure readiness” even means. Why do teams chain models in the first place? The answer: cost, capability, and risk. The uncomfortable part? Most infrastructure is still built for deterministic systems, and AI routing is not the same problem as load balancing. Model routing isn’t about spreading traffic evenly. It’s about making a decision on every request: which model is best for this job, what will it cost, what’s the risk, and what’s the fallback when the answer is wrong or low quality. Joel frames it as a category change, from “where should this request go?” to “what should happen as a result of this request?” That shift forces new requirements: policy enforcement across models, identity-aware access, decision justification, and mechanisms to recover when output quality degrades due to drift, configuration changes, or poisoned inputs like compromised RAG data. Lori ties it back to governance, not just availability, and why “AI workloads” expose gaps that traditional tooling can’t cover. While many organizations are operationalizing #AI, that doesn’t mean it’s manageable yet. If you want to know how to move forward from here, this is an episode you don't want to miss. Chapters: 00:00 Welcome to Pop Goes the Stack 00:21 F5 research confirms multi-model AI is already real 01:17 Multi-model orchestration: Reasoning vs cost tradeoffs 02:15 Where does delivery and security matter most in AI? Input, output, identity, or routing? 03:54 Why load balancing breaks: Model routing isn’t “distribution” 04:45 “Measuring fog with a ruler”: Uncertainty vs deterministic tools 06:01 Model routing is control: 100 variables we don’t measure yet 07:33 Decision points: Which model is right for this request? 08:27 Infrastructure isn’t AI-ready: Deterministic vs probabilistic systems 12:47 Routing becomes governance: Policy, access, and enforcement 15:19 Failover becomes “bad vibes”: Retries, prompts, temps, RAG 17:41 Key takeaways: Infrastructure is fundamentally changing for AI to be operational and manageable Get your copy of the 2026 State of Applications Strategy Report: https://go.f5.net/4jhryeya Learn how you can stay ahead of the curve and keep your stack whole with additional insights on app security, multicloud, AI, and emerging tech: https://go.f5.net/2r85m4ae More about F5: https://go.f5.net/eumc26xr Read our blog: https://go.f5.net/ftvta5u6 Follow us on LinkedIn: https://go.f5.net/fmtcbcpy
 119      6