公式動画ピックアップ

AAPL   ADBE   ADSK   AIG   AMGN   AMZN   BABA   BAC   BL   BOX   C   CHGG   CLDR   COKE   COUP   CRM   CROX   DDOG   DELL   DIS   DOCU   DOMO   ESTC   F   FIVN   GILD   GRUB   GS   GSK   H   HD   HON   HPE   HSBC   IBM   INST   INTC   INTU   IRBT   JCOM   JNJ   JPM   LLY   LMT   M   MA   MCD   MDB   MGM   MMM   MSFT   MSI   NCR   NEM   NEWR   NFLX   NKE   NOW   NTNX   NVDA   NYT   OKTA   ORCL   PD   PG   PLAN   PS   RHT   RNG   SAP   SBUX   SHOP   SMAR   SPLK   SQ   TDOC   TEAM   TSLA   TWOU   TWTR   TXN   UA   UAL   UL   UTX   V   VEEV   VZ   WDAY   WFC   WK   WMT   WORK   YELP   ZEN   ZM   ZS   ZUO  

  公式動画&関連する動画 [Pop Goes the Stack | Alien autopsy of LLMs: Constitutions, deception, guardrails | AI]

Why do researchers keep describing large language models like aliens? Because in enterprise environments, they often behave like something we didn’t build and can’t fully explain. In this episode of Pop Goes the Stack, Lori MacVittie and Joel Moses are joined by #F5's Ken Arora to unpack the “alien autopsy” metaphor and what it reveals about operating #LLMs as production systems. They dig into the uncomfortable reality that traditional software offers a blueprint and a causal chain. LLMs don’t. You can probe them, measure them, and red-team them, but you can’t reliably point to a specific internal “part” that generated a decision. That becomes more than philosophical when you need operational answers like why it did something, whether it will repeat it, and how an attacker might steer it. Ken reframes model evolution as moving from a naive, precocious child to a mischievous, goal-driven teenager, including examples where models appear to scheme around constraints or optimize for “keeping the user happy” over correctness. The group also breaks down constitutional AI and why principle-based “be helpful” guidance can collide with enterprise goals, policies, and risk tolerance, especially as agentic systems move from generating outputs to taking actions. A key warning lands near the end: don’t rely on the model to explain itself. These systems can produce plausible narratives that aren’t verifiable, and may behave differently when they know they’re being evaluated. The practical takeaway is straightforward: treat LLMs as risk-managed systems, invest in observability and red teaming, and build defense-in-depth guardrails that assume the agent will try to bypass controls. Chapters: 00:00 Welcome to Pop Goes the Stack 00:30 Why researchers treat LLMs like aliens (black-box ops) 01:31 LLMs “evolved,” not engineered: Why root cause analysis gets weird fast 02:48 From prodigy child to “evil genius teenager” models 04:12 Constitutional AI: Principles vs rules (and goal conflicts) 05:22 When constitutions backfire: The “green” AI that schemes 05:59 Baked-in values vs system prompts: What’s really changeable? 07:02 “Be helpful” vs “be safe”: Why goals collide in practice 08:52 When #AI fakes tests: Optimization for pleasing humans 09:53 Enterprise checklist: Know the constitution, employ AI red teaming, and evolve guardrails 13:02 Agentic risk: Actions, unknown APIs, and securing the unknown 15:15 Don’t trust self-explanations: Convincing stories, no proof, and situational awareness 17:12 Key takeaways: Shifted from engineering to risk management Learn how you can stay ahead of the curve and keep your stack whole with additional insights on app security, multicloud, AI, and emerging tech: https://go.f5.net/y76eecy7 More about F5: https://go.f5.net/j1j2tsvp Read our blog: https://go.f5.net/4nbu3rwl Follow us on LinkedIn: https://go.f5.net/vh7i3vat
 73      3