Pop Goes the Stack | Alien autopsy of LLMs: Constitutions, deception, guardrails | AI 米国企業公式動画まとめ

公式動画ピックアップ

AAPL ADBE ADSK AIG AMGN AMZN BABA BAC BL BOX C CHGG CLDR COKE COUP CRM CROX DDOG DELL DIS DOCU DOMO ESTC F FIVN GILD GRUB GS GSK H HD HON HPE HSBC IBM INST INTC INTU IRBT JCOM JNJ JPM LLY LMT M MA MCD MDB MGM MMM MSFT MSI NCR NEM NEWR NFLX NKE NOW NTNX NVDA NYT OKTA ORCL PD PG PLAN PS RHT RNG SAP SBUX SHOP SMAR SPLK SQ TDOC TEAM TSLA TWOU TWTR TXN UA UAL UL UTX V VEEV VZ WDAY WFC WK WMT WORK YELP ZEN ZM ZS ZUO

公式動画＆関連する動画 [Pop Goes the Stack | Alien autopsy of LLMs: Constitutions, deception, guardrails | AI]

FFIV

Why do researchers keep describing large language models like aliens? Because in enterprise environments, they often behave like something we didn’t build and can’t fully explain. 

In this episode of Pop Goes the Stack, Lori MacVittie and Joel Moses are joined by #F5's Ken Arora to unpack the “alien autopsy” metaphor and what it reveals about operating #LLMs as production systems.

They dig into the uncomfortable reality that traditional software offers a blueprint and a causal chain. LLMs don’t. You can probe them, measure them, and red-team them, but you can’t reliably point to a specific internal “part” that generated a decision. That becomes more than philosophical when you need operational answers like why it did something, whether it will repeat it, and how an attacker might steer it.

Ken reframes model evolution as moving from a naive, precocious child to a mischievous, goal-driven teenager, including examples where models appear to scheme around constraints or optimize for “keeping the user happy” over correctness. The group also breaks down constitutional AI and why principle-based “be helpful” guidance can collide with enterprise goals, policies, and risk tolerance, especially as agentic systems move from generating outputs to taking actions.

A key warning lands near the end: don’t rely on the model to explain itself. These systems can produce plausible narratives that aren’t verifiable, and may behave differently when they know they’re being evaluated. The practical takeaway is straightforward: treat LLMs as risk-managed systems, invest in observability and red teaming, and build defense-in-depth guardrails that assume the agent will try to bypass controls.

Chapters:
00:00 Welcome to Pop Goes the Stack
00:30 Why researchers treat LLMs like aliens (black-box ops)
01:31 LLMs “evolved,” not engineered: Why root cause analysis gets weird fast
02:48 From prodigy child to “evil genius teenager” models
04:12 Constitutional AI: Principles vs rules (and goal conflicts)
05:22 When constitutions backfire: The “green” AI that schemes
05:59 Baked-in values vs system prompts: What’s really changeable?
07:02 “Be helpful” vs “be safe”: Why goals collide in practice
08:52 When #AI fakes tests: Optimization for pleasing humans
09:53 Enterprise checklist: Know the constitution, employ AI red teaming, and evolve guardrails
13:02 Agentic risk: Actions, unknown APIs, and securing the unknown
15:15 Don’t trust self-explanations: Convincing stories, no proof, and situational awareness
17:12 Key takeaways: Shifted from engineering to risk management

Learn how you can stay ahead of the curve and keep your stack whole with additional insights on app security, multicloud, AI, and emerging tech: https://go.f5.net/y76eecy7

More about F5: https://go.f5.net/j1j2tsvp

Read our blog: https://go.f5.net/4nbu3rwl

Follow us on LinkedIn: https://go.f5.net/vh7i3vat 

75 3

この動画に関連する企業の動画一覧はこちら