公式動画ピックアップ
AAPL
ADBE
ADSK
AIG
AMGN
AMZN
BABA
BAC
BL
BOX
C
CHGG
CLDR
COKE
COUP
CRM
CROX
DDOG
DELL
DIS
DOCU
DOMO
ESTC
F
FIVN
GILD
GRUB
GS
GSK
H
HD
HON
HPE
HSBC
IBM
INST
INTC
INTU
IRBT
JCOM
JNJ
JPM
LLY
LMT
M
MA
MCD
MDB
MGM
MMM
MSFT
MSI
NCR
NEM
NEWR
NFLX
NKE
NOW
NTNX
NVDA
NYT
OKTA
ORCL
PD
PG
PLAN
PS
RHT
RNG
SAP
SBUX
SHOP
SMAR
SPLK
SQ
TDOC
TEAM
TSLA
TWOU
TWTR
TXN
UA
UAL
UL
UTX
V
VEEV
VZ
WDAY
WFC
WK
WMT
WORK
YELP
ZEN
ZM
ZS
ZUO
公式動画&関連する動画 [Pop Goes the Stack | Alien autopsy of LLMs: Constitutions, deception, guardrails | AI]
Why do researchers keep describing large language models like aliens? Because in enterprise environments, they often behave like something we didn’t build and can’t fully explain.
In this episode of Pop Goes the Stack, Lori MacVittie and Joel Moses are joined by #F5's Ken Arora to unpack the “alien autopsy” metaphor and what it reveals about operating #LLMs as production systems.
They dig into the uncomfortable reality that traditional software offers a blueprint and a causal chain. LLMs don’t. You can probe them, measure them, and red-team them, but you can’t reliably point to a specific internal “part” that generated a decision. That becomes more than philosophical when you need operational answers like why it did something, whether it will repeat it, and how an attacker might steer it.
Ken reframes model evolution as moving from a naive, precocious child to a mischievous, goal-driven teenager, including examples where models appear to scheme around constraints or optimize for “keeping the user happy” over correctness. The group also breaks down constitutional AI and why principle-based “be helpful” guidance can collide with enterprise goals, policies, and risk tolerance, especially as agentic systems move from generating outputs to taking actions.
A key warning lands near the end: don’t rely on the model to explain itself. These systems can produce plausible narratives that aren’t verifiable, and may behave differently when they know they’re being evaluated. The practical takeaway is straightforward: treat LLMs as risk-managed systems, invest in observability and red teaming, and build defense-in-depth guardrails that assume the agent will try to bypass controls.
Chapters:
00:00 Welcome to Pop Goes the Stack
00:30 Why researchers treat LLMs like aliens (black-box ops)
01:31 LLMs “evolved,” not engineered: Why root cause analysis gets weird fast
02:48 From prodigy child to “evil genius teenager” models
04:12 Constitutional AI: Principles vs rules (and goal conflicts)
05:22 When constitutions backfire: The “green” AI that schemes
05:59 Baked-in values vs system prompts: What’s really changeable?
07:02 “Be helpful” vs “be safe”: Why goals collide in practice
08:52 When #AI fakes tests: Optimization for pleasing humans
09:53 Enterprise checklist: Know the constitution, employ AI red teaming, and evolve guardrails
13:02 Agentic risk: Actions, unknown APIs, and securing the unknown
15:15 Don’t trust self-explanations: Convincing stories, no proof, and situational awareness
17:12 Key takeaways: Shifted from engineering to risk management
Learn how you can stay ahead of the curve and keep your stack whole with additional insights on app security, multicloud, AI, and emerging tech: https://go.f5.net/y76eecy7
More about F5: https://go.f5.net/j1j2tsvp
Read our blog: https://go.f5.net/4nbu3rwl
Follow us on LinkedIn: https://go.f5.net/vh7i3vat
73
3