How to Train Your LLM Web Agent: a Statistical Diagnosis

Welcome to the AI Research Bites. This series of short, informative talks showcases cutting-edge research from the ServiceNow AI Research team. The AI Research Bites are open to all, especially those interested in keeping up with the fast-paced AI research community.
In this presentation, Massimo Caccia shows how best to allocate training compute between supervised fine-tuning (SFT) on expert demonstrations and reinforcement learning (RL) on the agent's own trajectories, a trade-off between quality and quantity. The results demonstrate that starting with SFT and then continuing with RL consistently advances the Pareto front of performance vs. compute. Moreover, as the amount of SFT warm-up increases, the optimal RL hyperparameters shift, revealing how prior supervision shapes the efficiency and stability of downstream RL fine-tuning.
 
Paper: https://arxiv.org/abs/2507.04103
Blogpost: https://huggingface.co/blog/ppEmiliano/how-to-train-your-llm-web-agent-a-statistical-diag
ServiceNow AI Research team: https://www.servicenow.com/research/ 
