evaluation

Search documents
X @The Economist
The Economist· 2025-07-11 17:20
The country’s split with the EU, paradoxically, encouraged the services boom by crushing sterling, making British salaries cheaper for foreigners https://t.co/NiecyDf34n ...
FDA to Revisit Opioid Labeling for Chronic Pain
Bloomberg Television· 2025-07-11 16:03
Talk to us about opioids that the FDA failed. The FDA did fail. There were many parts of our health care system that failed.Look, I feel terrible about the opioid epidemic. I personally prescribed opioids with misinformation. It was taught to me by my chief resident in my surgical residency, and we sort of woke up to this problem around 2014, 2015 and thought, Oh my God, what have we done.What information did we not have. I think was regulatory capture at the agency. Now we can't prove that.But when the rev ...
Why Did The Dollar Just Hit A 50-Year Low? - Chamath Palihapitiya
All-In Podcast· 2025-07-10 15:01
Chim, any thoughts here on the dollar. I think the dollar has devalued 50% in the last 35 or 40 years. So, I think it's somewhat useful to look at any single couple of months in time, but this has been a one-way trade for a very long time.And it's probably important to understand why that is. And I think it generally has to do with the fact that the United States finances a lot of growth and that has been the right decision. So unless you see a complete collapse in the currency, I suspect that this decay co ...
Jack Clark: 美国AI 政策的隐形推手,时代的良心还是囚徒?
Hu Xiu· 2025-07-06 00:12
Jack Clark是最关注和熟悉中国在芯片、计算和模型上进展的AI Lab领导人之一。他毫不吝啬对中国AI进展的认可,将DeepSeek R1视作"推理模型大范围 扩散"的起点,近期又把HyperHetero使用的异构集群叫做通过"超级智能进行持续自我训练"的垫脚石。 同时,他是美国的AI政策"民间沙皇",曾是OpenAI的政策口负责人,现在是Anthropic的联合创始人,他对中国异常冷酷和强硬,Dario Amodei在R1出现 后的万字檄文背后就有他的影子。6月25日的华盛顿DC有关算力限制和AI竞争的政策听证会,他也是发言的焦点,为美国遏制中国AI发展设计了详细的5 层战略。 那么,这位记者出身的英文系毕业生是如何进入AI的核心圈?作为一名出生于英国小镇的美国移民,Jack Clark提出对华强硬政策的底层原因是什么?他 的政策理念和具体对华战略的政策建议到底是什么?以AI安全为由推动的对抗性AI政策背后的悖论在哪里?这些都是本文希望回答的问题。 一、Intro:技术必然与社会外部因素互相交织 华盛顿DC的六月,空气已经算湿热。在雷伯恩众议院办公大楼那间镶着深色木板的听证会室里,冷气开得十足,却驱不 ...
Jack Clark: 美国 AI 政策的隐形推手,时代的良心还是囚徒?
海外独角兽· 2025-07-04 07:58
海外独角兽长期开放开源共创,欢迎在我们这里发布你关于 AI 行业的深度观察与思考,可后台留 言"投稿"联系我们。 作者:程天一 本文为科技评论撰稿人程天一投稿,深入剖析了 Jack Clark——这个在 AI 爱好者中或许并不耳熟能 详,但却是值得中国 AI 社区深入了解的人物。 Jack Clark 是最关注和熟悉中国在芯片、计算和模型上进展的 AI Lab 领导人之一。他毫不吝啬对 中国 AI 进展的认可,将 DeepSeek R1 视作"推理模型大范围扩散"的起点,近期又把 HyperHetero 使用的异构集群叫做通过"超级智能进行持续自我训练"的垫脚石。 同时,他是美国的 AI 政策"民间沙皇",曾是 OpenAI 的政策口负责人,现在是 Anthropic 的联合创始 人,他对中国异常冷酷和强硬,Dario Amodei 在 R1 出现后的万字檄文背后就有他的影子。6 月 25 日的华盛顿 DC 有关算力限制和 AI 竞争的政策听证会,他也是发言的焦点,为美国遏制中国 AI 发 展设计了详细的 5 层战略。 那么,这位记者出身的英文系毕业生是如何进入 AI 的核心圈?作为一名出生于英国小镇的美国移 ...
How Government Debt Reduces Your Buying Power
Principles by Ray Dalio· 2025-07-03 13:24
The most important principle to keep in mind when thinking about large government debts and deficits such as those that we have and that are coming is when countries have too much debt, lowering interest rates and devaluing the currency that the debt is denominated in is the preferred path government policy makers are likely to take. So it pays to bet on that happening. That means betting on a weaker currency and uh lower real interest rates are the best path.And the reason governments uh prefer to take tha ...
摩根士丹利:中国思考-可能改变一切的三方组合-如果被允许的话
摩根· 2025-07-03 02:41
July 2, 2025 09:30 AM GMT China Musings | Asia Pacific M Idea The Trio That Could Change Everything — If It's Allowed To China doesn't just need new stimulus, a new growth algorithm is needed too. The "trio" of reforms could eventually form the basis for that. But old habits and incentives die hard. The 15th Five- Year-Plan will be the real litmus test: is Beijing ready to stop rewarding what it wants to reduce? At the Central Commission for Financial and Economic Affairs meeting hosted this Tuesday (July 1 ...
Suddenly Whirlpool is in the driver's seat, says Jim Cramer
CNBC Television· 2025-07-03 00:20
[Music] Hey, I'm Kramer. Welcome to Mad Money. Welcome to Kramer.I'll do my make friends. I'm just trying to make you a little money. My job is not just entertain but dedicate to a little teaching.So call me at 180073 CBC. Tweet me at Jim Kramer. You know why I first got into this wacky business.Stories. Stories. That's why.Tremendous intriguing stories. Tales that could explain what's going to happen and you could actually make a little money from them. And that's what's happening right now.And the stories ...
AI Agent、传统聊天机器人有何区别?如何评测?这篇30页综述讲明白了
机器之心· 2025-07-02 07:03
论文作者包括来自上海交通大学的朱家琛、芮仁婷、单榕、郑琮珉、西云佳、林江浩、刘卫文、俞勇、张伟楠,以及华为诺亚研究所的朱梦辉、陈渤、唐睿明。 本文第一作者是朱家琛,上海交通大学博士生,主要研究兴趣集中在大模型推理,个性化 Agent。本文通讯作者是张伟楠,上海交通大学教授,研究方向包含强化 学习、数据科学、机器人控制、推荐搜索等。 自从 Transformer 问世,NLP 领域发生了颠覆性变化。大语言模型极大提升了文本理解与生成能力,成为现代 AI 系统的基础。而今,AI 正不断向前,具备自主决 策和复杂交互能力的新一代 AI Agent 也正加速崛起。 不同于以往只会对话的 LLM 机器人,AI Agent 能够接入互联网、调用各类 API,还能根据真实环境反馈灵活调整策略。AI Agent 因此具备了感知环境和自主决策 的能力,已经突破了传统 "问答模式" 的限制,能够主动执行任务、应对各种复杂场景,真正成为用户身边可靠的智能助手。 在这股 AI Agent 浪潮中,每个人都可以有属于自己的 AI Agent。而如何衡量自己的 AI Agent 是否足够强大呢? 海量的 Agent 评测方式层出不穷 , ...
Taming Rogue AI Agents with Observability-Driven Evaluation — Jim Bennett, Galileo
AI Engineer· 2025-06-27 10:27
[Music] So I'm here to talk about taming rogue AI agents but essentially want to talk about uh evaluation driven development observability driven but really why we need observability. So, who uses AI? Is that Jim's stupid most stupid question of the day? Probably. Who trusts AI? Right. If you'd like to meet me after, I've got some snake oil you might be interested in buying. Yeah, we do not trust AI in the slightest. Now, different question. Who reads books? That's reading books. If you want some recommenda ...