DeepSeek-V3.1 Released
Core Insights
- DeepSeek has officially released DeepSeek-V3.1, which brings significant upgrades in architecture and performance [1]

Group 1: Key Features of DeepSeek-V3.1
- Hybrid reasoning architecture: a single model supports both thinking and non-thinking modes [1]
- Higher thinking efficiency: DeepSeek-V3.1-Think produces answers in less time than DeepSeek-R1-0528 [1]
- Stronger agent capabilities: post-training optimization markedly improves the new model's performance on tool use and agent tasks [1]
DeepSeek-V3.1 Officially Released: A First Step Toward the Agent Era
Hua Er Jie Jian Wen· 2025-08-21 06:39
Group 1
- DeepSeek officially released DeepSeek-V3.1, featuring a hybrid reasoning architecture that supports both thinking and non-thinking modes [1]
- The new version, DeepSeek-V3.1-Think, offers higher thinking efficiency, producing answers in less time than DeepSeek-R1-0528 [1]
- Agent capabilities have been enhanced through post-training optimization, with significantly better performance on tool use and agent tasks [1]

Group 2
- Starting September 6, 2025, API pricing on the DeepSeek open platform will be adjusted: input priced at 0.5 yuan per million tokens on a cache hit and 4 yuan per million tokens on a cache miss, with output priced at 12 yuan per million tokens (a cost-estimation sketch follows below) [1]
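To make the new pricing concrete, here is a minimal cost-estimation sketch. It uses the per-million-token rates quoted above; the cache-hit rate and token volumes in the example are hypothetical, not DeepSeek figures.

```python
# Back-of-envelope cost estimate under the DeepSeek API pricing quoted above.
# Rates are yuan per million tokens; the workload numbers are made up for illustration.

RATE_INPUT_CACHE_HIT = 0.5   # input tokens served from cache
RATE_INPUT_CACHE_MISS = 4.0  # input tokens not in cache
RATE_OUTPUT = 12.0           # output tokens

def estimate_cost(input_tokens: int, output_tokens: int, cache_hit_rate: float) -> float:
    """Return the estimated cost in yuan for the given traffic."""
    hit = input_tokens * cache_hit_rate
    miss = input_tokens - hit
    return (hit / 1e6 * RATE_INPUT_CACHE_HIT
            + miss / 1e6 * RATE_INPUT_CACHE_MISS
            + output_tokens / 1e6 * RATE_OUTPUT)

if __name__ == "__main__":
    # Hypothetical month: 200M input tokens, 50M output tokens, 60% cache hits.
    # 120M * 0.5/1M + 80M * 4/1M + 50M * 12/1M = 60 + 320 + 600 = 980 yuan.
    print(f"{estimate_cost(200_000_000, 50_000_000, 0.6):.2f} yuan")
```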
DeepSeek-V3.1 Officially Released
Di Yi Cai Jing· 2025-08-21 06:37
According to DeepSeek's official WeChat account, DeepSeek-V3.1 has been officially released. The upgrade includes the following main changes: a hybrid reasoning architecture, with one model supporting both thinking and non-thinking modes; higher thinking efficiency, with DeepSeek-V3.1-Think producing answers in less time than DeepSeek-R1-0528; and stronger agent capabilities, with post-training optimization delivering a marked improvement on tool use and agent tasks. The official app and web versions have been upgraded to DeepSeek-V3.1 in step with the release, and users can switch freely between thinking and non-thinking modes via the "Deep Thinking" button. (Source: Di Yi Cai Jing)
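For API users, the same mode switch is commonly expressed by choosing between two model names on an OpenAI-compatible endpoint. The sketch below assumes DeepSeek's OpenAI-compatible API at https://api.deepseek.com and the identifiers deepseek-chat (non-thinking) and deepseek-reasoner (thinking); treat the endpoint and names as assumptions to verify against the official documentation.

```python
# Minimal sketch: calling the hybrid model in non-thinking vs. thinking mode.
# Assumption: the OpenAI-compatible endpoint and the model names below map to V3.1.
from openai import OpenAI

client = OpenAI(api_key="YOUR_DEEPSEEK_API_KEY", base_url="https://api.deepseek.com")

def ask(prompt: str, thinking: bool = False) -> str:
    # Switching the model name is what toggles the two modes of the hybrid model.
    model = "deepseek-reasoner" if thinking else "deepseek-chat"
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content

print(ask("Summarize the V3.1 release in one sentence."))       # fast, non-thinking
print(ask("Prove that sqrt(2) is irrational.", thinking=True))  # slower, thinking mode
```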
DeepSeek and Yushu Technology (Unitree) Make the 2025 Fortune China Top 50 Tech Companies List
Feng Huang Wang· 2025-08-21 05:21
Core Insights - The "Fortune China Top 50 Technology Companies" list was released, featuring companies like Huawei, DeepSeek, and Yushu Technology [1] Group 1: DeepSeek - DeepSeek is recognized as a leading AI large model product in China, with its DeepSeek-R1 model scoring 88.5 on the MMLU benchmark test, which is lower than OpenAI's GPT-4 (92.0) and Google's Gemini Pro (90.0), but higher than Meta's Llama 3 (82.0) and Anthropic's Claude 2 (85.1) [1] - DeepSeek ranks among the top 10 globally in terms of open-source large model downloads, indicating strong market presence [1] - As of June 2025, DeepSeek is projected to have 163 million monthly active users, making it the leading application in AI-generated content globally [1] Group 2: Yushu Technology - In 2024, Yushu Technology achieved global sales of 18,000 quadruped robots, capturing a 23% market share, ranking second only to Boston Dynamics [1] - Yushu Technology was awarded the WIPO 2025 Global Award, distinguishing it as the only representative from China among 780 applicants from 95 countries and regions [1] - The company's success is attributed to innovations in robotic motion control, high-performance joint motors, and real-time systems, along with a comprehensive global intellectual property strategy [1]
DeepSeek Updates Again, While Everyone Waits for Liang Wenfeng's Next Showstopper
Hu Xiu· 2025-08-21 02:28
Core Insights
- DeepSeek has released an updated version of its model, V3.1, which shows significant improvements in context length and user interaction, although it is not the highly anticipated R2 model [2][4][14]
- The model now supports a context length of 128K, enhancing its ability to handle longer texts and improving its programming capabilities [5][10]
- The update merges the functionalities of V3 and R1, leading to reduced deployment costs and improved efficiency [13][25]

Group 1: Model Improvements
- The new V3.1 model has 685 billion parameters, only a slight increase over the previous V3, which had 671 billion [7]
- User experience has been enhanced with more natural language responses and the use of tables for presenting information [8][10]
- V3.1's programming capabilities have been validated through tests, achieving a score of 71.6% in multi-language programming and outperforming Claude 4 Opus [10]

Group 2: Market Context
- The release of V3.1 comes seven months after the launch of R1, during which time other major companies have released new models using R1 as a benchmark [3][16]
- Despite the improvements in V3.1, the industry is still eagerly awaiting the R2 model, which has not been announced [4][20]
- The competitive landscape includes companies such as Alibaba and ByteDance, which have launched models that claim to surpass DeepSeek R1 on various metrics [17][19]

Group 3: Future Outlook
- There are indications that merging V3 and R1 may be a preparatory step toward the release of a multi-modal model [25]
- Industry insiders suggest the focus of future models will shift toward innovations in economic viability and usability [24]
- The absence of R2 in the current update has heightened expectations for its eventual release, with speculation that it may not arrive until later [21][22]
DeepSeek Updates Again, While Everyone Waits for Liang Wenfeng's Next Showstopper
Xin Lang Ke Ji· 2025-08-21 00:52
Core Viewpoint
- The recent upgrade of DeepSeek to version 3.1 brings significant improvements in context length and user interaction, while merging features from previous models to reduce deployment costs [1][11][12]

Group 1: Model Improvements
- DeepSeek V3.1 now supports a context length of 128K, enhancing its ability to handle longer texts [4]
- The model's parameter count increased only slightly, from 671 billion to 685 billion, yet the user experience has improved noticeably [5]
- The model's programming capabilities stand out, achieving a score of 71.6% in multi-language programming tests and outperforming Claude 4 Opus [7]

Group 2: Economic Efficiency
- Merging the V3 and R1 models reduces deployment costs, requiring only 60 GPUs instead of the previous 120 [12]
- Developers noted that performance could improve 3-4x with the new model thanks to the larger cache [12]
- The open-source release of DeepSeek V3.1-Base on Hugging Face signals a move toward greater accessibility and collaboration in the AI community [13]

Group 3: Market Context
- The AI industry is closely watching DeepSeek's next moves, especially given the absence of the anticipated R2 model [19]
- Competitors such as OpenAI, Google, and Alibaba have released new models, using R1 as a benchmark for their advances [1][15]
- The market is eager for DeepSeek's next step, particularly a potential multi-modal model following the V3.1 update [23]
Hands-On with DeepSeek's Quietly Launched New Model: Coding Can Take On Claude 4, but Writing... Forget It
36Kr · 2025-08-20 12:14
Core Insights
- DeepSeek has officially launched and open-sourced its new model, DeepSeek-V3.1-Base, following the release of GPT-5, even though R2 has not yet been released [1]
- The new model has 685 billion parameters and supports multiple tensor types, with significant optimizations in inference efficiency and an expanded 128K context window [1]

Model Performance
- Initial tests show that DeepSeek V3.1 achieved a score of 71.6% on the Aider Polyglot programming benchmark, outperforming other open-source models as well as the closed-source Claude 4 Opus [5]
- The model successfully processed a long text and provided relevant literary recommendations, demonstrating its ability to handle complex queries [4]
- In programming tasks, DeepSeek V3.1 generated code that handled collision detection effectively and included realistic physical properties, showcasing its advanced programming capabilities [8]

Community and Market Response
- Hugging Face CEO Clément Delangue noted that DeepSeek V3.1 quickly climbed to fourth place on the trending chart and later reached second, indicating strong market interest [79]
- The update removed the "R1" label from the deep-thinking mode and introduced native "search token" support, enhancing search functionality [79][80]

Future Developments
- The company plans to discontinue the mixed thinking mode in favor of training separate Instruct and Thinking models to ensure higher-quality outputs [80]
- As of the latest update, the model card for DeepSeek-V3.1-Base had not yet been released, but further technical details are anticipated [81]
After the DeepSeek V3.1 Release, Investors Should Consider These Four Questions That Will Shape the Future
36Kr · 2025-08-20 10:51
Core Insights
- DeepSeek has quietly launched its new V3.1 model, which has generated significant buzz in both the tech and investment communities thanks to its impressive performance metrics [1][2][5]
- The V3.1 model outperformed the previously dominant Claude Opus 4 in programming, achieving a score of 71.6% on the Aider programming benchmark [2]
- The cost efficiency of V3.1 is notable: a complete programming task costs approximately $1.01, making it roughly 68 times cheaper than Claude Opus 4 [5]

Group 1: Performance and Cost Advantages
- The V3.1 model's programming capabilities have surpassed those of Claude Opus 4, a significant milestone in the open-source model landscape [2]
- Completing a programming task with V3.1 costs only about $1.01, a drastic reduction compared with competitors that signals a strong cost advantage [5]

Group 2: Industry Implications
- The emergence of V3.1 raises questions about the future dynamics between open-source and closed-source models, particularly the erosion and reconstruction of competitive moats [8]
- A "hybrid model" approach is becoming prevalent among enterprises, combining privately deployed, fine-tuned open-source models with powerful closed-source models for complex tasks [8][9]

Group 3: Architectural Innovations
- The removal of the "R1" designation and the introduction of new tokens in V3.1 suggest a possible exploration of "hybrid reasoning" or "model routing" architectures, which could have significant commercial implications [11]
- A "hybrid architecture" aims to optimize inference costs by using a lightweight scheduling model to route tasks to the most suitable expert models, potentially improving unit economics (a toy routing sketch follows this summary) [12]

Group 4: Market Dynamics and Business Models
- The drastic reduction in inference costs could transform AI application business models, shifting from per-call or per-token billing to more stable subscription models [13]
- As foundational models become commoditized through open-source competition, profit distribution in the value chain may shift toward the application and solution layers, raising the value of high-quality private data and industry-specific expertise [14]

Group 5: Future Competitive Landscape
- The next competitive battleground will be "enterprise readiness" (stability, predictability, security, and compliance) rather than performance metrics alone [15]
- Companies that can provide comprehensive solutions, including models, toolchains, and compliance frameworks, are likely to dominate the trillion-dollar enterprise market [15]
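As an illustration of the "model routing" idea described in Group 3, here is a minimal sketch in which a cheap heuristic decides whether a request goes to a fast non-thinking model or an expensive reasoning model. The model names, keyword heuristic, and cost figures are hypothetical placeholders, not details of DeepSeek's actual architecture.

```python
# Toy sketch of a "model routing" / hybrid-architecture setup: a lightweight router
# sends easy requests to a cheap fast model and hard ones to a costlier reasoning model.
# All names, thresholds, and costs below are hypothetical, for illustration only.
from dataclasses import dataclass

@dataclass
class Expert:
    name: str
    cost_per_1k_tokens: float  # hypothetical cost in yuan

FAST = Expert("fast-non-thinking", cost_per_1k_tokens=0.004)
REASONER = Expert("slow-thinking", cost_per_1k_tokens=0.012)

HARD_HINTS = ("prove", "debug", "step by step", "optimize", "refactor")

def route(prompt: str) -> Expert:
    # A production router would be a small trained classifier; this keyword
    # heuristic merely stands in for that scheduling decision.
    text = prompt.lower()
    if len(text) > 2000 or any(hint in text for hint in HARD_HINTS):
        return REASONER
    return FAST

if __name__ == "__main__":
    for p in ["Translate 'hello' into French.",
              "Prove that there are infinitely many primes, step by step."]:
        expert = route(p)
        print(f"{expert.name:<18} handles: {p}")
```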
Altman: DeepSeek and Kimi Are a Major Reason OpenAI Went Open Source
Huan Qiu Wang Zi Xun· 2025-08-20 08:21
Core Viewpoint
- OpenAI founder Sam Altman believes the U.S. is underestimating the threat posed by China's next-generation artificial intelligence, and that chip regulations alone are not an effective solution [1][3]

Group 1: AI Competition
- Altman said China can move faster on reasoning capabilities and has strengths in research and product development [3]
- The AI competition between the U.S. and China is deeply intertwined and goes beyond simple rankings [3]

Group 2: OpenAI's Strategic Shift
- OpenAI recently released its first open-weight models, gpt-oss-120b and gpt-oss-20b, marking a significant shift from its long-standing closed-source approach [3]
- The decision to release open-weight models was influenced by competition from Chinese models such as DeepSeek and Kimi K2 [3]
- Altman emphasized that, had OpenAI not acted, Chinese open-source models would have gained widespread adoption, making this a significant factor in the decision [3]
Morning Briefing 0820 | T-Link Gen3 Technology Roadmap Reshapes the Supply Chain, DeepSeek Model Upgraded to V3.1…
Xin Lang Cai Jing· 2025-08-20 05:17
Group 1: Photovoltaic Industry
- The Ministry of Industry and Information Technology (MIIT) is actively coordinating between power generation companies and local industries to improve price transmission from manufacturing to power stations, emphasizing a market-oriented, law-based approach to eliminating outdated production capacity [1]
- The average bidding price for components from China Resources and China Huadian has increased 5-8% month-on-month, while silicon material companies have proactively limited production, leading to a 10% decrease in silicon wafer inventory over two weeks [1]
- The suggested investment sequence: tight silicon supply in Q3, a premium for BC battery technology in Q4, and simultaneous volume and price increases for auxiliary materials such as glass and adhesive films [1][2]

Group 2: Solid-State Battery Technology
- A breakthrough in solid-state battery technology has been achieved with 5μm vapor-deposited lithium anodes, significantly reducing dendrite risk and achieving more than 500 cycles with capacity retention above 90% [3]
- The cost of 5μm vapor-deposited lithium is projected to drop to 2 million yuan per GWh, versus 4 million yuan for 20μm rolled lithium foil, a substantial cost reduction for the industry [3]
- The solid-state battery market could reach 50-100 billion yuan by 2030, driven by demand for 100 GWh of global solid-state battery production [3]

Group 3: Robotics Industry
- The T-Link Gen3 technology is reshaping the supply chain with a focus on lightweight materials, energy efficiency, and sensor integration, triggering re-tendering of motors, reducers, and lead screws [4]
- PEEK materials have cut costs by 30% compared with imports, and the new harmonic magnetic-field motors are 50% smaller while doubling power density [5]
- Orbbec's 3D vision solution carries a per-unit value of 200 USD, and the company has passed factory audits [6]

Group 4: Semiconductor and AI Models
- The DeepSeek model has been upgraded to V3.1, expanding the context length from 64K to 128K, which is expected to increase demand for GPU memory and HBM (see the back-of-envelope estimate after this briefing) [7]
- The required size of training clusters is expected to rise by about 30%, benefiting semiconductor and storage manufacturers such as Cambricon, Haiguang, and Lanke [7]

Group 5: Pharmaceutical Industry
- Rongchang Biotech has licensed its ophthalmic drug RC28-E to Japan's Santen Pharmaceutical, marking a shift in domestic innovative-drug licensing from popular fields such as oncology to specialized areas with differentiated advantages [8]
- This collaboration model offers a clear path to value realization for less popular biotech sectors through upfront payments, milestones, and sales sharing, improving cash flow and leveraging established commercialization channels [8]

Group 6: High-Speed Rail Industry
- China National Railway Group has launched its second batch of high-speed train tenders this year, covering 210 train sets, a recent high that exceeds market expectations [9]
- This reinforces the trend of a sustained recovery in railway investment, with new construction and maintenance peaks supporting the earnings certainty of core companies in the sector [9]
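To put rough numbers behind the GPU-memory point in Group 4, here is a generic KV-cache estimate for a standard transformer with grouped-query attention. The layer count, head counts, and precision are hypothetical placeholders and do not describe DeepSeek-V3.1, which uses its own compressed (MLA-style) cache; the sketch only illustrates that cache size scales linearly with context length, so going from 64K to 128K roughly doubles per-sequence cache memory.

```python
# Rough KV-cache size for a generic transformer with grouped-query attention.
# All model dimensions below are hypothetical and chosen only to show the scaling.

def kv_cache_bytes(seq_len: int, layers: int = 60, kv_heads: int = 8,
                   head_dim: int = 128, bytes_per_elem: int = 2) -> int:
    # Factor of 2 accounts for storing both keys and values at every layer.
    return 2 * layers * kv_heads * head_dim * seq_len * bytes_per_elem

for ctx in (64 * 1024, 128 * 1024):
    gib = kv_cache_bytes(ctx) / 2**30
    print(f"{ctx // 1024}K context -> ~{gib:.0f} GiB of KV cache per sequence")
# 64K -> ~15 GiB, 128K -> ~30 GiB: doubling the context doubles cache memory,
# which is why longer contexts push up GPU memory and HBM requirements.
```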