Workflow
LLM (Large Language Model)
icon
Search documents
X @Avi Chawla
Avi Chawla· 2026-03-12 07:25
How to build a RAG app on AWS!The visual below shows the exact flow of how a simple RAG system works inside AWS, using services you already know.At its core, RAG is a two-stage pattern:- Ingestion (prepare knowledge)- Querying (use knowledge)Below is how each stage works in practice.> Ingestion: Turning raw data into searchable knowledge- Your documents live in S3 or any internal data source.- Whenever something new is added, a Lambda ingestion function kicks in.- It cleans, processes, and chunks the file i ...
Hidden Pitfalls in ORCL Earnings & CRWV "Pure Play" Counterpoint
Youtube· 2026-03-11 18:00
Core Insights - Oracle's stock has seen a significant increase of approximately 10% following a strong earnings report, indicating positive market sentiment [1][17] - The company reported a backlog of $553 billion, reflecting robust demand in the LLM (Large Language Model) sector, which is expected to grow rapidly [2][6] - Oracle's cloud revenue growth is highlighted at 44%, although overall revenue growth of 22% has been viewed as less impressive [3][4] Financial Performance - Oracle's total revenue guidance for fiscal 2027 is projected to reach $90 billion, showcasing strong future expectations [6] - The company plans to raise $50 billion in financing in calendar year 2026, which may alleviate some investor concerns regarding capital expenditures [7] - The remaining performance obligation (RPO) backlog has increased by 325%, indicating potential for future revenue growth, although customer concentration risks exist [14][16] Market Position and Competition - Oracle's capital expenditures currently exceed its revenue, raising questions about the sustainability of its spending strategy [4][9] - Comparisons with Corewe, a competitor, reveal a significant valuation discrepancy, with Oracle's enterprise value at $500 billion compared to Corewe's $50 billion, despite both companies operating in similar markets [12][13] - The market's perception of Oracle's financials is complicated by its diverse business units and revenue classification methods, making it challenging to assess true performance [12]
Cerence(CRNC) - 2026 Q1 - Earnings Call Transcript
2026-02-04 22:30
Financial Data and Key Metrics Changes - The company reported revenue of $115.1 million for Q1 2026, a significant increase of 126% from $50.9 million in the prior year period [16] - Adjusted EBITDA for the quarter was $44.6 million, representing a margin of 39%, compared to $1.4 million or 3% in the prior year [20] - The company generated record quarterly free cash flow of $35.6 million, marking a strong performance in cash generation [4][22] - GAAP net loss for the quarter was $5.2 million, an improvement from a net loss of $24.3 million in the same quarter last year [21] Business Line Data and Key Metrics Changes - Variable license revenue was $30.5 million, up 34% year-over-year, driven by steady customer utilization and adoption across core programs [17] - Fixed license revenue was $7.8 million, with a timing difference affecting year-over-year comparisons [17] - Connected services revenue was $14.5 million, up 6% year-over-year, with a potential increase of over 20% when excluding a prior year true-up benefit [18] Market Data and Key Metrics Changes - Approximately 11.9 million cars produced in the quarter included Cerence technology, flat compared to the prior year [22] - The number of connected cars shipped grew by 14% on a trailing 12-month basis, indicating strong momentum in vehicle connectivity [23] - 51% of worldwide auto production included Cerence technology, consistent with historical penetration rates [23] Company Strategy and Development Direction - The company has three key priorities for 2026: advancing business through leading technology, maintaining cost diligence, and driving top-line growth [4] - The introduction of new AI agents expands the company's reach beyond in-vehicle experiences into broader areas of the automotive ecosystem [8] - The company aims to increase adoption of Cerence XUI and drive greater penetration of its stack in existing programs, which is expected to deliver increased per unit pricing (PPU) [9] Management's Comments on Operating Environment and Future Outlook - Management expressed confidence in the company's technology and customer momentum, indicating a solid foundation for long-term sustainable growth [14][25] - The company reaffirmed its full-year guidance for fiscal 2026, expecting revenue between $300 million and $320 million and adjusted EBITDA between $50 million and $70 million [25] - Management highlighted the positive reception of new products and the potential for growth in non-automotive businesses, with expected revenue impacts starting in late fiscal year 2026 [13] Other Important Information - The company recorded $49.5 million in patent license revenue from the resolution of litigation with Samsung, marking a significant milestone in its IP monetization strategy [14][18] - The company repurchased $30 million in principal value of its 2028 convertible notes, demonstrating a commitment to deleveraging its balance sheet [22] Q&A Session Summary Question: Interest in the mobile work agent and its impact on ARPU - Management indicated strong demand for the mobile work agent, which can be implemented in existing vehicles, potentially increasing ARPU [28][30] Question: Impact of new signings on TTM billings and backlog - Management confirmed that new signings would contribute to TTM billings and backlog, with revenue recognition starting as vehicles are produced and sold [32][36] Question: Trends in usage of existing in-car connected systems - Management noted that usage of connected systems has improved with newer technology, particularly with the introduction of LLMs [37] Question: Competitive process for new wins and future win rates - Management highlighted that technology capability and team confidence are key differentiators in winning contracts, with a strong pipeline of interest from both Western and Chinese OEMs [41][53]
野村 2025 年投资论坛:AI 相关主题 -与 Sakana AI 及 Ibiden 的对话-Nomura Investment Forum 2025_ AI-related themes_ Conversation with Sakana AI and Ibiden
野村· 2025-12-08 00:41
Investment Rating - The report assigns a "Buy" rating to Ibiden, indicating a positive outlook for the company's performance in the market [1]. Core Insights - The AI industry is expected to see sustainable growth through application-specific custom AI and packaging technologies that lower inference costs [1]. - Sakana AI focuses on developing innovative algorithms and application-specific AI models, aiming for sustainable growth by specializing in niche areas [2]. - Ibiden is increasing its market share in the custom ASIC packaging sector, with manufacturing difficulties for ASICs approaching those of GPUs, which may lead to a broader customer base for AI accelerator packages [3][5]. Summary by Sections Sakana AI - Sakana AI, founded by notable figures from Google Brain and Mercari Europe, specializes in advanced algorithms and application-specific AI models [2]. - The company aims to improve algorithms efficiently through an evolutionary process rather than relying on large-scale computing resources [2]. - Concerns about sustainability in ultra-large model learning persist, as only a few companies can maintain the necessary infrastructure [2]. Ibiden - Ibiden has refined its manufacturing technologies for large packages since the mid-2010s, with expectations of increased manufacturing difficulty for custom ASICs and GPUs [3]. - The company is likely to expand its customer base for AI accelerator packages due to its capability to stably mass produce EMIB-T, a technology that may see wider adoption by 2027-28 [3]. - Technological advancements in providing LLM inference with stable quality at lower prices are expected to enhance Ibiden's competitive advantage in the market [3][5].
X @Avi Chawla
Avi Chawla· 2025-10-12 19:29
Core Problem of Traditional RAG - Most retrieved chunks in traditional RAG setups do not effectively aid the LLM, leading to increased computational costs, latency, and context processing [1][5] - Classic RAG involves fetching similar chunks from a vector database and directly inputting the retrieved context into the LLM [5] REFRAG Solution by Meta AI - Meta AI's REFRAG introduces a novel approach by compressing and filtering context at a vector level, focusing on relevance [1][2] - REFRAG employs chunk compression, relevance policy (RL-trained), and selective expansion to process only essential information [2] - The process involves encoding documents, finding relevant chunks, using a relevance policy to select chunks, and concatenating token-level representations [3][4] Performance Metrics of REFRAG - REFRAG outperforms LLaMA on 16 RAG benchmarks, demonstrating enhanced performance [5][7] - REFRAG achieves 30.85x faster time-to-first-token, significantly improving processing speed [5][7] - REFRAG handles 16x larger context windows, allowing for more extensive information processing [5][7] - REFRAG utilizes 2-4x fewer tokens, reducing computational resource consumption [5][7] - REFRAG leads to no accuracy loss across RAG, summarization, and multi-turn conversation tasks [7]
Taboola.com (TBLA) FY Conference Transcript
2025-08-12 18:15
Summary of Taboola.com (TBLA) FY Conference Call - August 12, 2025 Company Overview - **Company**: Taboola.com (TBLA) - **Industry**: Performance Advertising in the Open Web - **Market Opportunity**: $55 billion market opportunity in performance advertising [4][5] Core Business Model - **Unique Offering**: Taboola is a leading performance advertising platform that complements search and social advertising by providing targeted ads based on first-party data [3][4] - **Daily Reach**: The company reaches approximately 600 million people daily through partnerships with major publishers like Yahoo, Apple News, Disney, and NBC [4] - **Revenue Goals**: Targeting $2 billion in revenue from a $55.7 billion market, with over $200 million in adjusted EBITDA, representing a margin of over 30% [5] Financial Performance - **EBITDA Margin**: The company maintains a strong EBITDA margin of over 30% and a free cash flow of 70% of EBITDA, which is being used for share buybacks [5][67] - **Share Buybacks**: Taboola has repurchased 12% of its shares in the first half of the year and plans to continue aggressive buybacks [5][69] Market Position and Strategy - **Two-Sided Marketplace**: Taboola operates a two-sided marketplace with exclusive long-term relationships with 11,000 publishers, providing predictable inventory and access to consumer data [6][7] - **Shift to Performance Marketing**: The introduction of the Realize product marks a pivot towards broader performance marketing, allowing advertisers to use various ad formats beyond native advertising [12][14] - **Display Advertising Market**: Taboola estimates a $10 billion display ad market among its publishers, aiming to capture 30% market share [18] Growth and Future Outlook - **Growth Strategy**: The company aims to double its revenue from $2 billion to $4 billion primarily through increased demand and spending from advertisers [15][25] - **Realize Product Adoption**: Early signs of success with Realize include 650 advertisers trying the product, with existing advertisers increasing their spending [27][28] - **Focus on Performance Advertising**: Taboola is committed to performance advertising, avoiding branding-focused areas like CTV, which is seen as a competitive and less favorable market [36][39] Challenges and Market Dynamics - **Native Advertising Growth**: The native advertising space is not growing as expected, prompting the shift to a broader performance advertising strategy [22][23] - **Impact of Search Traffic**: Currently, only 5% of Taboola's traffic is driven by search, and the company has not seen significant impacts from changes in search dynamics [48][49] Technology and Innovation - **Use of AI and LLMs**: Taboola is leveraging machine learning and large language models (LLMs) across various departments to enhance productivity and create value [65][66] - **Predictive Audiences**: The company is developing features like predictive audiences to help advertisers optimize their campaigns [64] Conclusion - **Investment Philosophy**: Taboola prioritizes growth while maintaining profitability, with a focus on responsible business practices and maximizing shareholder value through buybacks and strategic investments [56][68] - **Future Expectations**: The company is optimistic about returning to double-digit growth through the successful adoption of Realize and continued investment in technology and partnerships [25][60]