Experimentation

Search documents
Learning beyond classrooms | Adarsh Kumar Singh | TEDxYouth@SunbeamBhagwanpur
TEDx Talks· 2025-09-10 16:20
uh when it comes to the uh things that both of the co-hosts talked about. So majorly in today's talk I'll be covering three things major starting with first of all so often when we are in school because because I'm mostly catering uh to an audience from school I'll be keeping it specifically for them when we are in school specifically we often are told that we should work on a lot of things if you want to be an entrepreneur often because of Shark Tank everyone wants to create something new they want to put ...
The neurobiology of sport | Natalia Stefańska | TEDxTrilo Youth
TEDx Talks· 2025-08-14 16:09
Decision Making & Well-being - The paradox of choice suggests that more options can lead to less satisfaction with decisions [1][2] - Individuals make an average of 35,000 decisions daily, potentially leading to concentration and emotional regulation difficulties [2][3] - 70% of young people feel pressure regarding their career path before the age of 20 [4] - Boredom activates the brain's "default mode network," which is crucial for concentration, emotional regulation, and creative problem-solving [5][6] - Studies suggest that even 10 to 15 minutes of daily disconnection can improve positive emotional abilities and creative solutions [8] Strategies for Enhanced Focus & Creativity - Techniques like the 54321 method can help ground individuals in the present moment amidst racing thoughts [8][9][10] - Engaging in monotonous manual activities can increase creativity by 28% by allowing the brain to enter a "diffuse thinking" mode [11] - Scheduling time for boredom is presented as a luxury that allows individuals to recharge and reconnect with their desires [11][12] Embracing Experimentation & Action - Action is more important than striving for perfection; life should be approached as an experiment [16] - Viewing life as an experiment reduces the fear of failure, as setbacks are a natural part of progress [17] - Building trust in oneself through action strengthens character, which is a valuable asset [18] - It's more important to find something that resonates with you than to find a "passion" [19] - Life is not a quiz; it's about testing and experimenting [21][22]
How to look at your data — Jeff Huber (Choma) + Jason Liu (567)
AI Engineer· 2025-08-06 16:22
Retrieval System Evaluation - Industry should prioritize fast and inexpensive evaluations (fast evals) using query and document pairs to enable rapid experimentation [7] - Industry can leverage LLMs to generate queries, but should focus on aligning synthetic queries with real-world user queries to avoid misleading results [9][11] - Industry can empirically validate the performance of new embedding models on specific data using fast evals, rather than relying solely on public benchmarks like MTeb [12] - Weights & Biases chatbot analysis reveals that the original embedding model (text embedding three small) performed the worst, while voyage 3 large model performed the best, highlighting the importance of data-driven evaluation [17][18] Output Analysis and Product Development - Industry should extract structured data from user conversations (summaries, tools used, errors, satisfaction, frustration) to identify patterns and inform product development [28][29] - Industry can use extracted metadata to find clusters and identify segments for targeted improvements, similar to how marketing uses user segmentation [29][26] - Cura library enables summarization, clustering, and aggregation of conversations to compare evals across different KPIs, helping to identify areas for improvement [32] - Industry should focus on providing the right infrastructure and tools to support AI agents, rather than solely focusing on improving the AI itself [39] - Industry should define evals, find clusters, and compare KPIs across clusters to make informed decisions on what to build, fix, and ignore [40][41] - Industry should monitor query types and performance over time to understand how the product is being used and identify opportunities for improvement [45]
Practical tactics to build reliable AI apps — Dmitry Kuchin, Multinear
AI Engineer· 2025-08-03 04:34
Core Problem & Solution - Traditional software development lifecycle is insufficient for AI applications due to non-deterministic models, requiring a data science approach and continuous experimentation [3] - The key is to reverse engineer metrics from real-world scenarios, focusing on product experience and business outcomes rather than abstract data science metrics [6] - Build evaluations (evals) at the beginning of the process, not at the end, to identify failures and areas for improvement early on [14] - Continuous improvement of evals and solutions is necessary to reach a baseline benchmark for optimization [19] Evaluation Methodology - Evaluations should mimic specific user questions and criteria relevant to the solution's end goal [7] - Use Large Language Models (LLMs) to generate evaluations, considering different user personas and expected answers [9][11] - Focus on the details of each evaluation failure to understand the root cause, whether it's the test definition or the solution's performance [15] - Experimentation involves changing models, logic, prompts, or data, and continuously running evaluations to catch regressions [16][18] Industry Specific Examples - For customer support bots, measure the rate of escalation to human support as a key metric [5] - For text-to-SQL or text-to-graph database applications, create a mock database with known data to validate expected results [22] - For call center conversation classifiers, use simple matching to determine if the correct rubric is applied [23] Key Takeaways - Evaluate AI applications the way users actually use them, avoiding abstract metrics [24] - Frequent evaluations enable rapid progress and reduce regressions [25] - Well-defined evaluations lead to explainable AI, providing insights into how the solution works and its limitations [26]
Why your product needs an AI product manager, and why it should be you — James Lowe, i.AI
AI Engineer· 2025-07-28 19:53
[Music] Hi everyone. Thanks for that welcome. Uh, as you just heard, my name is James Low.I'm head of AI engineering at the Incubator for AI. We're a small team of experts uh, in the UK government. We were created by 10 Downing Street to deliver public good using AI and we do that via experimentation and product building.The UK government delivers uh for its citizens. It spends over a trillion pounds delivering for its over 70 million citizens. So there's a lot to play for.At the incubator for AI, uh we del ...
Break the Box: Turning Setbacks into Creative Success | Shivathmaj Shenoy | TEDxKIS Pune Youth
TEDx Talks· 2025-07-24 15:59
Career & Personal Development - Individuals often face pressure to specialize despite having diverse interests and talents [1] - Societal expectations and parental influence can lead individuals to pursue careers that don't align with their passions [2][3][4] - Choosing a career path should prioritize personal fulfillment and passion over societal approval or perceived prestige [5][6] - Experimentation and exploration of different skills and opportunities can lead to unexpected career paths and financial success [7][8] - Sharing knowledge and helping others can enhance one's own learning and lead to personal satisfaction [9][10][11] - It's important to recognize when a job is not fulfilling and to seek opportunities for growth and passion [12][13][15] Entrepreneurship & Business - Building one's own business allows for greater creative freedom and control over one's career [16] - Support from friends and a belief in one's abilities can be crucial in taking the leap into entrepreneurship [16] - A content marketing agency can be built by focusing on storytelling and working with talented individuals [17] - Success can arise from unexpected circumstances, such as a failed exam [18] - Embracing experimentation, failure, and continuous learning is essential for building a fulfilling life and career [19]