Today's Top Episodes

How Humans Will Earn in the AI Economy with Jordan Gray from Public AI

Will AI steal your job or create new opportunities? Discover how humans will thrive in the evolving AI economy.

Weaviate and SAS with Saurabh Mishra and Bob van Luijt - Weaviate Podcast #129!

Duration: 00:43:55
October 13, 2025
  • The evolution of AI from retrieval to RAG to agents reflects a growing need for enterprises to adapt general-purpose language models to their specific, unstructured data.
  • Key challenges for AI adoption in enterprises, such as data readiness and security, remain largely unchanged, despite advancements in AI capabilities.
  • The development of the SAS Retrieval Agent Manager (RAM) prioritizes flexibility, trustworthiness, rapid time-to-value, and performance to address enterprise needs with a no-code interface and comprehensive evaluation tools.
Weaviate's Query Agent with Charles Pierse - Weaviate Podcast #128!

Duration: 01:01:32
September 22, 2025
  • The Weaviate Query Agent's GA release marks a significant step toward a next-generation, natural language interface for database interaction.
  • User feedback from the beta release led to key improvements, including the addition of chat functionality and a retrieval-only search mode.
  • Schema introspection allows the query agent to leverage database metadata, enabling constrained and structured outputs for more accurate and efficient queries.
GEPA with Lakshya A. Agrawal - Weaviate Podcast #127!

Duration: 01:01:55
August 13, 2025
  • GEPA optimizes AI systems in data-scarce environments by leveraging natural language traces to extract more learning signal from a single rollout than traditional methods do.
  • A key innovation is Pareto-based candidate sampling, which maintains a pool of diverse candidate prompts, each excelling on different task instances, to prevent getting stuck in local optima and ensure domain-specific insights are preserved.
  • GEPA enables rapid progress through "coarse-grained jumps" across the optimization landscape. It is positioned to become a text evolution engine for the various text components of AI systems, and is expected to be available in DSPy around the time this podcast airs.
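
The Pareto-based candidate sampling described above can be illustrated with a minimal sketch. This is a simplified illustration rather than GEPA's actual implementation: the function names, the score table, and the choice to weight sampling by the number of instances a candidate wins are all assumptions made for the example.

```python
import random

def pareto_pool(scores):
    """Keep candidates that achieve the best score on at least one task instance.

    scores: dict mapping candidate name -> list of per-instance scores
            (hypothetical data layout; all lists have the same length).
    Returns a dict mapping each surviving candidate to the number of
    instances on which it ties for or holds the best score.
    """
    num_instances = len(next(iter(scores.values())))
    wins = {name: 0 for name in scores}
    for i in range(num_instances):
        best = max(per_instance[i] for per_instance in scores.values())
        for name, per_instance in scores.items():
            if per_instance[i] == best:
                wins[name] += 1
    return {name: count for name, count in wins.items() if count > 0}

def sample_candidate(scores, rng):
    """Sample one candidate from the pool, weighted by its win count (an assumption)."""
    wins = pareto_pool(scores)
    names = list(wins)
    weights = [wins[name] for name in names]
    return rng.choices(names, weights=weights, k=1)[0]

# Hypothetical per-instance scores for three candidate prompts.
example_scores = {
    "prompt_a": [1.0, 0.0, 0.5],
    "prompt_b": [0.0, 1.0, 0.5],
    "prompt_c": [0.2, 0.2, 0.1],
}
pool = pareto_pool(example_scores)
chosen = sample_candidate(example_scores, random.Random(0))
```

In this example `prompt_c` never wins an instance, so it drops out of the pool, while `prompt_a` and `prompt_b` both survive because each is best on a different instance. A selection rule based only on average score would not distinguish them in this way.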
Agentic Topic Modeling with Maarten Grootendorst - Weaviate Podcast #126!

Duration: 01:05:18
July 9, 2025
  • Maarten discusses the benefits of authoring a book with a publisher like O'Reilly, emphasizing collaboration and quality control over the typical solo blog post approach.
  • The conversation delves into the modularity of BERTopic and its evolution with LLMs, highlighting the potential of combining embedding-based methods with the strengths of LLMs while considering the cost and efficiency of reprocessing documents.
  • The podcast explores the challenge of evaluating topic modeling subjectively, especially concerning topic granularity, and the need for user-driven approaches with "human in the loop" agentic frameworks to steer results based on specific use cases.
Sufficient Context with Hailey Joren - Weaviate Podcast #125!

Duration: 00:50:53
July 2, 2025
  • The core idea of sufficient context differs from relevance by evaluating if a model should be able to answer a question given the provided context, considering nuance like multi-hop reasoning.
  • The research surprisingly found that smaller models struggle to use available context, while all models are less likely to abstain when given additional context, even if it's insufficient.
  • Fine-tuning models to restore the ability to abstain after adding retrieval augmentation (RAG) proved difficult, though the surprising effectiveness of fine-tuning only a small number of parameters suggests unlocking latent capabilities rather than teaching new information.
RAG Benchmarks with Nandan Thakur - Weaviate Podcast #124!

Duration: 01:04:46
June 25, 2025
  • The BEIR benchmark was created to bridge the gap between the IR and NLP communities, providing resources for evaluating models on out-of-domain data.
  • FreshStack is a retrieval benchmark that evaluates systems on longer, more complex queries drawn from real-world programming problems, where users dump entire codebases into the query.
  • The future of AI evaluation will likely focus more on domain-specific, grounded question answering, leading to the development of custom models for niche areas.
MUVERA with Rajesh Jayaram and Roberto Esposito - Weaviate Podcast #123!

Duration: 01:13:06
May 28, 2025
  • Rajesh discussed how his background in theoretical computer science, particularly in nearest neighbor search and complex metrics like earth mover's distance, uniquely prepared him for work on multi-vector retrieval.
  • The conversation highlighted the benefits of multi-vector retrieval over single-vector methods, specifically in capturing fine-grained token interactions and enhanced interpretability.
  • Roberto and Rajesh explored the MUVERA algorithm as a cost-efficient approach to multi-vector retrieval, and they detailed compression techniques like product quantization that achieve meaningful performance gains.
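
To make the "fine-grained token interactions" point concrete, here is a minimal sketch of late-interaction (MaxSim-style) scoring, the kind of multi-vector similarity commonly used in ColBERT-style retrieval. It is not the MUVERA algorithm itself, and the toy two-dimensional vectors and function names are assumptions made for the example.

```python
def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def maxsim_score(query_vecs, doc_vecs):
    """Sum, over query token vectors, of the best match among document token vectors."""
    return sum(max(dot(q, d) for d in doc_vecs) for q in query_vecs)

# Hypothetical token embeddings: one vector per token.
query = [[1.0, 0.0], [0.0, 1.0]]
doc_one = [[1.0, 0.0], [0.5, 0.5]]
doc_two = [[0.0, 1.0]]

score_one = maxsim_score(query, doc_one)
score_two = maxsim_score(query, doc_two)
```

Each query token is matched to its best document token, so the per-token maxima show which parts of a document contributed to the score. A single pooled vector per document does not expose that breakdown.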
Patronus AI with Anand Kannappan - Weaviate Podcast #122!

Duration: 01:01:06
May 15, 2025
  • Percival is a new AI companion that helps developers debug agentic AI systems by identifying 60 different kinds of failure modes, including tool calling and planning errors.
  • The discussion highlighted three major challenges in AI workflow evaluation including context explosion, domain adaptation, and the increasing complexity of multi-agent orchestration.
  • Patronus AI believes the future of scalable AI oversight lies in dynamic evaluation, where intelligent AI systems oversee and evaluate other AI agents, moving beyond static datasets and benchmarks.
Haize Labs with Leonard Tang - Weaviate Podcast #121!

Duration: 00:54:15
May 12, 2025
  • Haize Labs is focused on enterprise-grade AI reliability, offering testing and auditing services to frontier labs and AI application developers.
  • Haize Labs simulates user interactions through prompt optimization and customer model creation, solving the cold start problem for AI application testing.
  • Haize Labs' Verdict framework leverages scalable oversight architectures like debate and ensembling to improve the performance of AI judging tasks.
Box AI with Ben Kus and Bob van Luijt

Duration: 00:55:32
May 7, 2025
  • Box serves more than 120,000 enterprise customers and manages trillions of unstructured content objects, which highlights the challenges of scaling cloud content management in a rapidly growing digital landscape.
  • The integration of AI and agents represents a transformative approach for automating complex enterprise tasks, such as retrieving and generating content, ultimately reducing manual effort and improving efficiency.
  • Embedding strategies must balance retrieval effectiveness against storage efficiency so that the cost of managing this data remains sustainable at scale.