AI Lovers

TensorOps

We Let Humans Talk about Machines

Episodes

  1. 02/08/2024

    The Future of AI Infrastructure on the Cloud

    Join us for our latest discussion with Gad Benram and Charles Frye from Modal as they explore the strategic reasons behind companies choosing to host their own AI infrastructure versus relying on external cloud services. From controlling critical data to customizing AI applications, this episode is packed with valuable insights for anyone navigating the complex world of AI deployment. Key topics include: • 00:00 Introduction: Insights on AI Resources for Hosting AI Models • 03:11 The Challenges of Existing Cloud Services • 09:14 Introducing Modal: A Fast and Interactive Development Experience • 15:13 Different Infrastructure Needs for Data Teams • 19:42 Addressing Slowness in AI Services • 26:20 Python and Notebooks for Data Scientists • 33:35 Fast and Seamless Deployment with Modal • 40:46 Future Directions and Closing Remarks In this episode, Gad Benram and Charles Frye discuss the challenges of hosting AI models in production and the limitations of existing cloud services. They highlight the lack of resources and GPUs available for serving AI applications and the slow bootstrapping process. They introduce Modal, a serverless runtime for distributed applications built on top of cloud resources, as a solution to these challenges. Modal offers fast deployment times, interactive development workflows, and support for large-scale models. 🔗 Visit our website for more resources and updates: ⁠⁠https://www.tensorops.ai/⁠⁠ 👥 Connect with us on social media: ⁠⁠Linkedin⁠⁠ ⁠⁠Twitter⁠⁠ 💬 Join our community: ⁠⁠https://www.meetup.com/ai-loves/

    46 min
  2. 23/05/2024

    OpenAI's Search - What's the technology behind the move?

    This episode on the transformative impacts of AI on search technologies features Gad Benram and Gabriel Gonçalves , along with our special guest, Edward Zhou —who has recently led the search ranking team at Notion—and share his experiences and expert insights on AI's impact on search technologies. Key topics include: • 00:00 - Introductions and Opening Remarks • 15:47 - Evaluating Search Systems and Techniques • 20:40 - Scoring Algorithm and Semantic Searching • 24:18 - Vector Space Model and Similarity Limitations • 27:05 - Embedding Models and Relevance Challenges • 32:00 - Addressing Search Bias Mitigation • 36:25 - Evaluating Search Results and Language Models • 41:49 - Language Models and Embedding Technologies Throughout the episode, the team discussed the potential of AI-powered search tools, including the combination of traditional search algorithms with AI-powered language models, and the importance of evaluating search systems based on user actions and business outcomes. They also explored the workings of a scoring algorithm, the relevance of similarity in a vector space, and the challenges and potential solutions in incorporating embedding models for specific business domains. Additionally, they addressed the issues of position and click bias in search results, the difficulties in evaluating search results and language models, and the current and future state of language models and embedding technologies. Finally, they looked into the future of search systems, considering how advancements in AI and embeddings could revolutionize search experiences. 🔗 Visit our website for more resources and updates: ⁠https://www.tensorops.ai/⁠ 👥 Connect with us on social media: ⁠Linkedin⁠ ⁠Twitter⁠ 💬 Join our community: ⁠https://www.meetup.com/ai-loves/ ⁠

    50 min
  3. 07/05/2024

    A Survey of Advanced Prompt Engineering Techniques

    This one-hour session exploring the Secrets of Prompt Engineering , we'll discuss how prompt engineering resembles programming and what common design patterns they share Support the Open-source project! ⭐ us on GitHub: https://github.com/TensorOpsAI/LLMStudio Key topics include: • 0:00 Intro • 2:11 The GenAI Revolution • 4:00 What are prompts • 5:53 Survey of techniques (part I) • 18:33 Demo: Exploring Capabilities with #LLMstudio • 22:40 Survey of techniques (part II) • 30:00 Fitting prompts in modern LLM app’s architectures • 34:20 Analysing different components - Chat • 46:38 Analysing different components - Agent • 54:16 The future of prompt engineering 💲 Struggling with managing costs of LLMs in production? Find out about our workshop here: https://www.tensorops.ai/llm-studio-cost-optimization-workshop 🔗 Visit our website for more resources and updates: https://www.tensorops.ai/ 👥 Connect with us on social media: Linkedin Twitter 💬 Join our community: https://www.meetup.com/ai-loves/ Special Thanks to Stephanie Gardner Founder @Candeo Consulting With more than twenty years of experience working with industry giants like AWS, USAA, and Verizon, Candeo bring a wealth of knowledge and innovative solutions tailored specifically for small to midsize companies. Don't forget to subscribe to our channel for more updates. #TechInnovation #TensorOps #ML #llm #openai #cloud #ai #prompt #promptengineering #prompting #chatgpt #gpt

    56 min

About

We Let Humans Talk about Machines