Programming Throwdown

Patrick Wheeler and Jason Gauci

Monthly
 
Programming Throwdown educates Computer Scientists and Software Engineers on a cavalcade of programming and tech topics. Every show will cover a new programming language, so listeners will be able to speak intelligently about any programming language.
 
High Agency is the podcast for AI builders. If you’re trying to understand how to successfully build AI products with Large Language Models and Generative AI then this podcast is made for you. Each week we interview leaders at companies building on the frontier who have already succeeded with AI in production. We share their stories, lessons and playbooks so you can build more quickly and with confidence. AI is moving incredibly fast and no-one is truly an expert yet, High Agency is for peop ...
 
Your AI Roadmap

Dr. Joan Palmiter Bajorek

Weekly
 
Your AI Roadmap the podcast is on a mission to decrease fluffy HYPE and talk to the people actually building AI. Anyone can build in AI. Including you. Whether you’re terrified or excited, there’s been no better time than today to dive in! Now is the time to be curious and future-proof your career and ... ultimately your income. This podcast isn't about white dudes patting themselves on the back, this is about you and me and ALL the paths into cool projects around the world! What's next on y ...
 
Fireside AI

Catherine Breslin

Weekly
 
AI is transforming our world - in science, healthcare, finance, and beyond. With rapid advancements in large language models, automation, and machine learning, companies are racing to build smarter, more efficient systems that shape the future. But how do you go from an AI idea to a scalable, impactful solution? Building and scaling AI isn’t just about the technology. It’s also about making the right strategic decisions. AI founders need to consider everything from data quality and model per ...
 
VUX World

Kane Simms

Monthly+
 
Interviews with the best brains in AI, sharing how to improve customer experience and business operations using emerging AI technologies such as voice AI, conversational AI, NLP, Large Language Models (LLMs), generative AI and more. We educate business leaders and teams on why and how AI technologies are revolutionising the way consumers engage with businesses and the internet, why that matters and how to implement it properly. “One of the most consistently insightful and deeply respected po ...
 
Explore the exciting World of Legal Tech and Artificial Intelligence with Alphalect.ai. In this podcast we cover everything you need to know about the Legal Tech World, whether it is drafting a patent, the Use of Legal AI, Blockchain, LLM, Machine Learning and so much more! If you want to learn more, you can also visit our Website: https://alphalect.ai/ This Episode was created with AI. The Content is based on curated sources.
 
Welcome to "The Interconnectedness of Things," the podcast where we explore the seamless integration of technology in our modern world. Hosted by Dr. Andrew Hutson and Emily Nava of QFlow Systems, each episode delves into the dynamic interplay of enterprise solutions, innovative software, and the transformative power of technology in various industries. With expert insights, real-world case studies, and thoughtful discussions, "The Interconnectedness of Things" offers a comprehensive look at ...
 
Welcome to 'Crypto for Everyone' with Myles Dhillon Delve into the world of Bitcoin and decentralized finance (DeFi) with Myles Dhillon, your guide to understanding and navigating the digital currency landscape.​ What to Expect: Bitcoin Insights: Stay informed about the latest developments, trends, and analyses in the Bitcoin ecosystem.​ Investment Strategies: Learn practical approaches to investing in Bitcoin and DeFi, tailored for both newcomers and seasoned investors.​ Expert Interviews: ...
 
AI With Friends

AI With Friends LLC

Monthly+
 
Welcome to AI With Friends, your weekly launchpad into the world of Artificial Intelligence. Hosted by Marlon Avery, a pioneer in GenAI innovation, alongside Adrian Green, VP of Engineering at LiveNation, and Sekou Doumbouya, Senior Staff Cloud Systems Engineer, this show is your go-to source for all things AI. Our hosts bring diverse expertise—from AI strategy and tech innovation to industry leadership. Every week, they break down the latest AI trends, interview top experts, and simplify co ...
 
Open Tech Talks is your weekly sandbox for technology: Artificial Intelligence, Generative AI, Machine Learning, Large Language Models (LLMs) insights, experimentation, and inspiration. Hosted by Kashif Manzoor, AI Evangelist, Cloud Expert, and Enterprise Architect, this Podcast combines technology products, artificial intelligence, machine learning overviews, how-to's, best practices, tips & tricks, and troubleshooting techniques. Whether you're a CIO, IT manager, developer, or just curious ...
 
Machine learning and artificial intelligence are dramatically changing the way businesses operate and people live. The TWIML AI Podcast brings the top minds and ideas from the world of ML and AI to a broad and influential community of ML/AI researchers, data scientists, engineers and tech-savvy business and IT leaders. Hosted by Sam Charrington, a sought after industry analyst, speaker, commentator and thought leader. Technologies covered include machine learning, artificial intelligence, de ...
 
Deep Papers is a podcast series featuring deep dives on today’s most important AI papers and research. Hosted by Arize AI founders and engineers, each episode profiles the people and techniques behind cutting-edge breakthroughs in machine learning.
 
In this episode, we explore DeepSeek, a Chinese AI company disrupting the industry with its open-source large language models like DeepSeek-R1, which has made waves for its low training costs and rapid market impact—while also raising concerns about censorship and privacy. We delve into the company's rise, its technology, and the global reaction to its advancements.
 
"Last Week In r/LocalLLaMA" is your weekly roundup of the most interesting discussions, debates, and moments from the r/LocalLLaMA community. Join us for a fun and lighthearted take on the top posts, user opinions, and trending topics. Perfect for keeping up with the conversation, even when you’re short on time.
 
Casual Inference

Lucy D'Agostino McGowan and Ellie Murray

Monthly
 
Keep it casual with the Casual Inference podcast. Your hosts Lucy D'Agostino McGowan and Ellie Murray talk all things epidemiology, statistics, data science, causal inference, and public health. Sponsored by the American Journal of Epidemiology.
 
Stay ahead of the future with Tomorrow’s AI Today, your go-to daily podcast for the latest in artificial intelligence, machine learning, and emerging technology. From groundbreaking AI innovations to policy shifts, ethical debates, and real-world applications, this podcast brings you the top AI news and trends—all in a quick, digestible format. Whether you're a tech professional, AI enthusiast, investor, or simply curious about how AI is shaping the world, Tomorrow’s AI Today delivers fast, ...
 
Prompt & Pixels

Brent McWhirter

Daily+
 
Prompt & Pixels is your ultimate guide to the creative frontier where AI meets artistry. Join us as we explore cutting-edge technologies like large language models (LLMs) and AI-powered image generation. Whether you’re an artist, entrepreneur, or tech enthusiast, discover how to unlock your creative potential with expert insights, deep dives into emerging AI tools, and interviews with industry innovators. From mastering prompts to creating stunning visuals, Prompt & Pixels equips you w ...
 
The Prompt Desk

Justin Macorin, Bradley Arsenault

Monthly
 
Embark on a captivating exploration of Large Language Models (LLMs), prompt engineering, and generative AI with hosts Bradley Arsenault and Justin Macorin. With 25 years of combined machine learning and product engineering experience, they are delving deep into the world of LLMs to uncover best practices and stay at the forefront of AI innovation. Join them in shaping the future of technology and software development through their discoveries in LLMs and generative AI. Podcast website: https ...
 
EPRI Current examines key issues and new R&D impacting the energy transition. Each episode features insights from EPRI, the world’s preeminent independent, non-profit energy research and development organization, and from other energy industry leaders. We also discuss how innovative technologies are shaping the global energy future. Learn more at www.epri.com
 
DM Radio

Eric Kavanagh

Weekly
 
DM Radio is the world's longest-running show about data! Since 2008, we've interviewed the industry's brightest minds about AI, analytics, big data, cloud, data warehousing, digital transformation, Internet of Things (IoT), streaming and many other topics. Now broadcasting coast-to-coast, we're always looking for new voices. Send an email to [email protected]!
 
Are you a critical thinker ready to dive into AI? Welcome to Super Prompt: The Generative AI Podcast. Join me, Tony Wan, an ex Silicon Valley executive, as we 'unhype the hype' of AI via illuminating conversations with top engineers, and in-depth solo episodes. Our goal? To make it almost unnecessary to send a cybernetic organism back in time to fix things. Tailored for the technically-minded and discerningly skeptical, our discussions cover Large Language Models (LLMs), neural networks, mul ...
 
Into the Bytecode

Sina Habibian

Monthly
 
Into the Bytecode is a podcast about building the future. Check out these links for more: - Twitter: twitter.com/sinahab - Website: intothebytecode.com - Newsletter for updates: bytecode.substack.com
 
Crossing Thin Ice

Dave Ingram and Max Rudolph

Monthly+
 
A discussion of Risk and Risk Management from the perspective of an Insurance company risk manager. Insurers provide products that help everyone to manage their risks. Here you will hear Dave Ingram and Max Rudolph talk about the sorts of things that keep those insurance company risk managers up at night. Or at least they should.
 
Arxiv Papers

Igor Melnyk

Daily+
 
Running out of time to catch up with new arXiv papers? We take the most impactful papers and present them as convenient podcasts. If you're a visual learner, we offer these papers in an engaging video format. Our service fills the gap between overly brief paper summaries and time-consuming full paper reads. You gain academic insights in a time-efficient, digestible format. Code behind this work: https://github.com/imelnyk/ArxivPapers
 
Janes delivers validated open-source defence intelligence across four core capability areas (threat, equipment, defence industry and country) that are aligned with workflows across the defence industry, national security and government.
 
The Secret Life of Language

School of Languages and Linguistics - The University of Melbourne

Monthly
 
The Secret Life of Language dives into the cultures, arts, and histories that underpin and inform the diverse languages we speak. From the studios of the University of Melbourne’s School of Languages and Linguistics.
 
We interview researchers and developers who are creating new and innovative ideas in AI and Machine Learning. This bi-weekly podcast is looking for practical insights from the research world that tell us where AI and Machine learning are headed.
 
JAMA Medical News

JAMA Network

Weekly
 
Discussions of timely topics in clinical medicine, biomedical research, public health, health policy, and more, featured in the Medical News section of JAMA, the Journal of the American Medical Association.
 
Chris Romeo is going on a journey. A journey to understand threat modeling at the deepest levels. He thought he understood threat modeling but realized he could go deeper. Chris shares his findings and talks with some of the best-known experts in the space to experience continuous learning. Join along for the ride -- you will learn something. Chris Romeo is the CEO of Devici (THE Threat Modeling Company) and a General Partner at Kerr Ventures.
 
Chat G

Andre Morton and Sholto Maud

Monthly

"Navigating the Future of AI, One Thought at a Time" Chat G is a serious exploration of artificial intelligence with your hosts, Sholto and Andre, co-hosted by Hal-E, OpenAI’s GPT-4, a Large Language Model. In Chat G, we delve into the philosophical and ethical implications of the emerging world of AI. As this technology reshapes our world, we ask: what benefits and harms could it bring? Join us for a thought-provoking journey into the future of AI.
 
Welcome to Credit Shift, the podcast that dives into the challenges and opportunities, tools, and strategies shaping the world of credit, digital debt collection, and digital transformation. Brought to you by Webio.com, Credit Shift explores key industry trends and innovations, featuring insights from Webio experts and industry leaders. Whether you're navigating AI in collections, customer engagement strategies, or the future of digital debt collection, this podcast is your go-to resource fo ...
 
NEJM AI Grand Rounds, hosted by Arjun (Raj) Manrai, Ph.D. and Andrew Beam, Ph.D., features informal conversations with a variety of unique experts exploring the deep issues at the intersection of artificial intelligence, machine learning, and medicine. You’ll learn how AI will change clinical practice and healthcare, how it will impact the patient experience, and about the people who are pushing for innovation. Whether you are an AI researcher or a practicing clinician, these conversations w ...
 
Exploring SAP’s business technology platform and success with our revolutionary data tech. Discover key solutions and technologies that power SAP’s Business Technology Platform by listening in on conversations with partners, developers, and innovators across the SAP ecosystem. Get real-world stories and stats on topics like data governance, data integration and orchestration, machine learning, all things database, enterprise architecture, and more.
 
"Hello SundAI - Our World Through the Lens of AI," is your twice-weekly dive into how artificial intelligence shapes our digital landscape. Hosted by Roger and SundAI the AI, this podcast brings you practical tips, cutting-edge tools, and insightful interviews every Sunday and Wednesday morning. Whether you're a seasoned tech enthusiast or just starting to explore the digital domain, tune in to discover innovative ways to get things done and propel yourself forward in a world increasingly dr ...
 
The #EduDuctTape Podcast, hosted by Jake Miller, focuses on viewing #edtech as a tool used to meet goals, address learning standards, and solve problems in the classroom, much as duct tape is used as a tool that solves a plethora of problems in our lives. In each episode, Jake sits down with a different inspiring guest to share and discuss some awesome ideas for using tech in the classroom!
 
 
[This is our blog post on the papers, which can be found at https://transformer-circuits.pub/2025/attribution-graphs/biology.html and https://transformer-circuits.pub/2025/attribution-graphs/methods.html.] Language models like Claude aren't programmed directly by humans—instead, they're trained on large amounts of data. During that training process…
 
The paper introduces Speaking with Intent (SWI) in large language models, enhancing reasoning and generation quality through explicit intent, outperforming traditional methods in various benchmarks. https://arxiv.org/abs//2503.21544 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://po…
 
Lucy and Ellie chat about large language models, chat interfaces, and causal inference. Do LLMs Act as Repositories of Causal Knowledge?: https://arxiv.org/html/2412.10635v1 Follow along on Twitter: The American Journal of Epidemiology: @AmJEpi Ellie: @EpiEllie Lucy: @LucyStats 🎶 Our intro/outro music is courtesy of Joseph McDade. Edited by Cameron…
 
This is my conversation with Jeffrey Quesnelle, cofounder of Nous Research. Timestamps: - (00:00:00) intro - (00:01:08) working with new technologies - (00:06:15) Nous Research origin story - (00:14:08) open frontiers in research - (00:26:07) fourier transforms for gradient compression - (00:32:58) math behind distributed training - (00:38:18) spon…
 
A recent study showed AI-assisted screening using a large language model tool reduced time to determine trial eligibility compared with manual methods. Author Alexander J. Blood, MD, MSc, cardiologist at Brigham and Women's Hospital, and Associate Director of the Accelerator for Clinical Transformation Research Group at Harvard Medical School joins…
 
What part of your business can be wrong 20% of the time? Sales? Fulfillment? Accounting? The obvious answer is that 80% accuracy is a non-starter for most aspects of the modern enterprise. That should be cause for concern when leveraging Large Language Models. Yes, they are amazing, and they're getting better. But guardrails are required to make su…
 
Full article: Use of ChatGPT Large Language Models to Extract Details of Recommendations for Additional Imaging From Free-Text Impressions of Radiology Reports Morgan McLuckey, MD, discusses the AJR article by Li et al. exploring the use of large language models for extracting details related to recommendations for additional imaging from radiology…
 
Full article: Contrast Media in Children: Ten Important Concepts on Administration, Applications, Complications, and Environmental Considerations, From the AJR Special Series on Contrast Media How can radiologists and pediatricians optimize contrast media use in children? In this AJR Conversation, Pediatric Imaging Section Editor Jonathan R. Dillma…
 
In this episode, we dive deep into how NatWest uses generative AI to reshape customer service at scale, with Mark Worden, Strategy & Innovation Lead for Cora at NatWest. We explore how one of the UK’s biggest banks is building AI-driven customer journeys that are smarter, faster, and more efficient. From traditional NLU-based bots to cutting-edge R…
 
The US Is Suing Pharmacies for Aiding in the Opioid Crisis; Texas Measles Outbreak Spurs Call for Stronger Vaccine Advocacy; Study Finds Sleep-Related Infant Deaths Are on the Rise Related Content: US Government Sues Pharmacy Chains CVS and Walgreens for Their Alleged Role in the Opioid Epidemic Amid Texas Measles Outbreak, Clinicians Struggle to O…
 
Join this episode of DM Radio as host Eric Kavanagh interviews Max Howell and Timothy Lewis from tea.xyz about the critical role of open source in modern technology. They explore the challenges developers face in sustaining open-source projects, the importance of fair compensation, and how tea.xyz aims to revolutionize funding models for open-sourc…
 
This is my conversation with Michael Nielsen, scientist, author, and research fellow at the Astera Institute. Timestamps: - (00:00:00) intro - (00:01:06) cultivating optimism amid existential risks - (00:07:16) asymmetric leverage - (00:12:09) are "unbiased" models even feasible? - (00:18:44) AI and the scientific method - (00:23:23) unlocking AI's…
 
In Part 1 of the latest Let’s Talk Cloud ERP podcast, host Jennifer Frank McGrory chats with Bill Piotrowski and Chris Perry of IBM about IBM’s ambitious transformation journey as it modernizes its global operations with SAP S/4HANA. As a tech giant with a complex enterprise landscape, IBM is leveraging cloud, AI, and automation to enhance efficienc…
 
Opt-CWM is a self-supervised method for motion estimation from videos, achieving state-of-the-art performance without labeled data by optimizing counterfactual probes from a pre-trained model. https://arxiv.org/abs//2503.19953 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts…
 
Open Deep Search (ODS) enhances open-source LLMs with reasoning agents and web search tools, achieving state-of-the-art performance and surpassing proprietary solutions in accuracy on key benchmarks. https://arxiv.org/abs//2503.20201 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://p…
 
🎙️ Welcome to AI With Friends – your weekly launchpad into the world of Artificial Intelligence! Hosted by Marlon Avery, a pioneer in GenAI innovation, alongside Adrian Green, VP of Engineering at LiveNation, and Sekou Doumbouya, Senior Staff Cloud Systems Engineer, this show is your go-to source for all things AI. Our hosts bring di…
 
In this episode of the AJR Podcast Series on Diagnostic Excellence and Error, Francis Deng, MD, and Benjamin Strong, MD, discuss the justification and implementation of artificial intelligence (AI) for detection of critical pathologies within a quality assurance framework. They explore AI’s evolving impact on diagnostic error and malpractice risk. …
 
LookAhead Tuning preserves safety in fine-tuning large language models by modifying training data, ensuring robust performance while minimizing disruptions to initial token distributions. https://arxiv.org/abs//2503.19041 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.appl…
 
ReSearch is a novel framework that enhances LLM reasoning by integrating search processes through reinforcement learning, improving generalizability and advanced reasoning capabilities without supervised data. https://arxiv.org/abs//2503.19470 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts:…
 
We cover Anthropic’s groundbreaking Model Context Protocol (MCP). Though it was released in November 2024, we've been seeing a lot of hype around it lately, and thought it was well worth digging into. Learn how this open standard is revolutionizing AI by enabling seamless integration between LLMs and external data sources, fundamentally transformin…
 
About nine months ago, I and three friends decided that AI had gotten good enough to monitor large codebases autonomously for security problems. We started a company around this, trying to leverage the latest AI models to create a tool that could replace at least a good chunk of the value of human pentesters. We have been working on this project si…
 
(Audio version here (read by the author), or search for "Joe Carlsmith Audio" on your podcast app.) This is the fourth essay in a series that I’m calling “How do we solve the alignment problem?”. I’m hoping that the individual essays can be read fairly well on their own, but see this introduction for a summary of the essays that have been released t…
 
LessWrong has been receiving an increasing number of posts and contents that look like they might be LLM-written or partially-LLM-written, so we're adopting a policy. This could be changed based on feedback. Humans Using AI as Writing or Research Assistants Prompting a language model to write an essay and copy-pasting the result will not typically …
 
Have you heard about turning carbon dioxide into graphite? In this lively fireside chat, Dr. Joan Palmiter Bajorek sits down with CEO and co-founder Makoto Eyre to explore why new ideas in climate tech matter more than ever—especially when it comes to tackling carbon dioxide’s impact on our planet. Homeostasis turns CO2 into industrial grap…
 
Find Chris on LinkedIn, and learn about Obrizum. I talk with Chris Pedder from Obrizum. Chris started his career as a Physicist, before moving into AI and Data. He is now Chief Data and AI Officer at Obrizum, who use AI to deliver adaptive learning. In this episode we talk about the behaviours that Chris intentionally fosters in his team, including…
 
FFN Fusion optimizes large language models by parallelizing Feed-Forward Network layers, achieving significant inference speedup and cost reduction while maintaining performance, especially in larger models. https://arxiv.org/abs//2503.18908 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: h…
 
Thanks to Jesse Richardson for discussion. Polymarket asks: will Jesus Christ return in 2025? In the three days since the market opened, traders have wagered over $100,000 on this question. The market traded as high as 5%, and is now stably trading at 3%. Right now, if you wanted to, you could place a bet that Jesus Christ will not return this year…
 
Today, we're joined by Julie Kallini, PhD student at Stanford University to discuss her recent papers, “MrT5: Dynamic Token Merging for Efficient Byte-level Language Models” and “Mission: Impossible Language Models.” For the MrT5 paper, we explore the importance and failings of tokenization in large language models—including inefficient compression…
 
The paper explores post-training methods for large language models to enhance both output diversity and quality in creative writing, achieving human-like diversity with minimal quality loss. https://arxiv.org/abs//2503.17126 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.a…