
Snowflake AI Research
We are a team with extensive experience building systems and technology that have significantly reduced the cost of LLM training and inference. A lot of our work has been open-sourced to provide the AI community with more accessible and cost-effective LLMs.
The team includes many specialists in natural language processing and search. With the help of thousands of engineers worldwide at Snowflake, our cutting-edge technology powers enterprise AI products in Cortex AI and more. To meet the individual experts driving innovation, and their research check out our webpage.
sort
JUN 03, 2025Gen AI
Inside Snowflake Intelligence: Five Pillars of Enterprise-Grade Agentic AI

DEC 05, 2024Gen AI
SwiftKV: Accelerating Enterprise LLM Workloads with Knowledge Preserving Compute Reduction

NOV 19, 2024Gen AI
Benchmarking LLMs on Writing Feature Engineering Code

AUG 27, 2024Gen AI
The Recipe for Success: Blending Data for Better LLM Pretraining

JUL 23, 2024Gen AI
Fine-Tuning Llama 3.1 405B on a Single Node using Snowflake’s Memory-Optimized AI Stack

JUL 23, 2024Gen AI
Achieve Low-Latency and High-Throughput Inference with Meta's Llama 3.1 405B using Snowflake’s Optimized AI Stack

JUL 18, 2024
Snowflake Arctic Embed M v1.5: Hitting the ROI Sweet Spot for Enterprise Retrieval

JUL 11, 2024
Snowflake Arctic Cookbook Series: A Deep Dive into LLM Evaluation Standards

JUN 17, 2024Gen AI
Moving Beyond MTEB and BEIR: Snowflake AI Research Joins Forces with the University of Waterloo to Evolve RAG and Retrieval Benchmarks

Previous
1
2
Next