| | LLM Inference with Ray: Expert parallelism and prefill/decode disaggregation (anyscale.com) |
| 1 point by mycelia 5 days ago | past | discuss |
|
| | LLM Engine Orchestration for Performance (anyscale.com) |
| 1 point by mycelia 57 days ago | past |
|
| | Massively Parallel Agentic Simulations with Ray (anyscale.com) |
| 2 points by robertnishihara 83 days ago | past |
|
| | Deploy DeepSeek‑R1 with VLLM and Ray Serve on Kubernetes (anyscale.com) |
| 1 point by robertnishihara 3 months ago | past |
|
| | An Open Source Stack for AI Compute: Kubernetes and Ray and PyTorch and VLLM (anyscale.com) |
| 1 point by robertnishihara 4 months ago | past |
|
| | Native LLM APIs in Ray Data and Ray Serve (anyscale.com) |
| 2 points by robertnishihara 4 months ago | past |
|
| | Joins and Hash-Shuffle in Ray Data (anyscale.com) |
| 3 points by robertnishihara 4 months ago | past |
|
| | Open Source RL Libraries for LLMs (anyscale.com) |
| 1 point by robertnishihara 5 months ago | past |
|
| | Large-Scale Deployment of Ray in Tencent's Weixin AI Infrastructure (anyscale.com) |
| 2 points by robertnishihara 5 months ago | past |
|
| | Uv and Ray: Pain-Free Python Dependencies in Clusters (anyscale.com) |
| 44 points by robertnishihara 5 months ago | past | 10 comments |
|
| | An OSS Stack for AI Compute: Kubernetes + Ray + PyTorch + LLM (anyscale.com) |
| 3 points by gabe_monroy 5 months ago | past |
|
| | An Open Source Stack for AI Compute: Kubernetes and Ray and PyTorch and VLLM (anyscale.com) |
| 1 point by robertnishihara 5 months ago | past |
|
| | Uv and Ray: Pain-Free Python Dependencies in Clusters (anyscale.com) |
| 1 point by robertnishihara 9 months ago | past |
|
| | Direct Preference Optimization with Synthetic Data on Anyscale (anyscale.com) |
| 1 point by robertnishihara on Aug 21, 2024 | past |
|
| | Anyscale Appoints Keerti Melkote as CEO (anyscale.com) |
| 2 points by dnnssl2 on July 31, 2024 | past |
|
| | Building an LLM Router for High-Quality and Cost-Effective Responses (anyscale.com) |
| 1 point by robertnishihara on July 2, 2024 | past |
|
| | End-to-End LLM Workflows Guide (anyscale.com) |
| 1 point by GokuMohandas on June 18, 2024 | past |
|
| | Lessons from training a Stable Diffusion model on 2B images (anyscale.com) |
| 5 points by robertnishihara on May 11, 2024 | past |
|
| | Canva Built a Modern AI Platform Using Anyscale (anyscale.com) |
| 2 points by robertnishihara on April 3, 2024 | past |
|
| | Building RAG-Based LLM Applications for Production (anyscale.com) |
| 2 points by robertnishihara on Feb 14, 2024 | past |
|
| | Fine-tuning LLMs for longer context and better RAG systems (anyscale.com) |
| 1 point by robertnishihara on Feb 13, 2024 | past |
|
| | RAG at Scale: 10x Cheaper Embedding Computations with Anyscale and Pinecone (anyscale.com) |
| 1 point by robertnishihara on Jan 16, 2024 | past |
|
| | Comparing LLM Performance: Introducing the Open Source Leaderboard for LLM APIs (anyscale.com) |
| 2 points by robertnishihara on Dec 21, 2023 | past |
|
| | Anyscale Endpoints: JSON Mode and Function Calling Features (anyscale.com) |
| 2 points by robertnishihara on Dec 14, 2023 | past |
|
| | Anyscale Endpoints: JSON Mode and Function Calling Features (anyscale.com) |
| 1 point by tosh on Dec 13, 2023 | past |
|
| | Building RAG-Based LLM Applications for Production (anyscale.com) |
| 2 points by behnamoh on Nov 24, 2023 | past |
|
| | LLM summarization: A case study of human, Llama-2, & GPT-4 summarization quality (anyscale.com) |
| 1 point by robertnishihara on Nov 10, 2023 | past |
|
| | Building Rag-Based LLM Applications for Production (anyscale.com) |
| 1 point by akbarnama on Nov 6, 2023 | past |
|
| | Reproducible Performance Metrics for LLM Inference (anyscale.com) |
| 2 points by robertnishihara on Nov 2, 2023 | past |
|
| | Building Rag-Based LLM Applications for Production (anyscale.com) |
| 3 points by robertnishihara on Oct 25, 2023 | past |
|
|
| More |