The Kaitchup – AI on a Budget
Subscribe
Sign in
Home
Notes
AI Notebooks
The Kaitchup Pro
Table of Contents
The Kaitchup's Book
Tutorials
Models
Archive
About
Latest
Top
Discussions
The Weekly Kaitchup #59
Qwen2.5 - Ternary LLMs - Moshi
5 hrs ago
•
Benjamin Marie
2
Share this post
The Weekly Kaitchup #59
newsletter.kaitchup.com
Copy link
Facebook
Email
Note
Other
Run and Serve Faster VLMs Like Pixtral and Phi-3.5 Vision with vLLM
Understanding how much memory you need to serve a VLM
Sep 19
•
Benjamin Marie
5
Share this post
Run and Serve Faster VLMs Like Pixtral and Phi-3.5 Vision with vLLM
newsletter.kaitchup.com
Copy link
Facebook
Email
Note
Other
2
Multimodal RAG with ColPali and Qwen2-VL on Your Computer
Retrieve and exploit information from PDFs without OCR
Sep 16
•
Benjamin Marie
10
Share this post
Multimodal RAG with ColPali and Qwen2-VL on Your Computer
newsletter.kaitchup.com
Copy link
Facebook
Email
Note
Other
3
The Weekly Kaitchup #58
AdEMAMix - FLUTE
Sep 13
•
Benjamin Marie
4
Share this post
The Weekly Kaitchup #58
newsletter.kaitchup.com
Copy link
Facebook
Email
Note
Other
Introducing Minivoc: Faster and Memory-Efficient LLMs Through Vocabulary Reduction [WIP]
From 128k to 32k tokens
Sep 13
•
Benjamin Marie
4
Share this post
Introducing Minivoc: Faster and Memory-Efficient LLMs Through Vocabulary Reduction [WIP]
newsletter.kaitchup.com
Copy link
Facebook
Email
Note
Other
GuideLLM: Is Your Server Ready for LLM Deployment?
Simulate real-world inference workloads with GuideLLM
Sep 12
•
Benjamin Marie
5
Share this post
GuideLLM: Is Your Server Ready for LLM Deployment?
newsletter.kaitchup.com
Copy link
Facebook
Email
Note
Other
GGUF Quantization with Imatrix and K-Quantization to Run LLMs on Your CPU
Fast and accurate GGUF models for your CPU
Sep 9
•
Benjamin Marie
6
Share this post
GGUF Quantization with Imatrix and K-Quantization to Run LLMs on Your CPU
newsletter.kaitchup.com
Copy link
Facebook
Email
Note
Other
The Weekly Kaitchup #57
OLMoE - GuideLLM - vLLM 0.6.0
Sep 6
•
Benjamin Marie
5
Share this post
The Weekly Kaitchup #57
newsletter.kaitchup.com
Copy link
Facebook
Email
Note
Other
3
Falcon Mamba, Jamba, RWKV... Can You Use Them on Your Computer?
A close look at quantization and parameter-efficient fine-tuning (LoRA/QLoRA) for SSMs, RWKV, and hybrid models
Sep 5
•
Benjamin Marie
Share this post
Falcon Mamba, Jamba, RWKV... Can You Use Them on Your Computer?
newsletter.kaitchup.com
Copy link
Facebook
Email
Note
Other
Announcing The Kaitchup's Book: LLMs on a Budget [Pre-sales Open]
Learn how to fine-tune, quantize, run, and serve LLMs on consumer hardware
Sep 4
•
Benjamin Marie
13
Share this post
Announcing The Kaitchup's Book: LLMs on a Budget [Pre-sales Open]
newsletter.kaitchup.com
Copy link
Facebook
Email
Note
Other
5
Run Qwen2-VL on Your Computer with Text, Images, and Video, Step by Step
Your local multimodal chat model
Sep 2
•
Benjamin Marie
5
Share this post
Run Qwen2-VL on Your Computer with Text, Images, and Video, Step by Step
newsletter.kaitchup.com
Copy link
Facebook
Email
Note
Other
4
August 2024
The Weekly Kaitchup #56
NanoFlow - Comparison of LLM Inference Services (Accuracy) - Zamba2-1.2B
Aug 30
•
Benjamin Marie
5
Share this post
The Weekly Kaitchup #56
newsletter.kaitchup.com
Copy link
Facebook
Email
Note
Other
Share
Copy link
Facebook
Email
Note
Other
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts