Show HN: Sweep, Open-weights 1.5B model for next-edit autocomplete

Sweep Next-Edit 1.5B (GGUF)

A 1.5B parameter model for next-edit autocomplete, quantized to Q8_0 GGUF format.

Model Description

Sweep Next-Edit predicts your next code edit before you make it. It runs locally on your laptop in under 500ms (with speculative decoding) and outperforms models over 4x its size on next-edit benchmarks.

Usage

Download run_model.py and the model file, then:

uv pip install llama-cpp-python huggingface_hub
python run_model.py
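For readers who want to see roughly what a script like this involves, here is a minimal sketch of downloading a GGUF file and running one completion with the two packages installed above. It is not the contents of run_model.py. The repo ID and file name are left as command-line placeholders because this page does not state them, and the default prompt, `n_ctx=4096`, and the `parse_args`/`main` helpers are illustrative assumptions. The real script presumably builds a model-specific next-edit prompt, which this sketch does not attempt to reproduce.

```python
import argparse


def parse_args(argv=None):
    """Parse CLI arguments; repo_id and filename are placeholders the user must supply."""
    parser = argparse.ArgumentParser(
        description="Run a GGUF model locally with llama-cpp-python"
    )
    parser.add_argument("repo_id", help="Hugging Face repo hosting the GGUF file")
    parser.add_argument("filename", help="GGUF file name inside that repo")
    # Plain code prefix used only for illustration, not the model's real prompt format.
    parser.add_argument("--prompt", default="def add(a, b):\n    return ")
    parser.add_argument("--max-tokens", type=int, default=64)
    return parser.parse_args(argv)


def main(argv=None):
    args = parse_args(argv)

    # Imported lazily so argument parsing works even before the packages are installed.
    from huggingface_hub import hf_hub_download
    from llama_cpp import Llama

    # Download the file (or reuse the local cache) and get its path on disk.
    model_path = hf_hub_download(repo_id=args.repo_id, filename=args.filename)

    # n_ctx=4096 is an assumed context size, not a documented setting for this model.
    llm = Llama(model_path=model_path, n_ctx=4096, verbose=False)

    # Greedy decoding keeps the output deterministic for a quick smoke test.
    result = llm(args.prompt, max_tokens=args.max_tokens, temperature=0.0)
    print(result["choices"][0]["text"])
```

To try it, call `main(["<repo-id>", "<file>.gguf"])` with the actual repository and file name from the model page.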

Model Details

Format: GGUF (Q8_0 quantization)

Parameters: 1.5B
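As a rough sanity check on what these two figures imply for disk and memory use, the sketch below estimates the weight size under Q8_0. In llama.cpp, Q8_0 stores weights in blocks of 32 int8 values plus one 16-bit scale, so 34 bytes per 32 weights (8.5 bits per weight). The estimate assumes every parameter is stored as Q8_0 and treats "1.5B" as exactly 1,500,000,000. The real file will differ somewhat because of metadata, any tensors kept at higher precision, and the rounded parameter count. The helper name `q8_0_size_bytes` is hypothetical.

```python
def q8_0_size_bytes(n_params: int, block_size: int = 32, scale_bytes: int = 2) -> int:
    """Estimate storage for n_params weights in Q8_0: int8 values plus one fp16 scale per block."""
    n_blocks = -(-n_params // block_size)  # ceiling division
    return n_blocks * (block_size + scale_bytes)


# Assumption: "1.5B" taken as exactly 1.5e9 parameters, all quantized to Q8_0.
size = q8_0_size_bytes(1_500_000_000)
print(f"{size / 1e9:.2f} GB")  # prints "1.59 GB"
```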
