This is an example blog post to demonstrate the Markdown blog feature. You can write full articles here using Markdown syntax.

How It Works

The compressed finite state machine (FSM) approach enables faster JSON and regex constrained decoding for local LLMs.

  • Compresses the FSM transitions to reduce memory and lookup overhead
  • Integrates with SGLang runtime
  • Achieves significant speedups over naive implementations

Delete this example post or replace it with your own content.