Getting Started with Prompt Compression API

This guide provides step-by-step instructions on setting up prompt compression API, that compresses input prompts before passing it to LLMs, helping you save cost and reduce inference speed.

1. Open compressor setup modal

The “Compressor setup modal” provides your essential connection details: API endpoint, key, and sample request bodies. This ensures a smooth setup for compressing your prompts.

Now, you can see the setup modal for the compressor. If you are not able to see the below modal, contact our support team.

If you close the modal window, you can access it again by clicking “Setup Now” CTA on the banner at the top of the page.

2. Access your API keys

The API keys equips you with the credentials needed to connect with Prompt Compression API. This unlocks the power to compress your prompts and experience cost savings.

We suggest you to keep your API keys very safely. If you experience any misuse, contact our support team.

3. Send first request to Prompt Compression API

Once you have your request setup for a sample body, you can send a request to the Compressor API to compress your own prompts.

All Done!

Congrats! You’ve set up your LLUMO’s Prompt Compression API and it has started saving your cost with each inference!

If you encounter any difficulties while setting up, please refer to the troubleshooting section of the guide or, contact our support team at connect@llumo.ai.