I anticipate we’ll shortly have PAFs, “Prompt Application Firewalls”, on the mar...

cddotdotslash · on May 13, 2023

I’ve started working on something like this as a side project: https://usagepanda.com

It started originally as a way to limit costs (the proxy would intercept requests, estimate the token sizes, and block requests before they are sent to OpenAI). However, at the request of some early users, I’ve expanded it to include things like keyword detection/blocking, moderation enforcement, etc.

I’m not entirely convinced you can ever fully block prompt attacks, but right now most companies are just asking for visibility into it. So you could monitor for things like: do certain malicious phrases appear in the request? Or does a significant percentage of the original prompt text also appear in the response (a signal that the prompt is leaking).

instance · on May 13, 2023

I've put up something like this as a side project a few weeks ago: https://flamingoo.ai/

Love the term "PAF", I think I'll include this.

awestroke · on May 13, 2023

The website does not mention what the product is

instance · on May 13, 2023

yea, you're right. It's really pre-MVP. Basically an API in between your user facing input and OpenAI that detects prompt injections and flags them for you so you can abort sending to OpenAI.

quickthrower2 · on May 13, 2023

I believe they will exist, but I don’t think they will be effective at stopping the threat, but a good money making opportunity for someone who wants to sell the feeling of reassurance.