2 Comments

User's avatar
Greg Jordan's avatar

Nice post! I'd meant to try this out when the FOCUS paper came out, but it dropped off my radar.

Here's an example of where we ended up with Paperpile's "structured summary" prompt that tries to achieve a similar result, using Gemini and the Sakana AI paper: https://gemini.google.com/share/07173a2e7a9b

One thing we found (which surprised me) is the models are *very* lazy about extracting quotations!

Just judging by eye, it looks like Claude and Gemini are supporting each key point with quotations around ~50% of the time. Gemini maybe a bit more often than Claude. Despite the FOCUS prompt being very explicit about "every point must be supported by a quote".

For Paperpile's summarization prompt, I went down a fairly long rabbit hole of prompt engineering to get the level of quotation citing that we were hoping for. Somewhat sadly, we found that repeating guidance in multiple places, SHOUTING IMPORTANT INSTRUCTIONS, and giving explicit in-context examples, all helped achieve a better result.

(I could go on for ages about what else we did to make AI-generated summaries useful, but I'll spare you too much detail. https://paperpile.com/h/ask-ai/ gives a pretty good overview, and we'll have a more in-depth blog post about this soon.)

Ryan Wright's avatar

Use this a lot now... thanks, Stephen.

No posts

Ready for more?