Oct 11, 2023
Interesting approach! I can see some potential latency challenges arising from the LLM's generation of (and inference of) multiple queries, as well as from RRF. Will definitely try this out.
I am a data scientist and former mathematics tutor with a passion for reading, writing and teaching others. I am also a hobbyist poet and dog mum to Jujubee.