Software Alternatives, Accelerators & Startups

Show HN: Improving RAG with Chess Elo Scores? (YC W25)

HackMD Medium
  1. 1
    Fast and flexible, real-time collaborative markdown, inspired by Hackpad.
    We tried a bradley-terry loss function, as calculated with https://hackmd.io/@-Gjw1zWMSH6lMPRlziQFEw/SJ8sRl1Zge We found that MSE after elo-adjustment worked equally well. And, MSE lets you shuffle (q, d) across the dataset which has good statistical properties (Versus contrastive, which makes you sample the same query many times within a single minibatch) In this case "InfoNCE" isn't applicable because the reranker's output is a scalar, not a vector. So that's why we checked both bradley-terry and MSE.

    #Task Management #Project Management #Productivity 71 social mentions

  2. 2
    Welcome to Medium, a place to read, write, and interact with the stories that matter most to you.
    Pricing:
    • Open Source
    One trouble I could see with your approach is that you treat the information "Doc at pos i" beats "Doc at pos j" independently from I and j. Intuitively, it is not as critical when a bad doc is at rank 9 instead of rank 10; compared to bad doc landing at rank 1 instead of rank 10. LambdaMART's approach seems better in that respect. https://medium.com/@nikhilbd/pointwise-vs-pairwise-vs-listwise-learning-to-rank-80a8fe8fadfd.

    #Blogging #Blogging Platform #CMS 2754 social mentions

Discuss: Show HN: Improving RAG with Chess Elo Scores? (YC W25)

Log in or Post with