I have tried dozens of Android productivity apps, each promising to be the one but always falling short. That is, until I discovered Anytype. This isn't another note-taking or to-do app. It's a ...
Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.