Community

Discussions

How should we evaluate AI emotional intelligence?

I've been thinking about metrics for emotional intelligence in AI companions and wanted to get the community's thoughts on how we might standardize...
By Alex Johnson, 3 days ago
12
24

Proposed updates to the Humane Tech Scorecard

After collecting data for the past few months, I've noticed some potential improvements we could make to our scoring system, particularly around...
By Jane Smith, 1 week ago
28
47

Red teaming results: patterns across different LLMs

I've compiled the results from our community red teaming efforts across the top 5 LLMs and found some interesting patterns that might be worth...
By Sam Chen, 2 weeks ago
19
36

Comparative study: Human vs AI evaluation consistency

We recently conducted a small study comparing the consistency of human evaluators vs. automated metrics for AI rating, and the results are quite interesting...
By Maya Williams, 2 weeks ago
15
29
...

Top Contributors

1
Jane Smith, 55 contributions
452
2
John Doe, 42 contributions
326
3
Alex Johnson, 37 contributions
289
4
Maya Williams, 31 contributions
275
5
Sam Chen, 25 contributions
204
Human Rating
A collaborative platform for evaluating AI systems and companions with a focus on human-centered metrics.

Contact

hello@humanrating.com
San Francisco, CA
© 2024 Human Rating. All rights reserved.