Community
Discussions
How should we evaluate AI emotional intelligence?
I've been thinking about metrics for emotional intelligence in AI companions and wanted to get the community's thoughts on how we might standardize...By Alex Johnson3 days ago
12
24
Proposed updates to the Humane Tech Scorecard
After collecting data for the past few months, I've noticed some potential improvements we could make to our scoring system, particularly around...By Jane Smith1 week ago
28
47
Red teaming results: patterns across different LLMs
I've compiled the results from our community red teaming efforts across the top 5 LLMs and found some interesting patterns that might be worth...By Sam Chen2 weeks ago
19
36
Comparative study: Human vs AI evaluation consistency
We recently conducted a small study comparing the consistency of human evaluators vs. automated metrics for AI rating, and the results are quite interesting...By Maya Williams2 weeks ago
15
29
...