When Can We Trust LLMs in Mental Health? Large-scale Benchmarks for Reliable LLM Evaluation
𝐀𝐮𝐭𝐡𝐨𝐫𝐬: 𝐀𝐛𝐞𝐞𝐫 𝐁𝐚𝐝𝐚𝐰𝐢, 𝐄𝐥𝐚𝐡𝐞 𝐑𝐚𝐡𝐢𝐦𝐢, 𝐌𝐝 𝐓𝐚𝐡𝐦𝐢𝐝 𝐑𝐚𝐡𝐦𝐚𝐧 𝐋𝐚𝐬𝐤𝐚𝐫, 𝐒𝐡𝐞𝐫𝐢 𝐆𝐫𝐚𝐜𝐡, 𝐋𝐢𝐧𝐝𝐬𝐚𝐲 𝐁𝐞𝐫𝐭𝐫𝐚𝐧𝐝, 𝐋𝐚𝐦𝐞𝐬 𝐃𝐚𝐧𝐨𝐤, 𝐉𝐢𝐦𝐦𝐲 𝐇𝐮𝐚𝐧𝐠, 𝐅𝐫𝐚𝐧𝐤 𝐑𝐮𝐝𝐳𝐢𝐜𝐳, 𝐄𝐥𝐡𝐚𝐦 𝐃𝐨𝐥𝐚𝐭𝐚𝐛𝐚𝐝𝐢.
𝐀𝐮𝐭𝐡𝐨𝐫𝐬: 𝐀𝐛𝐞𝐞𝐫 𝐁𝐚𝐝𝐚𝐰𝐢, 𝐄𝐥𝐚𝐡𝐞 𝐑𝐚𝐡𝐢𝐦𝐢, 𝐌𝐝 𝐓𝐚𝐡𝐦𝐢𝐝 𝐑𝐚𝐡𝐦𝐚𝐧 𝐋𝐚𝐬𝐤𝐚𝐫, 𝐒𝐡𝐞𝐫𝐢 𝐆𝐫𝐚𝐜𝐡, 𝐋𝐢𝐧𝐝𝐬𝐚𝐲 𝐁𝐞𝐫𝐭𝐫𝐚𝐧𝐝, 𝐋𝐚𝐦𝐞𝐬 𝐃𝐚𝐧𝐨𝐤, 𝐉𝐢𝐦𝐦𝐲 𝐇𝐮𝐚𝐧𝐠, 𝐅𝐫𝐚𝐧𝐤 𝐑𝐮𝐝𝐳𝐢𝐜𝐳, 𝐄𝐥𝐡𝐚𝐦 𝐃𝐨𝐥𝐚𝐭𝐚𝐛𝐚𝐝𝐢.
𝐀𝐮𝐭𝐡𝐨𝐫𝐬: 𝐀𝐛𝐞𝐞𝐫 𝐁𝐚𝐝𝐚𝐰𝐢, 𝐒𝐨𝐦𝐚𝐲𝐲𝐚 𝐄𝐥𝐦𝐨𝐠𝐡𝐚𝐳𝐲, 𝐒𝐚𝐦𝐢𝐫𝐚 𝐂𝐡𝐨𝐮𝐝𝐡𝐮𝐫𝐲, 𝐒𝐚𝐫𝐚 𝐄𝐥𝐠𝐚𝐳𝐳𝐚𝐫, 𝐊𝐡𝐚𝐥𝐢𝐝 𝐄𝐥𝐠𝐚𝐳𝐳𝐚𝐫, 𝐀𝐦𝐞𝐫 𝐌 𝐁𝐮𝐫𝐡𝐚𝐧.
𝐀𝐮𝐭𝐡𝐨𝐫𝐬: 𝐒𝐡𝐞𝐫𝐢 𝐆𝐫𝐚𝐜𝐡, 𝐀𝐛𝐞𝐞𝐫 𝐁𝐚𝐝𝐚𝐰𝐢, 𝐅𝐚𝐫𝐚𝐡 𝐀𝐡𝐦𝐚𝐝, 𝐄𝐥𝐡𝐚𝐦 𝐃𝐨𝐥𝐚𝐭𝐚𝐛𝐚𝐝𝐢.