«By recruiting over 100 NLP researchers to write novel ideas and blind
reviews of both LLM and human ideas, we obtain the first statistically
significant conclusion on current LLM capabilities for research ideation:
we find LLM-generated ideas are judged as more novel (p < 0.05) than human
expert ideas while being judged slightly weaker on feasibility.»

https://arxiv.org/abs/2409.04109v1

Reply via email to