Abstract: Large language model (LLM)-based auto-graders, like Claude 3.5 Sonnet, show promise in educational technology. To test their capabilities, we conducted an experiment in which four ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results