FormalProofBench: A New Benchmark for AI in Graduate-Level Mathematics
FormalProofBench is a new benchmark that evaluates how well AI models can generate formally verified proofs of graduate-level mathematics problems.
Editorial Staff 7 days ago