FormalProofBench: A New Benchmark for AI in Graduate-Level Mathematics
FormalProofBench is introduced as a benchmark to evaluate AI's ability to generate formally verified mathematical proofs, focusing on graduate-level tasks.
Technology, AI, cybersecurity, infrastructure, and innovation.