Tech
ManiBench: New Benchmark for Visual-Logic Drift in Code Generation
ManiBench is designed to evaluate code generation in dynamic visual contexts, addressing gaps left by existing benchmarks like HumanEval and MBPP.
Editorial Staff 21 days ago