Author Name: Jin, Zhijing
Keyword Term: Model evaluation
Keyword Term: Natural language processing
Study Type: experimental
Keyword Term: Benchmark
Jan 7, 2024
Jin, Zhijing, 2024, "CLadder: Assessing Causal Reasoning in Language Models", https://doi.org/10.17617/3.NVRRA9, Edmond, V1
Paper: "CLadder: Assessing Causal Reasoning in Language Models" (NeurIPS 2023) by Zhijing Jin*, Yuen Chen*, Felix Leeb*, Luigi Gresele*, Ojasv Kamal, Zhiheng Lyu, Kevin Blin, Fernando Gonzalez, Max Kleiman-Weiner, Mrinmaya Sachan, Bernhard Schölkopf. (http://arxiv.org/abs/2312.04...