Supplementary Tables

 
Table S7. Summary statistics for the grading component “scientific correctness”.
Question  Source   Mean  Median  Interquartile Range  Minimum  Maximum
1         ChatGPT  8.26  8       1.0                  5        10
1         PT       5.43  6       4.0                  1        10
2         ChatGPT  8.04  8       2.0                  4        10
2         PT       5.22  5       2.5                  2        10
3         ChatGPT  7.26  8       1.0                  1        10
3         PT       6.33  7       3.0                  1        10
4         ChatGPT  7.85  8       2.0                  6        10
4         PT       7.89  8       2.0                  4        10
5         ChatGPT  8.07  8       1.5                  2        10
5         PT       7.85  8       2.0                  2        10
6         ChatGPT  7.81  8       2.0                  5        10
6         PT       7.63  8       2.0                  2        10
7         ChatGPT  7.89  8       2.0                  4        10
7         PT       5.93  6       3.0                  1        10
8         ChatGPT  7.59  8       2.0                  4        10
8         PT       6.19  7       3.0                  0        10
9         ChatGPT  7.44  8       3.0                  3        10
9         PT       6.85  7       2.5                  0        10
 
Table S8. Summary statistics for the grading component “comprehensibility”.
Question  Source   Mean  Median  Interquartile Range  Minimum  Maximum
1         ChatGPT  7.67  8       2.0                  3        10
1         PT       7.81  8       2.0                  4        10
2         ChatGPT  8.63  9       1.5                  5        10
2         PT       6.15  6       3.0                  1        10
3         ChatGPT  8.48  9       2.0                  5        10
3         PT       7.37  8       3.0                  3        10
4         ChatGPT  7.81  8       2.0                  3        10
4         PT       7.33  7       1.5                  3        10
5         ChatGPT  8.44  8       1.5                  5        10
5         PT       8.22  8       2.0                  6        10
6         ChatGPT  8.37  8       1.0                  6        10
6         PT       7.11  7       1.5                  3        10
7         ChatGPT  8.52  9       1.0                  5        10
7         PT       6.70  6       2.5                  3        10
8         ChatGPT  8.26  8       1.0                  5        10
8         PT       7.26  7       2.0                  4        10
9         ChatGPT  8.04  8       2.0                  4        10
9         PT       7.30  7       3.0                  4        10
 
Table S9. Summary statistics for the grading component “actionability”.
Question  Source   Mean  Median  Interquartile Range  Minimum  Maximum
1         ChatGPT  6.78  7       4.00                 2        10
1         PT       6.83  7       2.50                 3        10
2         ChatGPT  7.22  7       3.00                 2        10
2         PT       5.11  5       3.50                 1        10
3         ChatGPT  8.20  8       1.75                 6        10
3         PT       7.33  8       1.00                 3        10
4         ChatGPT  6.52  7       1.50                 0        10
4         PT       7.11  7       2.00                 2        10
5         ChatGPT  7.59  8       1.50                 3        10
5         PT       7.26  8       1.00                 3        9
6         ChatGPT  7.78  8       2.00                 2        10
6         PT       6.48  7       2.50                 2        10
7         ChatGPT  8.04  8       1.50                 3        10
7         PT       6.44  7       3.00                 2        10
8         ChatGPT  7.33  7       3.00                 3        10
8         PT       6.52  7       3.00                 0        10
9         ChatGPT  7.15  7       3.00                 3        10
9         PT       6.78  7       2.50                 2        10
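
For readers who want to reproduce this kind of summary from raw ratings, the following is a minimal Python sketch of how the five reported statistics (mean, median, interquartile range, minimum, maximum) could be computed for one question and one response source. The function name `summarize`, the example scores, and the quartile interpolation method are assumptions made for illustration; the tables do not state which software or quartile definition was used, and the example scores are not study data.

```python
import statistics


def summarize(scores):
    """Compute mean, median, IQR, minimum and maximum for one list of ratings."""
    # method="inclusive" interpolates linearly between observed values.
    # This quartile definition is an assumption; other definitions can
    # give slightly different interquartile ranges on small samples.
    q1, _, q3 = statistics.quantiles(scores, n=4, method="inclusive")
    return {
        "mean": round(statistics.mean(scores), 2),
        "median": statistics.median(scores),
        "iqr": q3 - q1,
        "min": min(scores),
        "max": max(scores),
    }


# Hypothetical ratings on a 0-10 scale, for illustration only (not study data).
example_scores = [5, 7, 8, 8, 9, 10, 10]
summary = summarize(example_scores)
print(summary)
```

Applying `summarize` once per question and per response source would yield one table row each. If the results differ slightly from a statistics package, the quartile definition is the most likely cause.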