From 238ec2b7cc1557d6f34c33cc482e4d0cd3e266dd Mon Sep 17 00:00:00 2001 From: Christian Krinitsin Date: Sun, 6 Jul 2025 16:43:19 +0000 Subject: add results --- classification/preambel-user-mode | 10 + results/classifier/accel-gemma3:12b/analysis.csv | 2 +- results/classifier/deepseek-r1:32b/analysis.csv | 4 + results/classifier/deepseek-r1:32b/categories.csv | 5 + .../deepseek-r1:32b/output/instruction/1022 | 36 + .../deepseek-r1:32b/output/instruction/1028 | 37 + .../deepseek-r1:32b/output/instruction/1051 | 4 + .../deepseek-r1:32b/output/instruction/1054812 | 8 + .../deepseek-r1:32b/output/instruction/1086 | 72 ++ .../deepseek-r1:32b/output/instruction/1092 | 17 + .../deepseek-r1:32b/output/instruction/1095857 | 14 + .../deepseek-r1:32b/output/instruction/1129571 | 17 + .../deepseek-r1:32b/output/instruction/1156 | 4 + .../deepseek-r1:32b/output/instruction/1178 | 4 + .../deepseek-r1:32b/output/instruction/1245543 | 26 + .../deepseek-r1:32b/output/instruction/1248168 | 27 + .../deepseek-r1:32b/output/instruction/1251 | 18 + .../deepseek-r1:32b/output/instruction/1254786 | 45 + .../deepseek-r1:32b/output/instruction/1267955 | 45 + .../deepseek-r1:32b/output/instruction/1283519 | 13 + .../deepseek-r1:32b/output/instruction/1308381 | 17 + .../deepseek-r1:32b/output/instruction/1328996 | 6 + .../deepseek-r1:32b/output/instruction/1339 | 19 + .../deepseek-r1:32b/output/instruction/1370 | 16 + .../deepseek-r1:32b/output/instruction/1371 | 22 + .../deepseek-r1:32b/output/instruction/1372 | 23 + .../deepseek-r1:32b/output/instruction/1373 | 23 + .../deepseek-r1:32b/output/instruction/1374 | 25 + .../deepseek-r1:32b/output/instruction/1375 | 22 + .../deepseek-r1:32b/output/instruction/1376 | 18 + .../deepseek-r1:32b/output/instruction/1404690 | 41 + .../deepseek-r1:32b/output/instruction/1428352 | 47 + .../deepseek-r1:32b/output/instruction/1441 | 37 + .../deepseek-r1:32b/output/instruction/1452 | 4 + .../deepseek-r1:32b/output/instruction/1469342 | 6 + .../deepseek-r1:32b/output/instruction/1471 | 19 + .../deepseek-r1:32b/output/instruction/1512 | 4 + .../deepseek-r1:32b/output/instruction/1536 | 19 + .../deepseek-r1:32b/output/instruction/1547 | 15 + .../deepseek-r1:32b/output/instruction/1553 | 15 + .../deepseek-r1:32b/output/instruction/1574346 | 15 + .../deepseek-r1:32b/output/instruction/1590336 | 18 + .../deepseek-r1:32b/output/instruction/1594069 | 11 + .../deepseek-r1:32b/output/instruction/1605123 | 31 + .../deepseek-r1:32b/output/instruction/1606 | 32 + .../deepseek-r1:32b/output/instruction/1611394 | 32 + .../deepseek-r1:32b/output/instruction/1612 | 54 + .../deepseek-r1:32b/output/instruction/1613817 | 59 + .../deepseek-r1:32b/output/instruction/1620 | 97 ++ .../deepseek-r1:32b/output/instruction/1637 | 4 + .../deepseek-r1:32b/output/instruction/1641637 | 716 +++++++++++ .../deepseek-r1:32b/output/instruction/1642 | 25 + .../deepseek-r1:32b/output/instruction/1701821 | 217 ++++ .../deepseek-r1:32b/output/instruction/1713066 | 22 + .../deepseek-r1:32b/output/instruction/1722 | 90 ++ .../deepseek-r1:32b/output/instruction/1727737 | 28 + .../deepseek-r1:32b/output/instruction/1737 | 52 + .../deepseek-r1:32b/output/instruction/1738434 | 31 + .../deepseek-r1:32b/output/instruction/1748296 | 28 + .../deepseek-r1:32b/output/instruction/1751422 | 7 + .../deepseek-r1:32b/output/instruction/1771 | 36 + .../deepseek-r1:32b/output/instruction/1780 | 20 + .../deepseek-r1:32b/output/instruction/1781281 | 31 + .../deepseek-r1:32b/output/instruction/1790 | 32 + .../deepseek-r1:32b/output/instruction/1793119 | 32 + .../deepseek-r1:32b/output/instruction/1793608 | 19 + .../deepseek-r1:32b/output/instruction/1806243 | 87 ++ .../deepseek-r1:32b/output/instruction/1815024 | 18 + .../deepseek-r1:32b/output/instruction/1818075 | 56 + .../deepseek-r1:32b/output/instruction/1820686 | 8 + .../deepseek-r1:32b/output/instruction/1821430 | 35 + .../deepseek-r1:32b/output/instruction/1821444 | 32 + .../deepseek-r1:32b/output/instruction/1824344 | 48 + .../deepseek-r1:32b/output/instruction/1826568 | 16 + .../deepseek-r1:32b/output/instruction/1828867 | 11 + .../deepseek-r1:32b/output/instruction/1832422 | 12 + .../deepseek-r1:32b/output/instruction/1833 | 87 ++ .../deepseek-r1:32b/output/instruction/1841990 | 41 + .../deepseek-r1:32b/output/instruction/1847467 | 19 + .../deepseek-r1:32b/output/instruction/1854738 | 31 + .../deepseek-r1:32b/output/instruction/1859713 | 28 + .../deepseek-r1:32b/output/instruction/1861404 | 53 + .../deepseek-r1:32b/output/instruction/1863247 | 11 + .../deepseek-r1:32b/output/instruction/1873898 | 41 + .../deepseek-r1:32b/output/instruction/1874888 | 46 + .../deepseek-r1:32b/output/instruction/1877794 | 6 + .../deepseek-r1:32b/output/instruction/1881450 | 26 + .../deepseek-r1:32b/output/instruction/1889288 | 10 + .../deepseek-r1:32b/output/instruction/1901 | 22 + .../deepseek-r1:32b/output/instruction/1904210 | 54 + .../deepseek-r1:32b/output/instruction/1905356 | 15 + .../deepseek-r1:32b/output/instruction/1908626 | 68 + .../deepseek-r1:32b/output/instruction/1909 | 53 + .../deepseek-r1:32b/output/instruction/1912934 | 20 + .../deepseek-r1:32b/output/instruction/1913913 | 21 + .../deepseek-r1:32b/output/instruction/1914021 | 30 + .../deepseek-r1:32b/output/instruction/1915327 | 37 + .../deepseek-r1:32b/output/instruction/1916269 | 22 + .../deepseek-r1:32b/output/instruction/1922887 | 33 + .../deepseek-r1:32b/output/instruction/1925512 | 21 + .../deepseek-r1:32b/output/instruction/1926759 | 21 + .../deepseek-r1:32b/output/instruction/1967248 | 41 + .../deepseek-r1:32b/output/instruction/2078 | 37 + .../deepseek-r1:32b/output/instruction/2083 | 114 ++ .../deepseek-r1:32b/output/instruction/2089 | 30 + .../deepseek-r1:32b/output/instruction/2136 | 38 + .../deepseek-r1:32b/output/instruction/2175 | 41 + .../deepseek-r1:32b/output/instruction/2203 | 4 + .../deepseek-r1:32b/output/instruction/2302 | 28 + .../deepseek-r1:32b/output/instruction/2317 | 41 + .../deepseek-r1:32b/output/instruction/2318 | 37 + .../deepseek-r1:32b/output/instruction/2319 | 20 + .../deepseek-r1:32b/output/instruction/2371 | 55 + .../deepseek-r1:32b/output/instruction/2372 | 112 ++ .../deepseek-r1:32b/output/instruction/2373 | 98 ++ .../deepseek-r1:32b/output/instruction/2374 | 114 ++ .../deepseek-r1:32b/output/instruction/2375 | 88 ++ .../deepseek-r1:32b/output/instruction/2376 | 117 ++ .../deepseek-r1:32b/output/instruction/2386 | 46 + .../deepseek-r1:32b/output/instruction/2419 | 21 + .../deepseek-r1:32b/output/instruction/2422 | 72 ++ .../deepseek-r1:32b/output/instruction/2474 | 99 ++ .../deepseek-r1:32b/output/instruction/2483 | 23 + .../deepseek-r1:32b/output/instruction/2487 | 71 + .../deepseek-r1:32b/output/instruction/2495 | 75 ++ .../deepseek-r1:32b/output/instruction/2497 | 6 + .../deepseek-r1:32b/output/instruction/2498 | 54 + .../deepseek-r1:32b/output/instruction/2499 | 33 + .../deepseek-r1:32b/output/instruction/2500 | 7 + .../deepseek-r1:32b/output/instruction/2595 | 138 ++ .../deepseek-r1:32b/output/instruction/2604 | 47 + .../deepseek-r1:32b/output/instruction/266 | 4 + .../deepseek-r1:32b/output/instruction/2696 | 15 + .../deepseek-r1:32b/output/instruction/2775 | 137 ++ .../deepseek-r1:32b/output/instruction/2865 | 55 + .../deepseek-r1:32b/output/instruction/2878 | 4 + .../deepseek-r1:32b/output/instruction/2971 | 47 + .../deepseek-r1:32b/output/instruction/312 | 4 + .../deepseek-r1:32b/output/instruction/364 | 4 + .../deepseek-r1:32b/output/instruction/381 | 4 + .../deepseek-r1:32b/output/instruction/390 | 4 + .../deepseek-r1:32b/output/instruction/422 | 4 + .../deepseek-r1:32b/output/instruction/427 | 4 + .../deepseek-r1:32b/output/instruction/449 | 71 + .../deepseek-r1:32b/output/instruction/494 | 4 + .../deepseek-r1:32b/output/instruction/508 | 4 + .../deepseek-r1:32b/output/instruction/618 | 98 ++ .../deepseek-r1:32b/output/instruction/625 | 26 + .../deepseek-r1:32b/output/instruction/754 | 210 +++ .../deepseek-r1:32b/output/instruction/799 | 50 + .../deepseek-r1:32b/output/instruction/824 | 15 + .../deepseek-r1:32b/output/instruction/826 | 19 + .../deepseek-r1:32b/output/instruction/837 | 33 + .../deepseek-r1:32b/output/instruction/890 | 4 + .../deepseek-r1:32b/output/instruction/904308 | 101 ++ .../deepseek-r1:32b/output/instruction/952 | 100 ++ .../deepseek-r1:32b/output/instruction/979 | 10 + .../deepseek-r1:32b/output/instruction/984 | 26 + .../deepseek-r1:32b/output/instruction/993 | 84 ++ .../deepseek-r1:32b/output/instruction/998 | 63 + .../deepseek-r1:32b/output/manual-review/1033 | 30 + .../deepseek-r1:32b/output/manual-review/1054831 | 20 + .../deepseek-r1:32b/output/manual-review/1066909 | 10 + .../deepseek-r1:32b/output/manual-review/1075272 | 16 + .../deepseek-r1:32b/output/manual-review/1075339 | 6 + .../deepseek-r1:32b/output/manual-review/122 | 4 + .../deepseek-r1:32b/output/manual-review/127 | 4 + .../deepseek-r1:32b/output/manual-review/1394 | 64 + .../deepseek-r1:32b/output/manual-review/1416988 | 35 + .../deepseek-r1:32b/output/manual-review/1643619 | 35 + .../deepseek-r1:32b/output/manual-review/1673976 | 14 + .../deepseek-r1:32b/output/manual-review/1701973 | 20 + .../deepseek-r1:32b/output/manual-review/1729 | 50 + .../deepseek-r1:32b/output/manual-review/1734792 | 10 + .../deepseek-r1:32b/output/manual-review/1760 | 56 + .../deepseek-r1:32b/output/manual-review/1761153 | 26 + .../deepseek-r1:32b/output/manual-review/1783362 | 50 + .../deepseek-r1:32b/output/manual-review/1805913 | 24 + .../deepseek-r1:32b/output/manual-review/1810433 | 50 + .../deepseek-r1:32b/output/manual-review/1837 | 38 + .../deepseek-r1:32b/output/manual-review/1869073 | 10 + .../deepseek-r1:32b/output/manual-review/1869241 | 22 + .../deepseek-r1:32b/output/manual-review/1910605 | 19 + .../deepseek-r1:32b/output/manual-review/1926521 | 65 + .../deepseek-r1:32b/output/manual-review/2101 | 20 + .../deepseek-r1:32b/output/manual-review/2122 | 10 + .../deepseek-r1:32b/output/manual-review/2248 | 39 + .../deepseek-r1:32b/output/manual-review/2262 | 202 +++ .../deepseek-r1:32b/output/manual-review/2333 | 48 + .../deepseek-r1:32b/output/manual-review/2410 | 95 ++ .../deepseek-r1:32b/output/manual-review/2446 | 63 + .../deepseek-r1:32b/output/manual-review/2553 | 85 ++ .../deepseek-r1:32b/output/manual-review/570 | 4 + .../deepseek-r1:32b/output/manual-review/602 | 16 + .../deepseek-r1:32b/output/manual-review/817 | 4 + .../deepseek-r1:32b/output/manual-review/829 | 17 + .../deepseek-r1:32b/output/manual-review/833 | 45 + .../deepseek-r1:32b/output/manual-review/911 | 20 + .../deepseek-r1:32b/output/manual-review/957 | 74 ++ .../deepseek-r1:32b/output/manual-review/982 | 40 + .../classifier/deepseek-r1:32b/output/runtime/1010 | 81 ++ .../deepseek-r1:32b/output/runtime/1010484 | 9 + .../classifier/deepseek-r1:32b/output/runtime/1027 | 18 + .../deepseek-r1:32b/output/runtime/1031920 | 40 + .../classifier/deepseek-r1:32b/output/runtime/1034 | 20 + .../classifier/deepseek-r1:32b/output/runtime/1041 | 34 + .../classifier/deepseek-r1:32b/output/runtime/1044 | 4 + .../deepseek-r1:32b/output/runtime/1052857 | 18 + .../classifier/deepseek-r1:32b/output/runtime/1059 | 13 + .../deepseek-r1:32b/output/runtime/1068900 | 8 + .../classifier/deepseek-r1:32b/output/runtime/1070 | 13 + .../classifier/deepseek-r1:32b/output/runtime/1072 | 27 + .../classifier/deepseek-r1:32b/output/runtime/1075 | 19 + .../classifier/deepseek-r1:32b/output/runtime/1093 | 36 + .../deepseek-r1:32b/output/runtime/1095531 | 60 + .../deepseek-r1:32b/output/runtime/1098729 | 46 + .../classifier/deepseek-r1:32b/output/runtime/1102 | 41 + .../classifier/deepseek-r1:32b/output/runtime/1128 | 27 + .../classifier/deepseek-r1:32b/output/runtime/1143 | 81 ++ .../classifier/deepseek-r1:32b/output/runtime/1147 | 12 + .../deepseek-r1:32b/output/runtime/1165383 | 6 + .../deepseek-r1:32b/output/runtime/1172613 | 66 + .../deepseek-r1:32b/output/runtime/1182490 | 79 ++ .../deepseek-r1:32b/output/runtime/1187319 | 11 + .../deepseek-r1:32b/output/runtime/1207896 | 6 + .../classifier/deepseek-r1:32b/output/runtime/1209 | 8 + .../classifier/deepseek-r1:32b/output/runtime/1211 | 10 + .../deepseek-r1:32b/output/runtime/1221966 | 37 + .../classifier/deepseek-r1:32b/output/runtime/1228 | 46 + .../deepseek-r1:32b/output/runtime/1233225 | 27 + .../deepseek-r1:32b/output/runtime/1245703 | 12 + .../deepseek-r1:32b/output/runtime/1246990 | 41 + .../classifier/deepseek-r1:32b/output/runtime/1248 | 14 + .../deepseek-r1:32b/output/runtime/1254672 | 44 + .../deepseek-r1:32b/output/runtime/1254828 | 40 + .../classifier/deepseek-r1:32b/output/runtime/1255 | 14 + .../deepseek-r1:32b/output/runtime/1261743 | 8 + .../deepseek-r1:32b/output/runtime/1263747 | 32 + .../classifier/deepseek-r1:32b/output/runtime/1267 | 96 ++ .../deepseek-r1:32b/output/runtime/1285363 | 48 + .../deepseek-r1:32b/output/runtime/1287195 | 6 + .../deepseek-r1:32b/output/runtime/1294898 | 81 ++ .../deepseek-r1:32b/output/runtime/1311614 | 50 + .../deepseek-r1:32b/output/runtime/1346769 | 39 + .../deepseek-r1:32b/output/runtime/1346784 | 70 + .../deepseek-r1:32b/output/runtime/1357206 | 62 + .../deepseek-r1:32b/output/runtime/1357226 | 14 + .../classifier/deepseek-r1:32b/output/runtime/1361 | 23 + .../deepseek-r1:32b/output/runtime/1361912 | 12 + .../deepseek-r1:32b/output/runtime/1362635 | 45 + .../classifier/deepseek-r1:32b/output/runtime/1368 | 41 + .../classifier/deepseek-r1:32b/output/runtime/1388 | 17 + .../classifier/deepseek-r1:32b/output/runtime/1397 | 4 + .../classifier/deepseek-r1:32b/output/runtime/140 | 4 + .../classifier/deepseek-r1:32b/output/runtime/1412 | 8 + .../deepseek-r1:32b/output/runtime/1429313 | 12 + .../classifier/deepseek-r1:32b/output/runtime/1435 | 19 + .../classifier/deepseek-r1:32b/output/runtime/1478 | 69 + .../classifier/deepseek-r1:32b/output/runtime/1495 | 9 + .../deepseek-r1:32b/output/runtime/1519037 | 10 + .../deepseek-r1:32b/output/runtime/1527765 | 75 ++ .../classifier/deepseek-r1:32b/output/runtime/1528 | 12 + .../deepseek-r1:32b/output/runtime/1528239 | 48 + .../classifier/deepseek-r1:32b/output/runtime/1531 | 18 + .../deepseek-r1:32b/output/runtime/1533141 | 18 + .../classifier/deepseek-r1:32b/output/runtime/1541 | 35 + .../deepseek-r1:32b/output/runtime/1550503 | 16 + .../deepseek-r1:32b/output/runtime/1568107 | 12 + .../deepseek-r1:32b/output/runtime/1591611 | 26 + .../classifier/deepseek-r1:32b/output/runtime/1593 | 10 + .../deepseek-r1:32b/output/runtime/1603734 | 10 + .../deepseek-r1:32b/output/runtime/1614348 | 42 + .../deepseek-r1:32b/output/runtime/1623020 | 58 + .../deepseek-r1:32b/output/runtime/1641861 | 39 + .../classifier/deepseek-r1:32b/output/runtime/1648 | 61 + .../classifier/deepseek-r1:32b/output/runtime/1650 | 17 + .../deepseek-r1:32b/output/runtime/1654137 | 10 + .../deepseek-r1:32b/output/runtime/1659901 | 12 + .../deepseek-r1:32b/output/runtime/1661815 | 29 + .../deepseek-r1:32b/output/runtime/1667401 | 70 + .../classifier/deepseek-r1:32b/output/runtime/1671 | 1360 ++++++++++++++++++++ .../deepseek-r1:32b/output/runtime/1696353 | 38 + .../classifier/deepseek-r1:32b/output/runtime/1697 | 22 + .../deepseek-r1:32b/output/runtime/1704638 | 68 + .../classifier/deepseek-r1:32b/output/runtime/1707 | 26 + .../deepseek-r1:32b/output/runtime/1715162 | 75 ++ .../deepseek-r1:32b/output/runtime/1716767 | 37 + .../deepseek-r1:32b/output/runtime/1724485 | 21 + .../deepseek-r1:32b/output/runtime/1725267 | 34 + .../classifier/deepseek-r1:32b/output/runtime/1734 | 19 + .../deepseek-r1:32b/output/runtime/1735384 | 23 + .../classifier/deepseek-r1:32b/output/runtime/1736 | 70 + .../deepseek-r1:32b/output/runtime/1737444 | 96 ++ .../deepseek-r1:32b/output/runtime/1738545 | 34 + .../deepseek-r1:32b/output/runtime/1740219 | 62 + .../classifier/deepseek-r1:32b/output/runtime/1741 | 4 + .../deepseek-r1:32b/output/runtime/1748612 | 18 + .../classifier/deepseek-r1:32b/output/runtime/1755 | 23 + .../classifier/deepseek-r1:32b/output/runtime/1756 | 46 + .../deepseek-r1:32b/output/runtime/1756519 | 49 + .../deepseek-r1:32b/output/runtime/1756807 | 70 + .../deepseek-r1:32b/output/runtime/1756927 | 21 + .../deepseek-r1:32b/output/runtime/1761401 | 13 + .../deepseek-r1:32b/output/runtime/1761535 | 39 + .../classifier/deepseek-r1:32b/output/runtime/1763 | 15 + .../deepseek-r1:32b/output/runtime/1765970 | 64 + .../classifier/deepseek-r1:32b/output/runtime/1768 | 35 + .../deepseek-r1:32b/output/runtime/1768246 | 16 + .../deepseek-r1:32b/output/runtime/1773743 | 24 + .../deepseek-r1:32b/output/runtime/1774149 | 79 ++ .../deepseek-r1:32b/output/runtime/1777226 | 18 + .../classifier/deepseek-r1:32b/output/runtime/1779 | 33 + .../deepseek-r1:32b/output/runtime/1779634 | 38 + .../deepseek-r1:32b/output/runtime/1785734 | 78 ++ .../deepseek-r1:32b/output/runtime/1793539 | 12 + .../deepseek-r1:32b/output/runtime/1796520 | 39 + .../classifier/deepseek-r1:32b/output/runtime/1798 | 4 + .../deepseek-r1:32b/output/runtime/1799200 | 43 + .../classifier/deepseek-r1:32b/output/runtime/1805 | 69 + .../classifier/deepseek-r1:32b/output/runtime/1807 | 27 + .../deepseek-r1:32b/output/runtime/1808563 | 20 + .../deepseek-r1:32b/output/runtime/1808565 | 10 + .../classifier/deepseek-r1:32b/output/runtime/1812 | 28 + .../deepseek-r1:32b/output/runtime/1812451 | 17 + .../deepseek-r1:32b/output/runtime/1812861 | 25 + .../deepseek-r1:32b/output/runtime/1813398 | 44 + .../deepseek-r1:32b/output/runtime/1814128 | 158 +++ .../deepseek-r1:32b/output/runtime/1818483 | 45 + .../classifier/deepseek-r1:32b/output/runtime/1819 | 13 + .../deepseek-r1:32b/output/runtime/1821515 | 41 + .../deepseek-r1:32b/output/runtime/1829459 | 38 + .../classifier/deepseek-r1:32b/output/runtime/1830 | 29 + .../deepseek-r1:32b/output/runtime/1832353 | 23 + .../deepseek-r1:32b/output/runtime/1832916 | 8 + .../deepseek-r1:32b/output/runtime/1833668 | 30 + .../deepseek-r1:32b/output/runtime/1834496 | 30 + .../deepseek-r1:32b/output/runtime/1835693 | 20 + .../deepseek-r1:32b/output/runtime/1835839 | 24 + .../deepseek-r1:32b/output/runtime/1836078 | 25 + .../deepseek-r1:32b/output/runtime/1836192 | 24 + .../deepseek-r1:32b/output/runtime/1836558 | 51 + .../deepseek-r1:32b/output/runtime/1840922 | 24 + .../classifier/deepseek-r1:32b/output/runtime/1854 | 21 + .../classifier/deepseek-r1:32b/output/runtime/1857 | 55 + .../deepseek-r1:32b/output/runtime/1858415 | 27 + .../deepseek-r1:32b/output/runtime/1860056 | 23 + .../deepseek-r1:32b/output/runtime/1860610 | 10 + .../deepseek-r1:32b/output/runtime/1861605 | 19 + .../deepseek-r1:32b/output/runtime/1862167 | 6 + .../deepseek-r1:32b/output/runtime/1862986 | 67 + .../deepseek-r1:32b/output/runtime/1863445 | 19 + .../deepseek-r1:32b/output/runtime/1869782 | 16 + .../deepseek-r1:32b/output/runtime/1870477 | 36 + .../deepseek-r1:32b/output/runtime/1878501 | 34 + .../deepseek-r1:32b/output/runtime/1880225 | 140 ++ .../deepseek-r1:32b/output/runtime/1880332 | 10 + .../deepseek-r1:32b/output/runtime/1880722 | 17 + .../deepseek-r1:32b/output/runtime/1883268 | 40 + .../deepseek-r1:32b/output/runtime/1883784 | 12 + .../deepseek-r1:32b/output/runtime/1885350 | 26 + .../deepseek-r1:32b/output/runtime/1886097 | 36 + .../deepseek-r1:32b/output/runtime/1887306 | 58 + .../deepseek-r1:32b/output/runtime/1888303 | 23 + .../deepseek-r1:32b/output/runtime/1888728 | 22 + .../deepseek-r1:32b/output/runtime/1889411 | 66 + .../classifier/deepseek-r1:32b/output/runtime/1890 | 28 + .../deepseek-r1:32b/output/runtime/1892081 | 17 + .../deepseek-r1:32b/output/runtime/1894029 | 42 + .../classifier/deepseek-r1:32b/output/runtime/1895 | 149 +++ .../deepseek-r1:32b/output/runtime/1895080 | 39 + .../deepseek-r1:32b/output/runtime/1895305 | 51 + .../deepseek-r1:32b/output/runtime/1895471 | 26 + .../deepseek-r1:32b/output/runtime/1895703 | 21 + .../deepseek-r1:32b/output/runtime/1904259 | 32 + .../deepseek-r1:32b/output/runtime/1906536 | 33 + .../deepseek-r1:32b/output/runtime/1907817 | 46 + .../deepseek-r1:32b/output/runtime/1907969 | 61 + .../classifier/deepseek-r1:32b/output/runtime/1908 | 52 + .../deepseek-r1:32b/output/runtime/1908551 | 57 + .../deepseek-r1:32b/output/runtime/1909921 | 25 + .../classifier/deepseek-r1:32b/output/runtime/1910 | 65 + .../classifier/deepseek-r1:32b/output/runtime/1913 | 22 + .../deepseek-r1:32b/output/runtime/1914870 | 60 + .../deepseek-r1:32b/output/runtime/1915531 | 57 + .../deepseek-r1:32b/output/runtime/1915925 | 20 + .../deepseek-r1:32b/output/runtime/1916344 | 27 + .../deepseek-r1:32b/output/runtime/1917184 | 8 + .../deepseek-r1:32b/output/runtime/1918026 | 32 + .../deepseek-r1:32b/output/runtime/1926044 | 33 + .../deepseek-r1:32b/output/runtime/1926202 | 21 + .../deepseek-r1:32b/output/runtime/1926246 | 53 + .../deepseek-r1:32b/output/runtime/1927530 | 42 + .../classifier/deepseek-r1:32b/output/runtime/1930 | 49 + .../deepseek-r1:32b/output/runtime/1936977 | 10 + .../classifier/deepseek-r1:32b/output/runtime/1941 | 105 ++ .../classifier/deepseek-r1:32b/output/runtime/1952 | 99 ++ .../classifier/deepseek-r1:32b/output/runtime/1953 | 149 +++ .../classifier/deepseek-r1:32b/output/runtime/2027 | 236 ++++ .../classifier/deepseek-r1:32b/output/runtime/2035 | 57 + .../deepseek-r1:32b/output/runtime/2072564 | 48 + .../classifier/deepseek-r1:32b/output/runtime/2082 | 47 + .../classifier/deepseek-r1:32b/output/runtime/2119 | 4 + .../classifier/deepseek-r1:32b/output/runtime/2127 | 4 + .../classifier/deepseek-r1:32b/output/runtime/2156 | 18 + .../classifier/deepseek-r1:32b/output/runtime/2157 | 46 + .../classifier/deepseek-r1:32b/output/runtime/2208 | 91 ++ .../classifier/deepseek-r1:32b/output/runtime/2223 | 38 + .../classifier/deepseek-r1:32b/output/runtime/2304 | 41 + .../classifier/deepseek-r1:32b/output/runtime/2336 | 26 + .../classifier/deepseek-r1:32b/output/runtime/2353 | 59 + .../classifier/deepseek-r1:32b/output/runtime/2448 | 49 + .../classifier/deepseek-r1:32b/output/runtime/2460 | 11 + .../classifier/deepseek-r1:32b/output/runtime/2486 | 15 + .../classifier/deepseek-r1:32b/output/runtime/2505 | 4 + .../classifier/deepseek-r1:32b/output/runtime/2525 | 4 + .../classifier/deepseek-r1:32b/output/runtime/2536 | 4 + .../classifier/deepseek-r1:32b/output/runtime/2560 | 108 ++ .../classifier/deepseek-r1:32b/output/runtime/2569 | 8 + .../classifier/deepseek-r1:32b/output/runtime/2580 | 15 + .../classifier/deepseek-r1:32b/output/runtime/2590 | 26 + .../classifier/deepseek-r1:32b/output/runtime/2596 | 4 + .../classifier/deepseek-r1:32b/output/runtime/2598 | 4 + .../classifier/deepseek-r1:32b/output/runtime/2606 | 201 +++ .../classifier/deepseek-r1:32b/output/runtime/261 | 4 + .../classifier/deepseek-r1:32b/output/runtime/2619 | 4 + .../classifier/deepseek-r1:32b/output/runtime/2628 | 23 + .../classifier/deepseek-r1:32b/output/runtime/2632 | 86 ++ .../classifier/deepseek-r1:32b/output/runtime/2647 | 50 + .../classifier/deepseek-r1:32b/output/runtime/2655 | 42 + .../classifier/deepseek-r1:32b/output/runtime/2672 | 23 + .../classifier/deepseek-r1:32b/output/runtime/2683 | 42 + .../classifier/deepseek-r1:32b/output/runtime/2730 | 13 + .../classifier/deepseek-r1:32b/output/runtime/2738 | 13 + .../classifier/deepseek-r1:32b/output/runtime/275 | 4 + .../classifier/deepseek-r1:32b/output/runtime/276 | 4 + .../classifier/deepseek-r1:32b/output/runtime/2761 | 11 + .../classifier/deepseek-r1:32b/output/runtime/280 | 4 + .../classifier/deepseek-r1:32b/output/runtime/2802 | 29 + .../classifier/deepseek-r1:32b/output/runtime/2815 | 4 + .../classifier/deepseek-r1:32b/output/runtime/2846 | 4 + .../classifier/deepseek-r1:32b/output/runtime/311 | 4 + .../classifier/deepseek-r1:32b/output/runtime/333 | 4 + .../classifier/deepseek-r1:32b/output/runtime/355 | 4 + .../classifier/deepseek-r1:32b/output/runtime/361 | 4 + .../classifier/deepseek-r1:32b/output/runtime/385 | 4 + .../classifier/deepseek-r1:32b/output/runtime/419 | 4 + .../classifier/deepseek-r1:32b/output/runtime/442 | 4 + .../classifier/deepseek-r1:32b/output/runtime/447 | 4 + .../classifier/deepseek-r1:32b/output/runtime/514 | 28 + .../deepseek-r1:32b/output/runtime/562107 | 15 + .../classifier/deepseek-r1:32b/output/runtime/616 | 110 ++ .../classifier/deepseek-r1:32b/output/runtime/633 | 35 + .../deepseek-r1:32b/output/runtime/645662 | 43 + .../classifier/deepseek-r1:32b/output/runtime/693 | 13 + .../classifier/deepseek-r1:32b/output/runtime/695 | 4 + .../classifier/deepseek-r1:32b/output/runtime/697 | 4 + .../classifier/deepseek-r1:32b/output/runtime/698 | 361 ++++++ .../classifier/deepseek-r1:32b/output/runtime/704 | 4 + .../classifier/deepseek-r1:32b/output/runtime/714 | 46 + .../deepseek-r1:32b/output/runtime/739785 | 37 + .../deepseek-r1:32b/output/runtime/754635 | 58 + .../deepseek-r1:32b/output/runtime/796480 | 48 + .../classifier/deepseek-r1:32b/output/runtime/805 | 17 + .../classifier/deepseek-r1:32b/output/runtime/834 | 62 + .../classifier/deepseek-r1:32b/output/runtime/856 | 64 + .../classifier/deepseek-r1:32b/output/runtime/866 | 56 + .../deepseek-r1:32b/output/runtime/886621 | 295 +++++ .../classifier/deepseek-r1:32b/output/runtime/909 | 14 + .../classifier/deepseek-r1:32b/output/runtime/922 | 23 + .../classifier/deepseek-r1:32b/output/runtime/939 | 78 ++ .../classifier/deepseek-r1:32b/output/runtime/947 | 16 + .../classifier/deepseek-r1:32b/output/runtime/95 | 4 + .../classifier/deepseek-r1:32b/output/runtime/967 | 227 ++++ .../classifier/deepseek-r1:32b/output/syscall/1007 | 4 + .../classifier/deepseek-r1:32b/output/syscall/1012 | 44 + .../deepseek-r1:32b/output/syscall/1076445 | 48 + .../classifier/deepseek-r1:32b/output/syscall/1111 | 21 + .../classifier/deepseek-r1:32b/output/syscall/121 | 4 + .../classifier/deepseek-r1:32b/output/syscall/1238 | 122 ++ .../classifier/deepseek-r1:32b/output/syscall/1261 | 28 + .../deepseek-r1:32b/output/syscall/1319100 | 72 ++ .../deepseek-r1:32b/output/syscall/1356916 | 9 + .../deepseek-r1:32b/output/syscall/1457275 | 108 ++ .../deepseek-r1:32b/output/syscall/1462640 | 38 + .../deepseek-r1:32b/output/syscall/1470170 | 43 + .../classifier/deepseek-r1:32b/output/syscall/1494 | 935 ++++++++++++++ .../deepseek-r1:32b/output/syscall/1516408 | 34 + .../deepseek-r1:32b/output/syscall/1563612 | 53 + .../deepseek-r1:32b/output/syscall/1585840 | 12 + .../deepseek-r1:32b/output/syscall/1594394 | 44 + .../deepseek-r1:32b/output/syscall/1605443 | 14 + .../deepseek-r1:32b/output/syscall/1617929 | 53 + .../deepseek-r1:32b/output/syscall/1619896 | 53 + .../deepseek-r1:32b/output/syscall/1689367 | 29 + .../deepseek-r1:32b/output/syscall/1696773 | 10 + .../deepseek-r1:32b/output/syscall/1701808 | 19 + .../deepseek-r1:32b/output/syscall/1701971 | 48 + .../deepseek-r1:32b/output/syscall/1701974 | 20 + .../deepseek-r1:32b/output/syscall/1716292 | 33 + .../deepseek-r1:32b/output/syscall/1726394 | 8 + .../deepseek-r1:32b/output/syscall/1728116 | 50 + .../deepseek-r1:32b/output/syscall/1749393 | 29 + .../deepseek-r1:32b/output/syscall/1763536 | 86 ++ .../classifier/deepseek-r1:32b/output/syscall/1770 | 25 + .../deepseek-r1:32b/output/syscall/1776478 | 49 + .../deepseek-r1:32b/output/syscall/1785203 | 46 + .../deepseek-r1:32b/output/syscall/1791763 | 16 + .../deepseek-r1:32b/output/syscall/1791796 | 126 ++ .../deepseek-r1:32b/output/syscall/1813307 | 24 + .../deepseek-r1:32b/output/syscall/1821006 | 38 + .../deepseek-r1:32b/output/syscall/1857811 | 10 + .../deepseek-r1:32b/output/syscall/1858461 | 26 + .../deepseek-r1:32b/output/syscall/1860053 | 23 + .../deepseek-r1:32b/output/syscall/1861341 | 33 + .../deepseek-r1:32b/output/syscall/1876373 | 51 + .../deepseek-r1:32b/output/syscall/1884719 | 135 ++ .../deepseek-r1:32b/output/syscall/1893010 | 8 + .../deepseek-r1:32b/output/syscall/1894361 | 8 + .../deepseek-r1:32b/output/syscall/1906193 | 60 + .../deepseek-r1:32b/output/syscall/1926996 | 23 + .../classifier/deepseek-r1:32b/output/syscall/2112 | 29 + .../classifier/deepseek-r1:32b/output/syscall/2123 | 34 + .../classifier/deepseek-r1:32b/output/syscall/2168 | 35 + .../classifier/deepseek-r1:32b/output/syscall/2170 | 47 + .../classifier/deepseek-r1:32b/output/syscall/2197 | 61 + .../classifier/deepseek-r1:32b/output/syscall/2309 | 34 + .../classifier/deepseek-r1:32b/output/syscall/2390 | 66 + .../classifier/deepseek-r1:32b/output/syscall/2485 | 50 + .../classifier/deepseek-r1:32b/output/syscall/2504 | 10 + .../classifier/deepseek-r1:32b/output/syscall/2592 | 40 + .../classifier/deepseek-r1:32b/output/syscall/263 | 4 + .../classifier/deepseek-r1:32b/output/syscall/2825 | 40 + .../classifier/deepseek-r1:32b/output/syscall/306 | 4 + .../classifier/deepseek-r1:32b/output/syscall/324 | 4 + .../classifier/deepseek-r1:32b/output/syscall/326 | 4 + .../classifier/deepseek-r1:32b/output/syscall/356 | 4 + .../classifier/deepseek-r1:32b/output/syscall/456 | 32 + .../classifier/deepseek-r1:32b/output/syscall/470 | 4 + .../classifier/deepseek-r1:32b/output/syscall/577 | 28 + .../classifier/deepseek-r1:32b/output/syscall/578 | 33 + .../classifier/deepseek-r1:32b/output/syscall/579 | 53 + .../classifier/deepseek-r1:32b/output/syscall/654 | 26 + .../classifier/deepseek-r1:32b/output/syscall/690 | 22 + .../classifier/deepseek-r1:32b/output/syscall/836 | 88 ++ .../classifier/deepseek-r1:32b/output/syscall/871 | 17 + .../classifier/deepseek-r1:32b/output/syscall/885 | 4 + .../classifier/deepseek-r1:32b/output/syscall/927 | 35 + .../deepseek-r1:32b/reasoning/instruction/1022 | 18 + .../deepseek-r1:32b/reasoning/instruction/1028 | 17 + .../deepseek-r1:32b/reasoning/instruction/1051 | 17 + .../deepseek-r1:32b/reasoning/instruction/1054812 | 49 + .../deepseek-r1:32b/reasoning/instruction/1086 | 15 + .../deepseek-r1:32b/reasoning/instruction/1092 | 19 + .../deepseek-r1:32b/reasoning/instruction/1095857 | 21 + .../deepseek-r1:32b/reasoning/instruction/1129571 | 19 + .../deepseek-r1:32b/reasoning/instruction/1156 | 17 + .../deepseek-r1:32b/reasoning/instruction/1178 | 15 + .../deepseek-r1:32b/reasoning/instruction/1245543 | 9 + .../deepseek-r1:32b/reasoning/instruction/1248168 | 19 + .../deepseek-r1:32b/reasoning/instruction/1251 | 13 + .../deepseek-r1:32b/reasoning/instruction/1254786 | 22 + .../deepseek-r1:32b/reasoning/instruction/1267955 | 15 + .../deepseek-r1:32b/reasoning/instruction/1283519 | 29 + .../deepseek-r1:32b/reasoning/instruction/1308381 | 13 + .../deepseek-r1:32b/reasoning/instruction/1328996 | 15 + .../deepseek-r1:32b/reasoning/instruction/1339 | 22 + .../deepseek-r1:32b/reasoning/instruction/1370 | 21 + .../deepseek-r1:32b/reasoning/instruction/1371 | 15 + .../deepseek-r1:32b/reasoning/instruction/1372 | 20 + .../deepseek-r1:32b/reasoning/instruction/1373 | 26 + .../deepseek-r1:32b/reasoning/instruction/1374 | 13 + .../deepseek-r1:32b/reasoning/instruction/1375 | 14 + .../deepseek-r1:32b/reasoning/instruction/1376 | 23 + .../deepseek-r1:32b/reasoning/instruction/1404690 | 30 + .../deepseek-r1:32b/reasoning/instruction/1428352 | 15 + .../deepseek-r1:32b/reasoning/instruction/1441 | 17 + .../deepseek-r1:32b/reasoning/instruction/1452 | 17 + .../deepseek-r1:32b/reasoning/instruction/1469342 | 19 + .../deepseek-r1:32b/reasoning/instruction/1471 | 27 + .../deepseek-r1:32b/reasoning/instruction/1512 | 18 + .../deepseek-r1:32b/reasoning/instruction/1536 | 13 + .../deepseek-r1:32b/reasoning/instruction/1547 | 13 + .../deepseek-r1:32b/reasoning/instruction/1553 | 29 + .../deepseek-r1:32b/reasoning/instruction/1574346 | 15 + .../deepseek-r1:32b/reasoning/instruction/1590336 | 7 + .../deepseek-r1:32b/reasoning/instruction/1594069 | 15 + .../deepseek-r1:32b/reasoning/instruction/1605123 | 13 + .../deepseek-r1:32b/reasoning/instruction/1606 | 15 + .../deepseek-r1:32b/reasoning/instruction/1611394 | 17 + .../deepseek-r1:32b/reasoning/instruction/1612 | 13 + .../deepseek-r1:32b/reasoning/instruction/1613817 | 20 + .../deepseek-r1:32b/reasoning/instruction/1620 | 24 + .../deepseek-r1:32b/reasoning/instruction/1637 | 13 + .../deepseek-r1:32b/reasoning/instruction/1641637 | 14 + .../deepseek-r1:32b/reasoning/instruction/1642 | 13 + .../deepseek-r1:32b/reasoning/instruction/1701821 | 15 + .../deepseek-r1:32b/reasoning/instruction/1713066 | 23 + .../deepseek-r1:32b/reasoning/instruction/1722 | 24 + .../deepseek-r1:32b/reasoning/instruction/1727737 | 21 + .../deepseek-r1:32b/reasoning/instruction/1737 | 15 + .../deepseek-r1:32b/reasoning/instruction/1738434 | 13 + .../deepseek-r1:32b/reasoning/instruction/1748296 | 17 + .../deepseek-r1:32b/reasoning/instruction/1751422 | 13 + .../deepseek-r1:32b/reasoning/instruction/1771 | 19 + .../deepseek-r1:32b/reasoning/instruction/1780 | 17 + .../deepseek-r1:32b/reasoning/instruction/1781281 | 19 + .../deepseek-r1:32b/reasoning/instruction/1790 | 21 + .../deepseek-r1:32b/reasoning/instruction/1793119 | 17 + .../deepseek-r1:32b/reasoning/instruction/1793608 | 20 + .../deepseek-r1:32b/reasoning/instruction/1806243 | 32 + .../deepseek-r1:32b/reasoning/instruction/1815024 | 17 + .../deepseek-r1:32b/reasoning/instruction/1818075 | 13 + .../deepseek-r1:32b/reasoning/instruction/1820686 | 11 + .../deepseek-r1:32b/reasoning/instruction/1821430 | 17 + .../deepseek-r1:32b/reasoning/instruction/1821444 | 11 + .../deepseek-r1:32b/reasoning/instruction/1824344 | 11 + .../deepseek-r1:32b/reasoning/instruction/1826568 | 22 + .../deepseek-r1:32b/reasoning/instruction/1828867 | 13 + .../deepseek-r1:32b/reasoning/instruction/1832422 | 19 + .../deepseek-r1:32b/reasoning/instruction/1833 | 21 + .../deepseek-r1:32b/reasoning/instruction/1841990 | 11 + .../deepseek-r1:32b/reasoning/instruction/1847467 | 15 + .../deepseek-r1:32b/reasoning/instruction/1854738 | 40 + .../deepseek-r1:32b/reasoning/instruction/1859713 | 17 + .../deepseek-r1:32b/reasoning/instruction/1861404 | 11 + .../deepseek-r1:32b/reasoning/instruction/1863247 | 14 + .../deepseek-r1:32b/reasoning/instruction/1873898 | 21 + .../deepseek-r1:32b/reasoning/instruction/1874888 | 19 + .../deepseek-r1:32b/reasoning/instruction/1877794 | 21 + .../deepseek-r1:32b/reasoning/instruction/1881450 | 21 + .../deepseek-r1:32b/reasoning/instruction/1889288 | 13 + .../deepseek-r1:32b/reasoning/instruction/1901 | 15 + .../deepseek-r1:32b/reasoning/instruction/1904210 | 15 + .../deepseek-r1:32b/reasoning/instruction/1905356 | 21 + .../deepseek-r1:32b/reasoning/instruction/1908626 | 23 + .../deepseek-r1:32b/reasoning/instruction/1909 | 23 + .../deepseek-r1:32b/reasoning/instruction/1912934 | 21 + .../deepseek-r1:32b/reasoning/instruction/1913913 | 17 + .../deepseek-r1:32b/reasoning/instruction/1914021 | 21 + .../deepseek-r1:32b/reasoning/instruction/1915327 | 15 + .../deepseek-r1:32b/reasoning/instruction/1916269 | 13 + .../deepseek-r1:32b/reasoning/instruction/1922887 | 13 + .../deepseek-r1:32b/reasoning/instruction/1925512 | 13 + .../deepseek-r1:32b/reasoning/instruction/1926759 | 15 + .../deepseek-r1:32b/reasoning/instruction/1967248 | 21 + .../deepseek-r1:32b/reasoning/instruction/2078 | 19 + .../deepseek-r1:32b/reasoning/instruction/2083 | 19 + .../deepseek-r1:32b/reasoning/instruction/2089 | 15 + .../deepseek-r1:32b/reasoning/instruction/2136 | 19 + .../deepseek-r1:32b/reasoning/instruction/2175 | 19 + .../deepseek-r1:32b/reasoning/instruction/2203 | 11 + .../deepseek-r1:32b/reasoning/instruction/2302 | 23 + .../deepseek-r1:32b/reasoning/instruction/2317 | 13 + .../deepseek-r1:32b/reasoning/instruction/2318 | 19 + .../deepseek-r1:32b/reasoning/instruction/2319 | 13 + .../deepseek-r1:32b/reasoning/instruction/2371 | 19 + .../deepseek-r1:32b/reasoning/instruction/2372 | 11 + .../deepseek-r1:32b/reasoning/instruction/2373 | 18 + .../deepseek-r1:32b/reasoning/instruction/2374 | 11 + .../deepseek-r1:32b/reasoning/instruction/2375 | 15 + .../deepseek-r1:32b/reasoning/instruction/2376 | 19 + .../deepseek-r1:32b/reasoning/instruction/2386 | 121 ++ .../deepseek-r1:32b/reasoning/instruction/2419 | 15 + .../deepseek-r1:32b/reasoning/instruction/2422 | 9 + .../deepseek-r1:32b/reasoning/instruction/2474 | 17 + .../deepseek-r1:32b/reasoning/instruction/2483 | 22 + .../deepseek-r1:32b/reasoning/instruction/2487 | 15 + .../deepseek-r1:32b/reasoning/instruction/2495 | 23 + .../deepseek-r1:32b/reasoning/instruction/2497 | 25 + .../deepseek-r1:32b/reasoning/instruction/2498 | 13 + .../deepseek-r1:32b/reasoning/instruction/2499 | 18 + .../deepseek-r1:32b/reasoning/instruction/2500 | 17 + .../deepseek-r1:32b/reasoning/instruction/2595 | 15 + .../deepseek-r1:32b/reasoning/instruction/2604 | 23 + .../deepseek-r1:32b/reasoning/instruction/266 | 21 + .../deepseek-r1:32b/reasoning/instruction/2696 | 19 + .../deepseek-r1:32b/reasoning/instruction/2775 | 13 + .../deepseek-r1:32b/reasoning/instruction/2865 | 23 + .../deepseek-r1:32b/reasoning/instruction/2878 | 19 + .../deepseek-r1:32b/reasoning/instruction/2971 | 15 + .../deepseek-r1:32b/reasoning/instruction/312 | 13 + .../deepseek-r1:32b/reasoning/instruction/364 | 20 + .../deepseek-r1:32b/reasoning/instruction/381 | 13 + .../deepseek-r1:32b/reasoning/instruction/390 | 19 + .../deepseek-r1:32b/reasoning/instruction/422 | 13 + .../deepseek-r1:32b/reasoning/instruction/427 | 17 + .../deepseek-r1:32b/reasoning/instruction/449 | 15 + .../deepseek-r1:32b/reasoning/instruction/494 | 19 + .../deepseek-r1:32b/reasoning/instruction/508 | 15 + .../deepseek-r1:32b/reasoning/instruction/618 | 17 + .../deepseek-r1:32b/reasoning/instruction/625 | 19 + .../deepseek-r1:32b/reasoning/instruction/754 | 17 + .../deepseek-r1:32b/reasoning/instruction/799 | 19 + .../deepseek-r1:32b/reasoning/instruction/824 | 11 + .../deepseek-r1:32b/reasoning/instruction/826 | 15 + .../deepseek-r1:32b/reasoning/instruction/837 | 15 + .../deepseek-r1:32b/reasoning/instruction/890 | 18 + .../deepseek-r1:32b/reasoning/instruction/904308 | 15 + .../deepseek-r1:32b/reasoning/instruction/952 | 15 + .../deepseek-r1:32b/reasoning/instruction/979 | 18 + .../deepseek-r1:32b/reasoning/instruction/984 | 16 + .../deepseek-r1:32b/reasoning/instruction/993 | 17 + .../deepseek-r1:32b/reasoning/instruction/998 | 11 + .../deepseek-r1:32b/reasoning/manual-review/1033 | 13 + .../reasoning/manual-review/1054831 | 31 + .../reasoning/manual-review/1066909 | 18 + .../reasoning/manual-review/1075272 | 19 + .../reasoning/manual-review/1075339 | 19 + .../deepseek-r1:32b/reasoning/manual-review/122 | 15 + .../deepseek-r1:32b/reasoning/manual-review/127 | 18 + .../deepseek-r1:32b/reasoning/manual-review/1394 | 15 + .../reasoning/manual-review/1416988 | 17 + .../reasoning/manual-review/1643619 | 17 + .../reasoning/manual-review/1673976 | 23 + .../reasoning/manual-review/1701973 | 19 + .../deepseek-r1:32b/reasoning/manual-review/1729 | 18 + .../reasoning/manual-review/1734792 | 17 + .../deepseek-r1:32b/reasoning/manual-review/1760 | 22 + .../reasoning/manual-review/1761153 | 21 + .../reasoning/manual-review/1783362 | 19 + .../reasoning/manual-review/1805913 | 22 + .../reasoning/manual-review/1810433 | 19 + .../deepseek-r1:32b/reasoning/manual-review/1837 | 11 + .../reasoning/manual-review/1869073 | 25 + .../reasoning/manual-review/1869241 | 22 + .../reasoning/manual-review/1910605 | 21 + .../reasoning/manual-review/1926521 | 17 + .../deepseek-r1:32b/reasoning/manual-review/2101 | 23 + .../deepseek-r1:32b/reasoning/manual-review/2122 | 15 + .../deepseek-r1:32b/reasoning/manual-review/2248 | 642 +++++++++ .../deepseek-r1:32b/reasoning/manual-review/2262 | 17 + .../deepseek-r1:32b/reasoning/manual-review/2333 | 23 + .../deepseek-r1:32b/reasoning/manual-review/2410 | 14 + .../deepseek-r1:32b/reasoning/manual-review/2446 | 21 + .../deepseek-r1:32b/reasoning/manual-review/2553 | 22 + .../deepseek-r1:32b/reasoning/manual-review/570 | 11 + .../deepseek-r1:32b/reasoning/manual-review/602 | 15 + .../deepseek-r1:32b/reasoning/manual-review/817 | 13 + .../deepseek-r1:32b/reasoning/manual-review/829 | 21 + .../deepseek-r1:32b/reasoning/manual-review/833 | 21 + .../deepseek-r1:32b/reasoning/manual-review/911 | 17 + .../deepseek-r1:32b/reasoning/manual-review/957 | 17 + .../deepseek-r1:32b/reasoning/manual-review/982 | 27 + .../deepseek-r1:32b/reasoning/runtime/1010 | 11 + .../deepseek-r1:32b/reasoning/runtime/1010484 | 9 + .../deepseek-r1:32b/reasoning/runtime/1027 | 21 + .../deepseek-r1:32b/reasoning/runtime/1031920 | 11 + .../deepseek-r1:32b/reasoning/runtime/1034 | 17 + .../deepseek-r1:32b/reasoning/runtime/1041 | 11 + .../deepseek-r1:32b/reasoning/runtime/1044 | 17 + .../deepseek-r1:32b/reasoning/runtime/1052857 | 11 + .../deepseek-r1:32b/reasoning/runtime/1059 | 21 + .../deepseek-r1:32b/reasoning/runtime/1068900 | 17 + .../deepseek-r1:32b/reasoning/runtime/1070 | 15 + .../deepseek-r1:32b/reasoning/runtime/1072 | 19 + .../deepseek-r1:32b/reasoning/runtime/1075 | 34 + .../deepseek-r1:32b/reasoning/runtime/1093 | 15 + .../deepseek-r1:32b/reasoning/runtime/1095531 | 19 + .../deepseek-r1:32b/reasoning/runtime/1098729 | 22 + .../deepseek-r1:32b/reasoning/runtime/1102 | 24 + .../deepseek-r1:32b/reasoning/runtime/1128 | 13 + .../deepseek-r1:32b/reasoning/runtime/1143 | 13 + .../deepseek-r1:32b/reasoning/runtime/1147 | 20 + .../deepseek-r1:32b/reasoning/runtime/1165383 | 7 + .../deepseek-r1:32b/reasoning/runtime/1172613 | 17 + .../deepseek-r1:32b/reasoning/runtime/1182490 | 11 + .../deepseek-r1:32b/reasoning/runtime/1187319 | 17 + .../deepseek-r1:32b/reasoning/runtime/1207896 | 17 + .../deepseek-r1:32b/reasoning/runtime/1209 | 9 + .../deepseek-r1:32b/reasoning/runtime/1211 | 13 + .../deepseek-r1:32b/reasoning/runtime/1221966 | 11 + .../deepseek-r1:32b/reasoning/runtime/1228 | 21 + .../deepseek-r1:32b/reasoning/runtime/1233225 | 21 + .../deepseek-r1:32b/reasoning/runtime/1245703 | 19 + .../deepseek-r1:32b/reasoning/runtime/1246990 | 19 + .../deepseek-r1:32b/reasoning/runtime/1248 | 14 + .../deepseek-r1:32b/reasoning/runtime/1254672 | 21 + .../deepseek-r1:32b/reasoning/runtime/1254828 | 23 + .../deepseek-r1:32b/reasoning/runtime/1255 | 23 + .../deepseek-r1:32b/reasoning/runtime/1261743 | 17 + .../deepseek-r1:32b/reasoning/runtime/1263747 | 19 + .../deepseek-r1:32b/reasoning/runtime/1267 | 19 + .../deepseek-r1:32b/reasoning/runtime/1285363 | 13 + .../deepseek-r1:32b/reasoning/runtime/1287195 | 7 + .../deepseek-r1:32b/reasoning/runtime/1294898 | 15 + .../deepseek-r1:32b/reasoning/runtime/1311614 | 15 + .../deepseek-r1:32b/reasoning/runtime/1346769 | 13 + .../deepseek-r1:32b/reasoning/runtime/1346784 | 11 + .../deepseek-r1:32b/reasoning/runtime/1357206 | 13 + .../deepseek-r1:32b/reasoning/runtime/1357226 | 15 + .../deepseek-r1:32b/reasoning/runtime/1361 | 21 + .../deepseek-r1:32b/reasoning/runtime/1361912 | 11 + .../deepseek-r1:32b/reasoning/runtime/1362635 | 11 + .../deepseek-r1:32b/reasoning/runtime/1368 | 23 + .../deepseek-r1:32b/reasoning/runtime/1388 | 31 + .../deepseek-r1:32b/reasoning/runtime/1397 | 21 + .../deepseek-r1:32b/reasoning/runtime/140 | 11 + .../deepseek-r1:32b/reasoning/runtime/1412 | 19 + .../deepseek-r1:32b/reasoning/runtime/1429313 | 13 + .../deepseek-r1:32b/reasoning/runtime/1435 | 11 + .../deepseek-r1:32b/reasoning/runtime/1478 | 25 + .../deepseek-r1:32b/reasoning/runtime/1495 | 21 + .../deepseek-r1:32b/reasoning/runtime/1519037 | 23 + .../deepseek-r1:32b/reasoning/runtime/1527765 | 20 + .../deepseek-r1:32b/reasoning/runtime/1528 | 13 + .../deepseek-r1:32b/reasoning/runtime/1528239 | 13 + .../deepseek-r1:32b/reasoning/runtime/1531 | 19 + .../deepseek-r1:32b/reasoning/runtime/1533141 | 15 + .../deepseek-r1:32b/reasoning/runtime/1541 | 19 + .../deepseek-r1:32b/reasoning/runtime/1550503 | 24 + .../deepseek-r1:32b/reasoning/runtime/1568107 | 23 + .../deepseek-r1:32b/reasoning/runtime/1591611 | 16 + .../deepseek-r1:32b/reasoning/runtime/1593 | 17 + .../deepseek-r1:32b/reasoning/runtime/1603734 | 13 + .../deepseek-r1:32b/reasoning/runtime/1614348 | 17 + .../deepseek-r1:32b/reasoning/runtime/1623020 | 15 + .../deepseek-r1:32b/reasoning/runtime/1641861 | 13 + .../deepseek-r1:32b/reasoning/runtime/1648 | 17 + .../deepseek-r1:32b/reasoning/runtime/1650 | 13 + .../deepseek-r1:32b/reasoning/runtime/1654137 | 19 + .../deepseek-r1:32b/reasoning/runtime/1659901 | 15 + .../deepseek-r1:32b/reasoning/runtime/1661815 | 9 + .../deepseek-r1:32b/reasoning/runtime/1667401 | 13 + .../deepseek-r1:32b/reasoning/runtime/1671 | 17 + .../deepseek-r1:32b/reasoning/runtime/1696353 | 13 + .../deepseek-r1:32b/reasoning/runtime/1697 | 13 + .../deepseek-r1:32b/reasoning/runtime/1704638 | 17 + .../deepseek-r1:32b/reasoning/runtime/1707 | 17 + .../deepseek-r1:32b/reasoning/runtime/1715162 | 11 + .../deepseek-r1:32b/reasoning/runtime/1716767 | 19 + .../deepseek-r1:32b/reasoning/runtime/1724485 | 15 + .../deepseek-r1:32b/reasoning/runtime/1725267 | 15 + .../deepseek-r1:32b/reasoning/runtime/1734 | 15 + .../deepseek-r1:32b/reasoning/runtime/1735384 | 21 + .../deepseek-r1:32b/reasoning/runtime/1736 | 29 + .../deepseek-r1:32b/reasoning/runtime/1737444 | 17 + .../deepseek-r1:32b/reasoning/runtime/1738545 | 15 + .../deepseek-r1:32b/reasoning/runtime/1740219 | 34 + .../deepseek-r1:32b/reasoning/runtime/1741 | 13 + .../deepseek-r1:32b/reasoning/runtime/1748612 | 23 + .../deepseek-r1:32b/reasoning/runtime/1755 | 23 + .../deepseek-r1:32b/reasoning/runtime/1756 | 19 + .../deepseek-r1:32b/reasoning/runtime/1756519 | 13 + .../deepseek-r1:32b/reasoning/runtime/1756807 | 27 + .../deepseek-r1:32b/reasoning/runtime/1756927 | 21 + .../deepseek-r1:32b/reasoning/runtime/1761401 | 13 + .../deepseek-r1:32b/reasoning/runtime/1761535 | 13 + .../deepseek-r1:32b/reasoning/runtime/1763 | 25 + .../deepseek-r1:32b/reasoning/runtime/1765970 | 15 + .../deepseek-r1:32b/reasoning/runtime/1768 | 9 + .../deepseek-r1:32b/reasoning/runtime/1768246 | 17 + .../deepseek-r1:32b/reasoning/runtime/1773743 | 13 + .../deepseek-r1:32b/reasoning/runtime/1774149 | 21 + .../deepseek-r1:32b/reasoning/runtime/1777226 | 27 + .../deepseek-r1:32b/reasoning/runtime/1779 | 25 + .../deepseek-r1:32b/reasoning/runtime/1779634 | 20 + .../deepseek-r1:32b/reasoning/runtime/1785734 | 11 + .../deepseek-r1:32b/reasoning/runtime/1793539 | 19 + .../deepseek-r1:32b/reasoning/runtime/1796520 | 9 + .../deepseek-r1:32b/reasoning/runtime/1798 | 11 + .../deepseek-r1:32b/reasoning/runtime/1799200 | 13 + .../deepseek-r1:32b/reasoning/runtime/1805 | 19 + .../deepseek-r1:32b/reasoning/runtime/1807 | 36 + .../deepseek-r1:32b/reasoning/runtime/1808563 | 15 + .../deepseek-r1:32b/reasoning/runtime/1808565 | 15 + .../deepseek-r1:32b/reasoning/runtime/1812 | 19 + .../deepseek-r1:32b/reasoning/runtime/1812451 | 15 + .../deepseek-r1:32b/reasoning/runtime/1812861 | 15 + .../deepseek-r1:32b/reasoning/runtime/1813398 | 13 + .../deepseek-r1:32b/reasoning/runtime/1814128 | 27 + .../deepseek-r1:32b/reasoning/runtime/1818483 | 17 + .../deepseek-r1:32b/reasoning/runtime/1819 | 13 + .../deepseek-r1:32b/reasoning/runtime/1821515 | 19 + .../deepseek-r1:32b/reasoning/runtime/1829459 | 17 + .../deepseek-r1:32b/reasoning/runtime/1830 | 25 + .../deepseek-r1:32b/reasoning/runtime/1832353 | 21 + .../deepseek-r1:32b/reasoning/runtime/1832916 | 17 + .../deepseek-r1:32b/reasoning/runtime/1833668 | 11 + .../deepseek-r1:32b/reasoning/runtime/1834496 | 17 + .../deepseek-r1:32b/reasoning/runtime/1835693 | 21 + .../deepseek-r1:32b/reasoning/runtime/1835839 | 23 + .../deepseek-r1:32b/reasoning/runtime/1836078 | 17 + .../deepseek-r1:32b/reasoning/runtime/1836192 | 19 + .../deepseek-r1:32b/reasoning/runtime/1836558 | 21 + .../deepseek-r1:32b/reasoning/runtime/1840922 | 19 + .../deepseek-r1:32b/reasoning/runtime/1854 | 15 + .../deepseek-r1:32b/reasoning/runtime/1857 | 17 + .../deepseek-r1:32b/reasoning/runtime/1858415 | 20 + .../deepseek-r1:32b/reasoning/runtime/1860056 | 19 + .../deepseek-r1:32b/reasoning/runtime/1860610 | 11 + .../deepseek-r1:32b/reasoning/runtime/1861605 | 13 + .../deepseek-r1:32b/reasoning/runtime/1862167 | 17 + .../deepseek-r1:32b/reasoning/runtime/1862986 | 13 + .../deepseek-r1:32b/reasoning/runtime/1863445 | 11 + .../deepseek-r1:32b/reasoning/runtime/1869782 | 21 + .../deepseek-r1:32b/reasoning/runtime/1870477 | 21 + .../deepseek-r1:32b/reasoning/runtime/1878501 | 18 + .../deepseek-r1:32b/reasoning/runtime/1880225 | 23 + .../deepseek-r1:32b/reasoning/runtime/1880332 | 16 + .../deepseek-r1:32b/reasoning/runtime/1880722 | 15 + .../deepseek-r1:32b/reasoning/runtime/1883268 | 15 + .../deepseek-r1:32b/reasoning/runtime/1883784 | 15 + .../deepseek-r1:32b/reasoning/runtime/1885350 | 13 + .../deepseek-r1:32b/reasoning/runtime/1886097 | 13 + .../deepseek-r1:32b/reasoning/runtime/1887306 | 23 + .../deepseek-r1:32b/reasoning/runtime/1888303 | 23 + .../deepseek-r1:32b/reasoning/runtime/1888728 | 23 + .../deepseek-r1:32b/reasoning/runtime/1889411 | 15 + .../deepseek-r1:32b/reasoning/runtime/1890 | 17 + .../deepseek-r1:32b/reasoning/runtime/1892081 | 20 + .../deepseek-r1:32b/reasoning/runtime/1894029 | 29 + .../deepseek-r1:32b/reasoning/runtime/1895 | 19 + .../deepseek-r1:32b/reasoning/runtime/1895080 | 13 + .../deepseek-r1:32b/reasoning/runtime/1895305 | 19 + .../deepseek-r1:32b/reasoning/runtime/1895471 | 11 + .../deepseek-r1:32b/reasoning/runtime/1895703 | 13 + .../deepseek-r1:32b/reasoning/runtime/1904259 | 21 + .../deepseek-r1:32b/reasoning/runtime/1906536 | 13 + .../deepseek-r1:32b/reasoning/runtime/1907817 | 13 + .../deepseek-r1:32b/reasoning/runtime/1907969 | 19 + .../deepseek-r1:32b/reasoning/runtime/1908 | 18 + .../deepseek-r1:32b/reasoning/runtime/1908551 | 19 + .../deepseek-r1:32b/reasoning/runtime/1909921 | 11 + .../deepseek-r1:32b/reasoning/runtime/1910 | 15 + .../deepseek-r1:32b/reasoning/runtime/1913 | 25 + .../deepseek-r1:32b/reasoning/runtime/1914870 | 29 + .../deepseek-r1:32b/reasoning/runtime/1915531 | 11 + .../deepseek-r1:32b/reasoning/runtime/1915925 | 17 + .../deepseek-r1:32b/reasoning/runtime/1916344 | 13 + .../deepseek-r1:32b/reasoning/runtime/1917184 | 9 + .../deepseek-r1:32b/reasoning/runtime/1918026 | 13 + .../deepseek-r1:32b/reasoning/runtime/1926044 | 13 + .../deepseek-r1:32b/reasoning/runtime/1926202 | 15 + .../deepseek-r1:32b/reasoning/runtime/1926246 | 13 + .../deepseek-r1:32b/reasoning/runtime/1927530 | 21 + .../deepseek-r1:32b/reasoning/runtime/1930 | 33 + .../deepseek-r1:32b/reasoning/runtime/1936977 | 23 + .../deepseek-r1:32b/reasoning/runtime/1941 | 13 + .../deepseek-r1:32b/reasoning/runtime/1952 | 11 + .../deepseek-r1:32b/reasoning/runtime/1953 | 15 + .../deepseek-r1:32b/reasoning/runtime/2027 | 13 + .../deepseek-r1:32b/reasoning/runtime/2035 | 39 + .../deepseek-r1:32b/reasoning/runtime/2072564 | 15 + .../deepseek-r1:32b/reasoning/runtime/2082 | 11 + .../deepseek-r1:32b/reasoning/runtime/2119 | 15 + .../deepseek-r1:32b/reasoning/runtime/2127 | 13 + .../deepseek-r1:32b/reasoning/runtime/2156 | 18 + .../deepseek-r1:32b/reasoning/runtime/2157 | 13 + .../deepseek-r1:32b/reasoning/runtime/2208 | 13 + .../deepseek-r1:32b/reasoning/runtime/2223 | 13 + .../deepseek-r1:32b/reasoning/runtime/2304 | 21 + .../deepseek-r1:32b/reasoning/runtime/2336 | 20 + .../deepseek-r1:32b/reasoning/runtime/2353 | 13 + .../deepseek-r1:32b/reasoning/runtime/2448 | 21 + .../deepseek-r1:32b/reasoning/runtime/2460 | 15 + .../deepseek-r1:32b/reasoning/runtime/2486 | 19 + .../deepseek-r1:32b/reasoning/runtime/2505 | 17 + .../deepseek-r1:32b/reasoning/runtime/2525 | 13 + .../deepseek-r1:32b/reasoning/runtime/2536 | 16 + .../deepseek-r1:32b/reasoning/runtime/2560 | 15 + .../deepseek-r1:32b/reasoning/runtime/2569 | 15 + .../deepseek-r1:32b/reasoning/runtime/2580 | 19 + .../deepseek-r1:32b/reasoning/runtime/2590 | 13 + .../deepseek-r1:32b/reasoning/runtime/2596 | 15 + .../deepseek-r1:32b/reasoning/runtime/2598 | 11 + .../deepseek-r1:32b/reasoning/runtime/2606 | 13 + .../deepseek-r1:32b/reasoning/runtime/261 | 13 + .../deepseek-r1:32b/reasoning/runtime/2619 | 15 + .../deepseek-r1:32b/reasoning/runtime/2628 | 19 + .../deepseek-r1:32b/reasoning/runtime/2632 | 13 + .../deepseek-r1:32b/reasoning/runtime/2647 | 13 + .../deepseek-r1:32b/reasoning/runtime/2655 | 19 + .../deepseek-r1:32b/reasoning/runtime/2672 | 20 + .../deepseek-r1:32b/reasoning/runtime/2683 | 17 + .../deepseek-r1:32b/reasoning/runtime/2730 | 13 + .../deepseek-r1:32b/reasoning/runtime/2738 | 13 + .../deepseek-r1:32b/reasoning/runtime/275 | 17 + .../deepseek-r1:32b/reasoning/runtime/276 | 9 + .../deepseek-r1:32b/reasoning/runtime/2761 | 17 + .../deepseek-r1:32b/reasoning/runtime/280 | 13 + .../deepseek-r1:32b/reasoning/runtime/2802 | 17 + .../deepseek-r1:32b/reasoning/runtime/2815 | 11 + .../deepseek-r1:32b/reasoning/runtime/2846 | 21 + .../deepseek-r1:32b/reasoning/runtime/311 | 13 + .../deepseek-r1:32b/reasoning/runtime/333 | 18 + .../deepseek-r1:32b/reasoning/runtime/355 | 13 + .../deepseek-r1:32b/reasoning/runtime/361 | 13 + .../deepseek-r1:32b/reasoning/runtime/385 | 15 + .../deepseek-r1:32b/reasoning/runtime/419 | 19 + .../deepseek-r1:32b/reasoning/runtime/442 | 17 + .../deepseek-r1:32b/reasoning/runtime/447 | 11 + .../deepseek-r1:32b/reasoning/runtime/514 | 13 + .../deepseek-r1:32b/reasoning/runtime/562107 | 11 + .../deepseek-r1:32b/reasoning/runtime/616 | 15 + .../deepseek-r1:32b/reasoning/runtime/633 | 17 + .../deepseek-r1:32b/reasoning/runtime/645662 | 25 + .../deepseek-r1:32b/reasoning/runtime/693 | 16 + .../deepseek-r1:32b/reasoning/runtime/695 | 11 + .../deepseek-r1:32b/reasoning/runtime/697 | 11 + .../deepseek-r1:32b/reasoning/runtime/698 | 11 + .../deepseek-r1:32b/reasoning/runtime/704 | 13 + .../deepseek-r1:32b/reasoning/runtime/714 | 17 + .../deepseek-r1:32b/reasoning/runtime/739785 | 31 + .../deepseek-r1:32b/reasoning/runtime/754635 | 17 + .../deepseek-r1:32b/reasoning/runtime/796480 | 11 + .../deepseek-r1:32b/reasoning/runtime/805 | 21 + .../deepseek-r1:32b/reasoning/runtime/834 | 21 + .../deepseek-r1:32b/reasoning/runtime/856 | 15 + .../deepseek-r1:32b/reasoning/runtime/866 | 23 + .../deepseek-r1:32b/reasoning/runtime/886621 | 25 + .../deepseek-r1:32b/reasoning/runtime/909 | 21 + .../deepseek-r1:32b/reasoning/runtime/922 | 23 + .../deepseek-r1:32b/reasoning/runtime/939 | 19 + .../deepseek-r1:32b/reasoning/runtime/947 | 15 + .../deepseek-r1:32b/reasoning/runtime/95 | 17 + .../deepseek-r1:32b/reasoning/runtime/967 | 11 + .../deepseek-r1:32b/reasoning/syscall/1007 | 13 + .../deepseek-r1:32b/reasoning/syscall/1012 | 19 + .../deepseek-r1:32b/reasoning/syscall/1076445 | 19 + .../deepseek-r1:32b/reasoning/syscall/1111 | 23 + .../deepseek-r1:32b/reasoning/syscall/121 | 13 + .../deepseek-r1:32b/reasoning/syscall/1238 | 13 + .../deepseek-r1:32b/reasoning/syscall/1261 | 11 + .../deepseek-r1:32b/reasoning/syscall/1319100 | 17 + .../deepseek-r1:32b/reasoning/syscall/1356916 | 14 + .../deepseek-r1:32b/reasoning/syscall/1457275 | 21 + .../deepseek-r1:32b/reasoning/syscall/1462640 | 20 + .../deepseek-r1:32b/reasoning/syscall/1470170 | 13 + .../deepseek-r1:32b/reasoning/syscall/1494 | 21 + .../deepseek-r1:32b/reasoning/syscall/1516408 | 14 + .../deepseek-r1:32b/reasoning/syscall/1563612 | 21 + .../deepseek-r1:32b/reasoning/syscall/1585840 | 33 + .../deepseek-r1:32b/reasoning/syscall/1594394 | 11 + .../deepseek-r1:32b/reasoning/syscall/1605443 | 27 + .../deepseek-r1:32b/reasoning/syscall/1617929 | 28 + .../deepseek-r1:32b/reasoning/syscall/1619896 | 17 + .../deepseek-r1:32b/reasoning/syscall/1689367 | 16 + .../deepseek-r1:32b/reasoning/syscall/1696773 | 15 + .../deepseek-r1:32b/reasoning/syscall/1701808 | 15 + .../deepseek-r1:32b/reasoning/syscall/1701971 | 15 + .../deepseek-r1:32b/reasoning/syscall/1701974 | 19 + .../deepseek-r1:32b/reasoning/syscall/1716292 | 17 + .../deepseek-r1:32b/reasoning/syscall/1726394 | 15 + .../deepseek-r1:32b/reasoning/syscall/1728116 | 21 + .../deepseek-r1:32b/reasoning/syscall/1749393 | 15 + .../deepseek-r1:32b/reasoning/syscall/1763536 | 57 + .../deepseek-r1:32b/reasoning/syscall/1770 | 13 + .../deepseek-r1:32b/reasoning/syscall/1776478 | 23 + .../deepseek-r1:32b/reasoning/syscall/1785203 | 17 + .../deepseek-r1:32b/reasoning/syscall/1791763 | 11 + .../deepseek-r1:32b/reasoning/syscall/1791796 | 19 + .../deepseek-r1:32b/reasoning/syscall/1813307 | 9 + .../deepseek-r1:32b/reasoning/syscall/1821006 | 15 + .../deepseek-r1:32b/reasoning/syscall/1857811 | 23 + .../deepseek-r1:32b/reasoning/syscall/1858461 | 13 + .../deepseek-r1:32b/reasoning/syscall/1860053 | 21 + .../deepseek-r1:32b/reasoning/syscall/1861341 | 13 + .../deepseek-r1:32b/reasoning/syscall/1876373 | 15 + .../deepseek-r1:32b/reasoning/syscall/1884719 | 24 + .../deepseek-r1:32b/reasoning/syscall/1893010 | 17 + .../deepseek-r1:32b/reasoning/syscall/1894361 | 20 + .../deepseek-r1:32b/reasoning/syscall/1906193 | 18 + .../deepseek-r1:32b/reasoning/syscall/1926996 | 22 + .../deepseek-r1:32b/reasoning/syscall/2112 | 21 + .../deepseek-r1:32b/reasoning/syscall/2123 | 19 + .../deepseek-r1:32b/reasoning/syscall/2168 | 19 + .../deepseek-r1:32b/reasoning/syscall/2170 | 21 + .../deepseek-r1:32b/reasoning/syscall/2197 | 18 + .../deepseek-r1:32b/reasoning/syscall/2309 | 21 + .../deepseek-r1:32b/reasoning/syscall/2390 | 19 + .../deepseek-r1:32b/reasoning/syscall/2485 | 22 + .../deepseek-r1:32b/reasoning/syscall/2504 | 17 + .../deepseek-r1:32b/reasoning/syscall/2592 | 19 + .../deepseek-r1:32b/reasoning/syscall/263 | 19 + .../deepseek-r1:32b/reasoning/syscall/2825 | 21 + .../deepseek-r1:32b/reasoning/syscall/306 | 9 + .../deepseek-r1:32b/reasoning/syscall/324 | 13 + .../deepseek-r1:32b/reasoning/syscall/326 | 21 + .../deepseek-r1:32b/reasoning/syscall/356 | 21 + .../deepseek-r1:32b/reasoning/syscall/456 | 21 + .../deepseek-r1:32b/reasoning/syscall/470 | 21 + .../deepseek-r1:32b/reasoning/syscall/577 | 19 + .../deepseek-r1:32b/reasoning/syscall/578 | 15 + .../deepseek-r1:32b/reasoning/syscall/579 | 23 + .../deepseek-r1:32b/reasoning/syscall/654 | 25 + .../deepseek-r1:32b/reasoning/syscall/690 | 23 + .../deepseek-r1:32b/reasoning/syscall/836 | 37 + .../deepseek-r1:32b/reasoning/syscall/871 | 19 + .../deepseek-r1:32b/reasoning/syscall/885 | 19 + .../deepseek-r1:32b/reasoning/syscall/927 | 23 + results/classifier/gemma3:27b/analysis.csv | 2 + results/classifier/gemma3:27b/categories.csv | 6 + results/classifier/gemma3:27b/instruction/1022 | 36 + results/classifier/gemma3:27b/instruction/1028 | 37 + results/classifier/gemma3:27b/instruction/1051 | 4 + results/classifier/gemma3:27b/instruction/1079080 | 43 + results/classifier/gemma3:27b/instruction/1092 | 17 + results/classifier/gemma3:27b/instruction/1095531 | 60 + results/classifier/gemma3:27b/instruction/1095857 | 14 + results/classifier/gemma3:27b/instruction/1128 | 27 + results/classifier/gemma3:27b/instruction/1129571 | 17 + results/classifier/gemma3:27b/instruction/1143 | 81 ++ results/classifier/gemma3:27b/instruction/1156 | 4 + results/classifier/gemma3:27b/instruction/1156313 | 128 ++ results/classifier/gemma3:27b/instruction/1178 | 4 + results/classifier/gemma3:27b/instruction/1245543 | 26 + results/classifier/gemma3:27b/instruction/1248 | 14 + results/classifier/gemma3:27b/instruction/1248168 | 27 + results/classifier/gemma3:27b/instruction/1251 | 18 + results/classifier/gemma3:27b/instruction/1254786 | 45 + results/classifier/gemma3:27b/instruction/1267 | 96 ++ results/classifier/gemma3:27b/instruction/1267955 | 45 + results/classifier/gemma3:27b/instruction/1283519 | 13 + results/classifier/gemma3:27b/instruction/1308381 | 17 + results/classifier/gemma3:27b/instruction/1328996 | 6 + results/classifier/gemma3:27b/instruction/1339 | 19 + results/classifier/gemma3:27b/instruction/1368 | 41 + results/classifier/gemma3:27b/instruction/1370 | 16 + results/classifier/gemma3:27b/instruction/1371 | 22 + results/classifier/gemma3:27b/instruction/1372 | 23 + results/classifier/gemma3:27b/instruction/1373 | 23 + results/classifier/gemma3:27b/instruction/1374 | 25 + results/classifier/gemma3:27b/instruction/1375 | 22 + results/classifier/gemma3:27b/instruction/1376 | 18 + results/classifier/gemma3:27b/instruction/1377 | 27 + results/classifier/gemma3:27b/instruction/1397 | 4 + results/classifier/gemma3:27b/instruction/1404690 | 41 + results/classifier/gemma3:27b/instruction/1412 | 8 + results/classifier/gemma3:27b/instruction/1428352 | 47 + results/classifier/gemma3:27b/instruction/1435 | 19 + results/classifier/gemma3:27b/instruction/1441 | 37 + results/classifier/gemma3:27b/instruction/1452 | 4 + results/classifier/gemma3:27b/instruction/1469342 | 6 + results/classifier/gemma3:27b/instruction/1531 | 18 + results/classifier/gemma3:27b/instruction/1536 | 19 + results/classifier/gemma3:27b/instruction/1574346 | 15 + results/classifier/gemma3:27b/instruction/1590336 | 18 + results/classifier/gemma3:27b/instruction/1594069 | 11 + results/classifier/gemma3:27b/instruction/1605123 | 31 + results/classifier/gemma3:27b/instruction/1606 | 32 + results/classifier/gemma3:27b/instruction/1611394 | 32 + results/classifier/gemma3:27b/instruction/1612 | 54 + results/classifier/gemma3:27b/instruction/1613817 | 59 + results/classifier/gemma3:27b/instruction/1614348 | 42 + results/classifier/gemma3:27b/instruction/1620 | 97 ++ results/classifier/gemma3:27b/instruction/1637 | 4 + results/classifier/gemma3:27b/instruction/1641637 | 716 +++++++++++ results/classifier/gemma3:27b/instruction/1642 | 25 + results/classifier/gemma3:27b/instruction/1701821 | 217 ++++ results/classifier/gemma3:27b/instruction/1713066 | 22 + results/classifier/gemma3:27b/instruction/1722 | 90 ++ results/classifier/gemma3:27b/instruction/1724485 | 21 + results/classifier/gemma3:27b/instruction/1727737 | 28 + results/classifier/gemma3:27b/instruction/1736 | 70 + results/classifier/gemma3:27b/instruction/1737 | 52 + results/classifier/gemma3:27b/instruction/1738434 | 31 + results/classifier/gemma3:27b/instruction/1748296 | 28 + results/classifier/gemma3:27b/instruction/1751422 | 7 + results/classifier/gemma3:27b/instruction/1751494 | 38 + results/classifier/gemma3:27b/instruction/1756927 | 21 + results/classifier/gemma3:27b/instruction/1761401 | 13 + results/classifier/gemma3:27b/instruction/1771 | 36 + results/classifier/gemma3:27b/instruction/1779 | 33 + results/classifier/gemma3:27b/instruction/1780 | 20 + results/classifier/gemma3:27b/instruction/1781281 | 31 + results/classifier/gemma3:27b/instruction/1785734 | 78 ++ results/classifier/gemma3:27b/instruction/1790 | 32 + results/classifier/gemma3:27b/instruction/1793119 | 32 + results/classifier/gemma3:27b/instruction/1793608 | 19 + results/classifier/gemma3:27b/instruction/1796520 | 39 + results/classifier/gemma3:27b/instruction/1806243 | 87 ++ results/classifier/gemma3:27b/instruction/1809546 | 90 ++ results/classifier/gemma3:27b/instruction/1815024 | 18 + results/classifier/gemma3:27b/instruction/1818075 | 56 + results/classifier/gemma3:27b/instruction/1820686 | 8 + results/classifier/gemma3:27b/instruction/1821430 | 35 + results/classifier/gemma3:27b/instruction/1821444 | 32 + results/classifier/gemma3:27b/instruction/1824344 | 48 + results/classifier/gemma3:27b/instruction/1824778 | 30 + results/classifier/gemma3:27b/instruction/1826568 | 16 + results/classifier/gemma3:27b/instruction/1828867 | 11 + results/classifier/gemma3:27b/instruction/1832422 | 12 + results/classifier/gemma3:27b/instruction/1833 | 87 ++ results/classifier/gemma3:27b/instruction/1841990 | 41 + results/classifier/gemma3:27b/instruction/1847467 | 19 + results/classifier/gemma3:27b/instruction/1854738 | 31 + results/classifier/gemma3:27b/instruction/1859713 | 28 + results/classifier/gemma3:27b/instruction/1861404 | 53 + results/classifier/gemma3:27b/instruction/1861605 | 19 + results/classifier/gemma3:27b/instruction/1862167 | 6 + results/classifier/gemma3:27b/instruction/1863247 | 11 + results/classifier/gemma3:27b/instruction/1873898 | 41 + results/classifier/gemma3:27b/instruction/1874888 | 46 + results/classifier/gemma3:27b/instruction/1877794 | 6 + results/classifier/gemma3:27b/instruction/1881450 | 26 + results/classifier/gemma3:27b/instruction/1885350 | 26 + results/classifier/gemma3:27b/instruction/1889288 | 10 + results/classifier/gemma3:27b/instruction/1892081 | 17 + results/classifier/gemma3:27b/instruction/1898954 | 73 ++ results/classifier/gemma3:27b/instruction/1901 | 22 + results/classifier/gemma3:27b/instruction/1904210 | 54 + results/classifier/gemma3:27b/instruction/1905356 | 15 + results/classifier/gemma3:27b/instruction/1906536 | 33 + results/classifier/gemma3:27b/instruction/1908626 | 68 + results/classifier/gemma3:27b/instruction/1910 | 65 + results/classifier/gemma3:27b/instruction/1912934 | 20 + results/classifier/gemma3:27b/instruction/1913913 | 21 + results/classifier/gemma3:27b/instruction/1914021 | 30 + results/classifier/gemma3:27b/instruction/1914870 | 60 + results/classifier/gemma3:27b/instruction/1915327 | 37 + results/classifier/gemma3:27b/instruction/1916269 | 22 + results/classifier/gemma3:27b/instruction/1918026 | 32 + results/classifier/gemma3:27b/instruction/1922887 | 33 + results/classifier/gemma3:27b/instruction/1925512 | 21 + results/classifier/gemma3:27b/instruction/1926202 | 21 + results/classifier/gemma3:27b/instruction/1926759 | 21 + results/classifier/gemma3:27b/instruction/1941 | 105 ++ results/classifier/gemma3:27b/instruction/1955 | 39 + results/classifier/gemma3:27b/instruction/1967248 | 41 + results/classifier/gemma3:27b/instruction/2078 | 37 + results/classifier/gemma3:27b/instruction/2083 | 114 ++ results/classifier/gemma3:27b/instruction/2089 | 30 + results/classifier/gemma3:27b/instruction/2136 | 38 + results/classifier/gemma3:27b/instruction/2175 | 41 + results/classifier/gemma3:27b/instruction/2203 | 4 + results/classifier/gemma3:27b/instruction/2208 | 91 ++ results/classifier/gemma3:27b/instruction/2248 | 39 + results/classifier/gemma3:27b/instruction/2302 | 28 + results/classifier/gemma3:27b/instruction/2317 | 41 + results/classifier/gemma3:27b/instruction/2318 | 37 + results/classifier/gemma3:27b/instruction/2319 | 20 + results/classifier/gemma3:27b/instruction/2371 | 55 + results/classifier/gemma3:27b/instruction/2372 | 112 ++ results/classifier/gemma3:27b/instruction/2373 | 98 ++ results/classifier/gemma3:27b/instruction/2374 | 114 ++ results/classifier/gemma3:27b/instruction/2375 | 88 ++ results/classifier/gemma3:27b/instruction/2376 | 117 ++ results/classifier/gemma3:27b/instruction/2386 | 46 + results/classifier/gemma3:27b/instruction/2419 | 21 + results/classifier/gemma3:27b/instruction/2422 | 72 ++ results/classifier/gemma3:27b/instruction/2474 | 99 ++ results/classifier/gemma3:27b/instruction/2483 | 23 + results/classifier/gemma3:27b/instruction/2487 | 71 + results/classifier/gemma3:27b/instruction/2495 | 75 ++ results/classifier/gemma3:27b/instruction/2497 | 6 + results/classifier/gemma3:27b/instruction/2498 | 54 + results/classifier/gemma3:27b/instruction/2499 | 33 + results/classifier/gemma3:27b/instruction/2500 | 7 + results/classifier/gemma3:27b/instruction/2536 | 4 + results/classifier/gemma3:27b/instruction/2595 | 138 ++ results/classifier/gemma3:27b/instruction/2604 | 47 + results/classifier/gemma3:27b/instruction/2632 | 86 ++ results/classifier/gemma3:27b/instruction/266 | 4 + results/classifier/gemma3:27b/instruction/2672 | 23 + results/classifier/gemma3:27b/instruction/2696 | 15 + results/classifier/gemma3:27b/instruction/2730 | 13 + results/classifier/gemma3:27b/instruction/2775 | 137 ++ results/classifier/gemma3:27b/instruction/2802 | 29 + results/classifier/gemma3:27b/instruction/2865 | 55 + results/classifier/gemma3:27b/instruction/2878 | 4 + results/classifier/gemma3:27b/instruction/2971 | 47 + results/classifier/gemma3:27b/instruction/312 | 4 + results/classifier/gemma3:27b/instruction/361 | 4 + results/classifier/gemma3:27b/instruction/364 | 4 + results/classifier/gemma3:27b/instruction/381 | 4 + results/classifier/gemma3:27b/instruction/385 | 4 + results/classifier/gemma3:27b/instruction/390 | 4 + results/classifier/gemma3:27b/instruction/422 | 4 + results/classifier/gemma3:27b/instruction/427 | 4 + results/classifier/gemma3:27b/instruction/449 | 71 + results/classifier/gemma3:27b/instruction/494 | 4 + results/classifier/gemma3:27b/instruction/508 | 4 + results/classifier/gemma3:27b/instruction/514 | 28 + results/classifier/gemma3:27b/instruction/616 | 110 ++ results/classifier/gemma3:27b/instruction/618 | 98 ++ results/classifier/gemma3:27b/instruction/754 | 210 +++ results/classifier/gemma3:27b/instruction/796480 | 48 + results/classifier/gemma3:27b/instruction/799 | 50 + results/classifier/gemma3:27b/instruction/824 | 15 + results/classifier/gemma3:27b/instruction/826 | 19 + results/classifier/gemma3:27b/instruction/837 | 33 + results/classifier/gemma3:27b/instruction/890 | 4 + results/classifier/gemma3:27b/instruction/904308 | 101 ++ results/classifier/gemma3:27b/instruction/947 | 16 + results/classifier/gemma3:27b/instruction/952 | 100 ++ results/classifier/gemma3:27b/instruction/979 | 10 + results/classifier/gemma3:27b/instruction/984 | 26 + results/classifier/gemma3:27b/instruction/993 | 84 ++ results/classifier/gemma3:27b/instruction/998 | 63 + .../classifier/gemma3:27b/manual-review/1533141 | 18 + results/classifier/gemma3:27b/performance/1895703 | 21 + results/classifier/gemma3:27b/runtime/1010484 | 9 + results/classifier/gemma3:27b/runtime/1027 | 18 + results/classifier/gemma3:27b/runtime/1031920 | 40 + results/classifier/gemma3:27b/runtime/1034 | 20 + results/classifier/gemma3:27b/runtime/1041 | 34 + results/classifier/gemma3:27b/runtime/1044 | 4 + results/classifier/gemma3:27b/runtime/1052857 | 18 + results/classifier/gemma3:27b/runtime/1054812 | 8 + results/classifier/gemma3:27b/runtime/1059 | 13 + results/classifier/gemma3:27b/runtime/1068900 | 8 + results/classifier/gemma3:27b/runtime/1070 | 13 + results/classifier/gemma3:27b/runtime/1072 | 27 + results/classifier/gemma3:27b/runtime/1075 | 19 + results/classifier/gemma3:27b/runtime/1086 | 72 ++ results/classifier/gemma3:27b/runtime/1093 | 36 + results/classifier/gemma3:27b/runtime/1098729 | 46 + results/classifier/gemma3:27b/runtime/1102 | 41 + results/classifier/gemma3:27b/runtime/1147 | 12 + results/classifier/gemma3:27b/runtime/1165383 | 6 + results/classifier/gemma3:27b/runtime/1172613 | 66 + results/classifier/gemma3:27b/runtime/1182490 | 79 ++ results/classifier/gemma3:27b/runtime/1187319 | 11 + results/classifier/gemma3:27b/runtime/1207896 | 6 + results/classifier/gemma3:27b/runtime/1209 | 8 + results/classifier/gemma3:27b/runtime/121 | 4 + results/classifier/gemma3:27b/runtime/1211 | 10 + results/classifier/gemma3:27b/runtime/122 | 4 + results/classifier/gemma3:27b/runtime/1221966 | 37 + results/classifier/gemma3:27b/runtime/1228 | 46 + results/classifier/gemma3:27b/runtime/1233225 | 27 + results/classifier/gemma3:27b/runtime/1245703 | 12 + results/classifier/gemma3:27b/runtime/1246990 | 41 + results/classifier/gemma3:27b/runtime/1254672 | 44 + results/classifier/gemma3:27b/runtime/1254828 | 40 + results/classifier/gemma3:27b/runtime/1255 | 14 + results/classifier/gemma3:27b/runtime/1261743 | 8 + results/classifier/gemma3:27b/runtime/1263747 | 32 + results/classifier/gemma3:27b/runtime/1285363 | 48 + results/classifier/gemma3:27b/runtime/1287195 | 6 + results/classifier/gemma3:27b/runtime/1294898 | 81 ++ results/classifier/gemma3:27b/runtime/1311614 | 50 + results/classifier/gemma3:27b/runtime/1346784 | 70 + results/classifier/gemma3:27b/runtime/1357206 | 62 + results/classifier/gemma3:27b/runtime/1357226 | 14 + results/classifier/gemma3:27b/runtime/1361912 | 12 + results/classifier/gemma3:27b/runtime/1362635 | 45 + results/classifier/gemma3:27b/runtime/1388 | 17 + results/classifier/gemma3:27b/runtime/1429313 | 12 + results/classifier/gemma3:27b/runtime/1471 | 19 + results/classifier/gemma3:27b/runtime/1478 | 69 + results/classifier/gemma3:27b/runtime/1494 | 935 ++++++++++++++ results/classifier/gemma3:27b/runtime/1495 | 9 + results/classifier/gemma3:27b/runtime/1512 | 4 + results/classifier/gemma3:27b/runtime/1519037 | 10 + results/classifier/gemma3:27b/runtime/1527765 | 75 ++ results/classifier/gemma3:27b/runtime/1528 | 12 + results/classifier/gemma3:27b/runtime/1528239 | 48 + results/classifier/gemma3:27b/runtime/1541 | 35 + results/classifier/gemma3:27b/runtime/1547 | 15 + results/classifier/gemma3:27b/runtime/1550503 | 16 + results/classifier/gemma3:27b/runtime/1553 | 15 + results/classifier/gemma3:27b/runtime/1568107 | 12 + results/classifier/gemma3:27b/runtime/1585840 | 12 + results/classifier/gemma3:27b/runtime/1591611 | 26 + results/classifier/gemma3:27b/runtime/1593 | 10 + results/classifier/gemma3:27b/runtime/1603734 | 10 + results/classifier/gemma3:27b/runtime/1623020 | 58 + results/classifier/gemma3:27b/runtime/1641861 | 39 + results/classifier/gemma3:27b/runtime/1648 | 61 + results/classifier/gemma3:27b/runtime/1654137 | 10 + results/classifier/gemma3:27b/runtime/1659901 | 12 + results/classifier/gemma3:27b/runtime/1661815 | 29 + results/classifier/gemma3:27b/runtime/1671 | 1360 ++++++++++++++++++++ results/classifier/gemma3:27b/runtime/1696353 | 38 + results/classifier/gemma3:27b/runtime/1696773 | 10 + results/classifier/gemma3:27b/runtime/1697 | 22 + results/classifier/gemma3:27b/runtime/1704638 | 68 + results/classifier/gemma3:27b/runtime/1715162 | 75 ++ results/classifier/gemma3:27b/runtime/1716767 | 37 + results/classifier/gemma3:27b/runtime/1725267 | 34 + results/classifier/gemma3:27b/runtime/1735384 | 23 + results/classifier/gemma3:27b/runtime/1737444 | 96 ++ results/classifier/gemma3:27b/runtime/1740219 | 62 + results/classifier/gemma3:27b/runtime/1741 | 4 + results/classifier/gemma3:27b/runtime/1748612 | 18 + results/classifier/gemma3:27b/runtime/1755 | 23 + results/classifier/gemma3:27b/runtime/1756519 | 49 + results/classifier/gemma3:27b/runtime/1756807 | 70 + results/classifier/gemma3:27b/runtime/1761535 | 39 + results/classifier/gemma3:27b/runtime/1763 | 15 + results/classifier/gemma3:27b/runtime/1763536 | 86 ++ results/classifier/gemma3:27b/runtime/1765970 | 64 + results/classifier/gemma3:27b/runtime/1768 | 35 + results/classifier/gemma3:27b/runtime/1768246 | 16 + results/classifier/gemma3:27b/runtime/1773743 | 24 + results/classifier/gemma3:27b/runtime/1774149 | 79 ++ results/classifier/gemma3:27b/runtime/1776478 | 49 + results/classifier/gemma3:27b/runtime/1779634 | 38 + results/classifier/gemma3:27b/runtime/1793539 | 12 + results/classifier/gemma3:27b/runtime/1798 | 4 + results/classifier/gemma3:27b/runtime/1799200 | 43 + results/classifier/gemma3:27b/runtime/1805 | 69 + results/classifier/gemma3:27b/runtime/1807 | 27 + results/classifier/gemma3:27b/runtime/1808565 | 10 + results/classifier/gemma3:27b/runtime/1812 | 28 + results/classifier/gemma3:27b/runtime/1812451 | 17 + results/classifier/gemma3:27b/runtime/1812861 | 25 + results/classifier/gemma3:27b/runtime/1813307 | 24 + results/classifier/gemma3:27b/runtime/1813398 | 44 + results/classifier/gemma3:27b/runtime/1814128 | 158 +++ results/classifier/gemma3:27b/runtime/1818483 | 45 + results/classifier/gemma3:27b/runtime/1819 | 13 + results/classifier/gemma3:27b/runtime/1821515 | 41 + results/classifier/gemma3:27b/runtime/1830 | 29 + results/classifier/gemma3:27b/runtime/1832353 | 23 + results/classifier/gemma3:27b/runtime/1832916 | 8 + results/classifier/gemma3:27b/runtime/1833668 | 30 + results/classifier/gemma3:27b/runtime/1834496 | 30 + results/classifier/gemma3:27b/runtime/1835693 | 20 + results/classifier/gemma3:27b/runtime/1835839 | 24 + results/classifier/gemma3:27b/runtime/1836078 | 25 + results/classifier/gemma3:27b/runtime/1836192 | 24 + results/classifier/gemma3:27b/runtime/1836558 | 51 + results/classifier/gemma3:27b/runtime/1840922 | 24 + results/classifier/gemma3:27b/runtime/1854 | 21 + results/classifier/gemma3:27b/runtime/1857 | 55 + results/classifier/gemma3:27b/runtime/1858415 | 27 + results/classifier/gemma3:27b/runtime/1860056 | 23 + results/classifier/gemma3:27b/runtime/1860610 | 10 + results/classifier/gemma3:27b/runtime/1862986 | 67 + results/classifier/gemma3:27b/runtime/1863445 | 19 + results/classifier/gemma3:27b/runtime/1869073 | 10 + results/classifier/gemma3:27b/runtime/1869241 | 22 + results/classifier/gemma3:27b/runtime/1869782 | 16 + results/classifier/gemma3:27b/runtime/1870477 | 36 + results/classifier/gemma3:27b/runtime/1878501 | 34 + results/classifier/gemma3:27b/runtime/1880225 | 140 ++ results/classifier/gemma3:27b/runtime/1880332 | 10 + results/classifier/gemma3:27b/runtime/1880722 | 17 + results/classifier/gemma3:27b/runtime/1883268 | 40 + results/classifier/gemma3:27b/runtime/1883784 | 12 + results/classifier/gemma3:27b/runtime/1888303 | 23 + results/classifier/gemma3:27b/runtime/1888728 | 22 + results/classifier/gemma3:27b/runtime/1889411 | 66 + results/classifier/gemma3:27b/runtime/1890 | 28 + results/classifier/gemma3:27b/runtime/1894029 | 42 + results/classifier/gemma3:27b/runtime/1895 | 149 +++ results/classifier/gemma3:27b/runtime/1895080 | 39 + results/classifier/gemma3:27b/runtime/1895471 | 26 + results/classifier/gemma3:27b/runtime/1904259 | 32 + results/classifier/gemma3:27b/runtime/1907817 | 46 + results/classifier/gemma3:27b/runtime/1907969 | 61 + results/classifier/gemma3:27b/runtime/1908 | 52 + results/classifier/gemma3:27b/runtime/1908551 | 57 + results/classifier/gemma3:27b/runtime/1909 | 53 + results/classifier/gemma3:27b/runtime/1909921 | 25 + results/classifier/gemma3:27b/runtime/1913 | 22 + results/classifier/gemma3:27b/runtime/1915531 | 57 + results/classifier/gemma3:27b/runtime/1916344 | 27 + results/classifier/gemma3:27b/runtime/1917184 | 8 + results/classifier/gemma3:27b/runtime/1927530 | 42 + results/classifier/gemma3:27b/runtime/1930 | 49 + results/classifier/gemma3:27b/runtime/1936977 | 10 + results/classifier/gemma3:27b/runtime/1952 | 99 ++ results/classifier/gemma3:27b/runtime/1953 | 149 +++ results/classifier/gemma3:27b/runtime/2027 | 236 ++++ results/classifier/gemma3:27b/runtime/2035 | 57 + results/classifier/gemma3:27b/runtime/2072564 | 48 + results/classifier/gemma3:27b/runtime/2082 | 47 + results/classifier/gemma3:27b/runtime/2101 | 20 + results/classifier/gemma3:27b/runtime/2119 | 4 + results/classifier/gemma3:27b/runtime/2122 | 10 + results/classifier/gemma3:27b/runtime/2123 | 34 + results/classifier/gemma3:27b/runtime/2127 | 4 + results/classifier/gemma3:27b/runtime/2156 | 18 + results/classifier/gemma3:27b/runtime/2157 | 46 + results/classifier/gemma3:27b/runtime/2223 | 38 + results/classifier/gemma3:27b/runtime/2304 | 41 + results/classifier/gemma3:27b/runtime/2309 | 34 + results/classifier/gemma3:27b/runtime/2336 | 26 + results/classifier/gemma3:27b/runtime/2448 | 49 + results/classifier/gemma3:27b/runtime/2460 | 11 + results/classifier/gemma3:27b/runtime/2486 | 15 + results/classifier/gemma3:27b/runtime/2505 | 4 + results/classifier/gemma3:27b/runtime/2525 | 4 + results/classifier/gemma3:27b/runtime/2560 | 108 ++ results/classifier/gemma3:27b/runtime/2569 | 8 + results/classifier/gemma3:27b/runtime/2580 | 15 + results/classifier/gemma3:27b/runtime/2590 | 26 + results/classifier/gemma3:27b/runtime/2592 | 40 + results/classifier/gemma3:27b/runtime/2596 | 4 + results/classifier/gemma3:27b/runtime/2598 | 4 + results/classifier/gemma3:27b/runtime/261 | 4 + results/classifier/gemma3:27b/runtime/2619 | 4 + results/classifier/gemma3:27b/runtime/2628 | 23 + results/classifier/gemma3:27b/runtime/2647 | 50 + results/classifier/gemma3:27b/runtime/2655 | 42 + results/classifier/gemma3:27b/runtime/2683 | 42 + results/classifier/gemma3:27b/runtime/2738 | 13 + results/classifier/gemma3:27b/runtime/275 | 4 + results/classifier/gemma3:27b/runtime/276 | 4 + results/classifier/gemma3:27b/runtime/2761 | 11 + results/classifier/gemma3:27b/runtime/280 | 4 + results/classifier/gemma3:27b/runtime/2815 | 4 + results/classifier/gemma3:27b/runtime/2846 | 4 + results/classifier/gemma3:27b/runtime/311 | 4 + results/classifier/gemma3:27b/runtime/324 | 4 + results/classifier/gemma3:27b/runtime/326 | 4 + results/classifier/gemma3:27b/runtime/333 | 4 + results/classifier/gemma3:27b/runtime/355 | 4 + results/classifier/gemma3:27b/runtime/419 | 4 + results/classifier/gemma3:27b/runtime/442 | 4 + results/classifier/gemma3:27b/runtime/447 | 4 + results/classifier/gemma3:27b/runtime/562107 | 15 + results/classifier/gemma3:27b/runtime/625 | 26 + results/classifier/gemma3:27b/runtime/645662 | 43 + results/classifier/gemma3:27b/runtime/690 | 22 + results/classifier/gemma3:27b/runtime/693 | 13 + results/classifier/gemma3:27b/runtime/695 | 4 + results/classifier/gemma3:27b/runtime/697 | 4 + results/classifier/gemma3:27b/runtime/698 | 361 ++++++ results/classifier/gemma3:27b/runtime/704 | 4 + results/classifier/gemma3:27b/runtime/739785 | 37 + results/classifier/gemma3:27b/runtime/754635 | 58 + results/classifier/gemma3:27b/runtime/805 | 17 + results/classifier/gemma3:27b/runtime/866 | 56 + results/classifier/gemma3:27b/runtime/886621 | 295 +++++ results/classifier/gemma3:27b/runtime/909 | 14 + results/classifier/gemma3:27b/runtime/922 | 23 + results/classifier/gemma3:27b/runtime/939 | 78 ++ results/classifier/gemma3:27b/runtime/967 | 227 ++++ results/classifier/gemma3:27b/syscall/1007 | 4 + results/classifier/gemma3:27b/syscall/1010 | 81 ++ results/classifier/gemma3:27b/syscall/1012 | 44 + results/classifier/gemma3:27b/syscall/1033 | 30 + results/classifier/gemma3:27b/syscall/1054831 | 20 + results/classifier/gemma3:27b/syscall/1066909 | 10 + results/classifier/gemma3:27b/syscall/1075272 | 16 + results/classifier/gemma3:27b/syscall/1075339 | 6 + results/classifier/gemma3:27b/syscall/1076445 | 48 + results/classifier/gemma3:27b/syscall/1111 | 21 + results/classifier/gemma3:27b/syscall/1238 | 122 ++ results/classifier/gemma3:27b/syscall/1261 | 28 + results/classifier/gemma3:27b/syscall/127 | 4 + results/classifier/gemma3:27b/syscall/1319100 | 72 ++ results/classifier/gemma3:27b/syscall/1346769 | 39 + results/classifier/gemma3:27b/syscall/1356916 | 9 + results/classifier/gemma3:27b/syscall/1361 | 23 + results/classifier/gemma3:27b/syscall/1394 | 64 + results/classifier/gemma3:27b/syscall/140 | 4 + results/classifier/gemma3:27b/syscall/1416988 | 35 + results/classifier/gemma3:27b/syscall/1457275 | 108 ++ results/classifier/gemma3:27b/syscall/1462640 | 38 + results/classifier/gemma3:27b/syscall/1470170 | 43 + results/classifier/gemma3:27b/syscall/1516408 | 34 + results/classifier/gemma3:27b/syscall/1563612 | 53 + results/classifier/gemma3:27b/syscall/1594394 | 44 + results/classifier/gemma3:27b/syscall/1605443 | 14 + results/classifier/gemma3:27b/syscall/1617929 | 53 + results/classifier/gemma3:27b/syscall/1619896 | 53 + results/classifier/gemma3:27b/syscall/1643619 | 35 + results/classifier/gemma3:27b/syscall/1650 | 17 + results/classifier/gemma3:27b/syscall/1667401 | 70 + results/classifier/gemma3:27b/syscall/1673976 | 14 + results/classifier/gemma3:27b/syscall/1689367 | 29 + results/classifier/gemma3:27b/syscall/1701808 | 19 + results/classifier/gemma3:27b/syscall/1701971 | 48 + results/classifier/gemma3:27b/syscall/1701973 | 20 + results/classifier/gemma3:27b/syscall/1701974 | 20 + results/classifier/gemma3:27b/syscall/1707 | 26 + results/classifier/gemma3:27b/syscall/1716292 | 33 + results/classifier/gemma3:27b/syscall/1726394 | 8 + results/classifier/gemma3:27b/syscall/1728116 | 50 + results/classifier/gemma3:27b/syscall/1729 | 50 + results/classifier/gemma3:27b/syscall/1734 | 19 + results/classifier/gemma3:27b/syscall/1734792 | 10 + results/classifier/gemma3:27b/syscall/1738545 | 34 + results/classifier/gemma3:27b/syscall/1749393 | 29 + results/classifier/gemma3:27b/syscall/1756 | 46 + results/classifier/gemma3:27b/syscall/1760 | 56 + results/classifier/gemma3:27b/syscall/1761153 | 26 + results/classifier/gemma3:27b/syscall/1770 | 25 + results/classifier/gemma3:27b/syscall/1777226 | 18 + results/classifier/gemma3:27b/syscall/1783362 | 50 + results/classifier/gemma3:27b/syscall/1785203 | 46 + results/classifier/gemma3:27b/syscall/1791763 | 16 + results/classifier/gemma3:27b/syscall/1791796 | 126 ++ results/classifier/gemma3:27b/syscall/1805913 | 24 + results/classifier/gemma3:27b/syscall/1808563 | 20 + results/classifier/gemma3:27b/syscall/1810433 | 50 + results/classifier/gemma3:27b/syscall/1821006 | 38 + results/classifier/gemma3:27b/syscall/1829459 | 38 + results/classifier/gemma3:27b/syscall/1837 | 38 + results/classifier/gemma3:27b/syscall/1857811 | 10 + results/classifier/gemma3:27b/syscall/1858461 | 26 + results/classifier/gemma3:27b/syscall/1860053 | 23 + results/classifier/gemma3:27b/syscall/1861341 | 33 + results/classifier/gemma3:27b/syscall/1876373 | 51 + results/classifier/gemma3:27b/syscall/1884719 | 135 ++ results/classifier/gemma3:27b/syscall/1886097 | 36 + results/classifier/gemma3:27b/syscall/1887306 | 58 + results/classifier/gemma3:27b/syscall/1893010 | 8 + results/classifier/gemma3:27b/syscall/1894361 | 8 + results/classifier/gemma3:27b/syscall/1895305 | 51 + results/classifier/gemma3:27b/syscall/1906193 | 60 + results/classifier/gemma3:27b/syscall/1910605 | 19 + results/classifier/gemma3:27b/syscall/1915925 | 20 + results/classifier/gemma3:27b/syscall/1926044 | 33 + results/classifier/gemma3:27b/syscall/1926246 | 53 + results/classifier/gemma3:27b/syscall/1926521 | 65 + results/classifier/gemma3:27b/syscall/1926996 | 23 + results/classifier/gemma3:27b/syscall/2112 | 29 + results/classifier/gemma3:27b/syscall/2168 | 35 + results/classifier/gemma3:27b/syscall/2170 | 47 + results/classifier/gemma3:27b/syscall/2197 | 61 + results/classifier/gemma3:27b/syscall/2262 | 202 +++ results/classifier/gemma3:27b/syscall/2333 | 48 + results/classifier/gemma3:27b/syscall/2353 | 59 + results/classifier/gemma3:27b/syscall/2390 | 66 + results/classifier/gemma3:27b/syscall/2410 | 95 ++ results/classifier/gemma3:27b/syscall/2446 | 63 + results/classifier/gemma3:27b/syscall/2485 | 50 + results/classifier/gemma3:27b/syscall/2504 | 10 + results/classifier/gemma3:27b/syscall/2553 | 85 ++ results/classifier/gemma3:27b/syscall/2606 | 201 +++ results/classifier/gemma3:27b/syscall/263 | 4 + results/classifier/gemma3:27b/syscall/2825 | 40 + results/classifier/gemma3:27b/syscall/306 | 4 + results/classifier/gemma3:27b/syscall/356 | 4 + results/classifier/gemma3:27b/syscall/456 | 32 + results/classifier/gemma3:27b/syscall/470 | 4 + results/classifier/gemma3:27b/syscall/570 | 4 + results/classifier/gemma3:27b/syscall/577 | 28 + results/classifier/gemma3:27b/syscall/578 | 33 + results/classifier/gemma3:27b/syscall/579 | 53 + results/classifier/gemma3:27b/syscall/602 | 16 + results/classifier/gemma3:27b/syscall/633 | 35 + results/classifier/gemma3:27b/syscall/654 | 26 + results/classifier/gemma3:27b/syscall/714 | 46 + results/classifier/gemma3:27b/syscall/817 | 4 + results/classifier/gemma3:27b/syscall/829 | 17 + results/classifier/gemma3:27b/syscall/833 | 45 + results/classifier/gemma3:27b/syscall/834 | 62 + results/classifier/gemma3:27b/syscall/836 | 88 ++ results/classifier/gemma3:27b/syscall/856 | 64 + results/classifier/gemma3:27b/syscall/871 | 17 + results/classifier/gemma3:27b/syscall/885 | 4 + results/classifier/gemma3:27b/syscall/911 | 20 + results/classifier/gemma3:27b/syscall/927 | 35 + results/classifier/gemma3:27b/syscall/95 | 4 + results/classifier/gemma3:27b/syscall/957 | 74 ++ results/classifier/gemma3:27b/syscall/982 | 40 + results/classifier/semantic-bugs/1079080 | 33 +- results/classifier/semantic-bugs/1377 | 12 - results/classifier/semantic-bugs/1809546 | 47 +- results/classifier/semantic-bugs/1824778 | 23 +- results/classifier/semantic-bugs/1898954 | 47 +- results/classifier/user-mode-bugs/1079080 | 12 + results/classifier/user-mode-bugs/1156313 | 128 ++ results/classifier/user-mode-bugs/1377 | 15 + results/classifier/user-mode-bugs/1751494 | 38 + results/classifier/user-mode-bugs/1809546 | 45 + results/classifier/user-mode-bugs/1824778 | 9 + results/classifier/user-mode-bugs/1898954 | 28 + results/classifier/user-mode-bugs/1955 | 39 + 1656 files changed, 57451 insertions(+), 159 deletions(-) create mode 100644 classification/preambel-user-mode create mode 100644 results/classifier/deepseek-r1:32b/analysis.csv create mode 100644 results/classifier/deepseek-r1:32b/categories.csv create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1022 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1028 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1051 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1054812 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1086 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1092 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1095857 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1129571 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1156 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1178 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1245543 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1248168 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1251 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1254786 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1267955 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1283519 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1308381 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1328996 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1339 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1370 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1371 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1372 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1373 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1374 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1375 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1376 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1404690 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1428352 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1441 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1452 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1469342 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1471 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1512 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1536 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1547 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1553 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1574346 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1590336 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1594069 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1605123 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1606 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1611394 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1612 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1613817 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1620 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1637 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1641637 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1642 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1701821 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1713066 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1722 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1727737 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1737 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1738434 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1748296 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1751422 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1771 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1780 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1781281 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1790 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1793119 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1793608 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1806243 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1815024 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1818075 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1820686 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1821430 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1821444 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1824344 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1826568 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1828867 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1832422 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1833 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1841990 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1847467 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1854738 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1859713 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1861404 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1863247 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1873898 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1874888 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1877794 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1881450 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1889288 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1901 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1904210 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1905356 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1908626 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1909 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1912934 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1913913 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1914021 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1915327 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1916269 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1922887 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1925512 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1926759 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/1967248 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2078 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2083 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2089 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2136 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2175 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2203 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2302 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2317 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2318 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2319 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2371 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2372 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2373 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2374 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2375 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2376 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2386 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2419 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2422 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2474 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2483 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2487 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2495 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2497 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2498 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2499 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2500 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2595 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2604 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/266 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2696 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2775 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2865 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2878 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/2971 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/312 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/364 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/381 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/390 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/422 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/427 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/449 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/494 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/508 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/618 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/625 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/754 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/799 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/824 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/826 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/837 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/890 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/904308 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/952 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/979 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/984 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/993 create mode 100644 results/classifier/deepseek-r1:32b/output/instruction/998 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1033 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1054831 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1066909 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1075272 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1075339 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/122 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/127 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1394 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1416988 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1643619 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1673976 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1701973 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1729 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1734792 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1760 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1761153 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1783362 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1805913 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1810433 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1837 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1869073 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1869241 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1910605 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/1926521 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/2101 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/2122 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/2248 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/2262 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/2333 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/2410 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/2446 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/2553 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/570 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/602 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/817 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/829 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/833 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/911 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/957 create mode 100644 results/classifier/deepseek-r1:32b/output/manual-review/982 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1010 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1010484 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1027 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1031920 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1034 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1041 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1044 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1052857 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1059 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1068900 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1070 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1072 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1075 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1093 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1095531 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1098729 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1102 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1128 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1143 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1147 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1165383 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1172613 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1182490 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1187319 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1207896 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1209 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1211 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1221966 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1228 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1233225 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1245703 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1246990 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1248 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1254672 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1254828 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1255 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1261743 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1263747 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1267 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1285363 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1287195 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1294898 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1311614 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1346769 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1346784 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1357206 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1357226 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1361 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1361912 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1362635 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1368 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1388 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1397 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/140 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1412 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1429313 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1435 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1478 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1495 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1519037 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1527765 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1528 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1528239 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1531 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1533141 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1541 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1550503 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1568107 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1591611 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1593 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1603734 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1614348 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1623020 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1641861 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1648 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1650 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1654137 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1659901 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1661815 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1667401 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1671 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1696353 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1697 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1704638 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1707 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1715162 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1716767 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1724485 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1725267 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1734 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1735384 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1736 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1737444 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1738545 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1740219 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1741 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1748612 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1755 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1756 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1756519 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1756807 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1756927 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1761401 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1761535 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1763 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1765970 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1768 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1768246 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1773743 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1774149 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1777226 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1779 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1779634 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1785734 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1793539 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1796520 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1798 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1799200 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1805 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1807 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1808563 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1808565 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1812 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1812451 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1812861 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1813398 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1814128 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1818483 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1819 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1821515 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1829459 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1830 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1832353 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1832916 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1833668 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1834496 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1835693 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1835839 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1836078 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1836192 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1836558 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1840922 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1854 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1857 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1858415 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1860056 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1860610 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1861605 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1862167 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1862986 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1863445 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1869782 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1870477 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1878501 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1880225 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1880332 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1880722 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1883268 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1883784 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1885350 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1886097 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1887306 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1888303 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1888728 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1889411 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1890 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1892081 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1894029 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1895 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1895080 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1895305 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1895471 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1895703 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1904259 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1906536 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1907817 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1907969 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1908 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1908551 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1909921 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1910 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1913 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1914870 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1915531 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1915925 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1916344 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1917184 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1918026 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1926044 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1926202 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1926246 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1927530 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1930 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1936977 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1941 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1952 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/1953 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2027 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2035 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2072564 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2082 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2119 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2127 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2156 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2157 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2208 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2223 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2304 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2336 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2353 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2448 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2460 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2486 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2505 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2525 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2536 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2560 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2569 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2580 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2590 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2596 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2598 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2606 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/261 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2619 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2628 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2632 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2647 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2655 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2672 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2683 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2730 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2738 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/275 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/276 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2761 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/280 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2802 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2815 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/2846 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/311 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/333 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/355 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/361 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/385 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/419 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/442 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/447 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/514 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/562107 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/616 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/633 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/645662 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/693 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/695 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/697 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/698 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/704 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/714 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/739785 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/754635 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/796480 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/805 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/834 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/856 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/866 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/886621 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/909 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/922 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/939 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/947 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/95 create mode 100644 results/classifier/deepseek-r1:32b/output/runtime/967 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1007 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1012 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1076445 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1111 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/121 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1238 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1261 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1319100 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1356916 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1457275 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1462640 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1470170 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1494 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1516408 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1563612 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1585840 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1594394 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1605443 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1617929 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1619896 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1689367 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1696773 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1701808 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1701971 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1701974 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1716292 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1726394 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1728116 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1749393 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1763536 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1770 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1776478 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1785203 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1791763 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1791796 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1813307 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1821006 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1857811 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1858461 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1860053 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1861341 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1876373 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1884719 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1893010 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1894361 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1906193 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/1926996 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/2112 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/2123 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/2168 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/2170 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/2197 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/2309 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/2390 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/2485 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/2504 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/2592 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/263 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/2825 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/306 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/324 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/326 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/356 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/456 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/470 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/577 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/578 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/579 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/654 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/690 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/836 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/871 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/885 create mode 100644 results/classifier/deepseek-r1:32b/output/syscall/927 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1022 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1028 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1051 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1054812 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1086 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1092 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1095857 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1129571 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1156 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1178 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1245543 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1248168 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1251 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1254786 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1267955 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1283519 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1308381 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1328996 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1339 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1370 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1371 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1372 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1373 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1374 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1375 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1376 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1404690 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1428352 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1441 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1452 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1469342 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1471 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1512 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1536 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1547 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1553 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1574346 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1590336 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1594069 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1605123 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1606 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1611394 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1612 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1613817 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1620 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1637 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1641637 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1642 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1701821 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1713066 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1722 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1727737 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1737 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1738434 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1748296 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1751422 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1771 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1780 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1781281 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1790 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1793119 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1793608 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1806243 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1815024 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1818075 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1820686 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1821430 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1821444 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1824344 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1826568 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1828867 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1832422 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1833 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1841990 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1847467 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1854738 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1859713 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1861404 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1863247 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1873898 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1874888 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1877794 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1881450 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1889288 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1901 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1904210 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1905356 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1908626 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1909 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1912934 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1913913 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1914021 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1915327 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1916269 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1922887 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1925512 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1926759 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/1967248 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2078 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2083 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2089 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2136 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2175 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2203 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2302 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2317 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2318 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2319 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2371 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2372 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2373 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2374 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2375 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2376 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2386 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2419 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2422 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2474 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2483 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2487 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2495 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2497 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2498 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2499 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2500 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2595 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2604 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/266 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2696 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2775 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2865 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2878 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/2971 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/312 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/364 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/381 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/390 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/422 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/427 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/449 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/494 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/508 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/618 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/625 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/754 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/799 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/824 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/826 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/837 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/890 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/904308 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/952 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/979 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/984 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/993 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/instruction/998 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1033 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1054831 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1066909 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1075272 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1075339 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/122 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/127 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1394 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1416988 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1643619 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1673976 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1701973 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1729 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1734792 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1760 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1761153 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1783362 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1805913 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1810433 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1837 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1869073 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1869241 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1910605 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/1926521 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/2101 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/2122 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/2248 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/2262 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/2333 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/2410 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/2446 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/2553 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/570 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/602 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/817 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/829 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/833 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/911 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/957 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/manual-review/982 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1010 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1010484 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1027 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1031920 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1034 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1041 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1044 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1052857 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1059 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1068900 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1070 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1072 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1075 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1093 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1095531 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1098729 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1102 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1128 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1143 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1147 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1165383 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1172613 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1182490 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1187319 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1207896 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1209 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1211 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1221966 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1228 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1233225 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1245703 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1246990 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1248 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1254672 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1254828 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1255 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1261743 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1263747 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1267 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1285363 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1287195 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1294898 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1311614 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1346769 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1346784 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1357206 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1357226 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1361 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1361912 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1362635 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1368 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1388 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1397 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/140 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1412 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1429313 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1435 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1478 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1495 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1519037 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1527765 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1528 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1528239 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1531 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1533141 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1541 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1550503 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1568107 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1591611 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1593 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1603734 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1614348 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1623020 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1641861 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1648 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1650 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1654137 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1659901 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1661815 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1667401 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1671 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1696353 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1697 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1704638 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1707 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1715162 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1716767 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1724485 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1725267 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1734 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1735384 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1736 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1737444 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1738545 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1740219 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1741 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1748612 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1755 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1756 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1756519 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1756807 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1756927 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1761401 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1761535 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1763 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1765970 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1768 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1768246 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1773743 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1774149 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1777226 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1779 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1779634 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1785734 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1793539 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1796520 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1798 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1799200 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1805 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1807 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1808563 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1808565 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1812 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1812451 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1812861 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1813398 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1814128 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1818483 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1819 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1821515 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1829459 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1830 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1832353 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1832916 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1833668 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1834496 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1835693 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1835839 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1836078 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1836192 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1836558 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1840922 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1854 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1857 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1858415 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1860056 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1860610 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1861605 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1862167 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1862986 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1863445 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1869782 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1870477 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1878501 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1880225 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1880332 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1880722 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1883268 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1883784 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1885350 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1886097 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1887306 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1888303 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1888728 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1889411 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1890 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1892081 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1894029 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1895 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1895080 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1895305 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1895471 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1895703 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1904259 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1906536 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1907817 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1907969 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1908 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1908551 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1909921 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1910 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1913 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1914870 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1915531 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1915925 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1916344 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1917184 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1918026 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1926044 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1926202 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1926246 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1927530 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1930 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1936977 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1941 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1952 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/1953 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2027 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2035 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2072564 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2082 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2119 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2127 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2156 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2157 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2208 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2223 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2304 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2336 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2353 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2448 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2460 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2486 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2505 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2525 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2536 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2560 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2569 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2580 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2590 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2596 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2598 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2606 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/261 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2619 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2628 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2632 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2647 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2655 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2672 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2683 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2730 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2738 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/275 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/276 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2761 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/280 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2802 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2815 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/2846 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/311 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/333 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/355 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/361 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/385 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/419 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/442 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/447 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/514 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/562107 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/616 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/633 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/645662 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/693 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/695 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/697 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/698 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/704 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/714 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/739785 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/754635 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/796480 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/805 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/834 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/856 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/866 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/886621 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/909 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/922 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/939 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/947 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/95 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/runtime/967 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1007 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1012 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1076445 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1111 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/121 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1238 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1261 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1319100 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1356916 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1457275 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1462640 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1470170 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1494 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1516408 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1563612 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1585840 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1594394 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1605443 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1617929 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1619896 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1689367 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1696773 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1701808 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1701971 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1701974 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1716292 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1726394 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1728116 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1749393 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1763536 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1770 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1776478 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1785203 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1791763 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1791796 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1813307 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1821006 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1857811 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1858461 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1860053 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1861341 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1876373 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1884719 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1893010 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1894361 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1906193 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/1926996 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/2112 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/2123 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/2168 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/2170 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/2197 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/2309 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/2390 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/2485 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/2504 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/2592 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/263 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/2825 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/306 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/324 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/326 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/356 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/456 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/470 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/577 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/578 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/579 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/654 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/690 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/836 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/871 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/885 create mode 100644 results/classifier/deepseek-r1:32b/reasoning/syscall/927 create mode 100644 results/classifier/gemma3:27b/analysis.csv create mode 100644 results/classifier/gemma3:27b/categories.csv create mode 100644 results/classifier/gemma3:27b/instruction/1022 create mode 100644 results/classifier/gemma3:27b/instruction/1028 create mode 100644 results/classifier/gemma3:27b/instruction/1051 create mode 100644 results/classifier/gemma3:27b/instruction/1079080 create mode 100644 results/classifier/gemma3:27b/instruction/1092 create mode 100644 results/classifier/gemma3:27b/instruction/1095531 create mode 100644 results/classifier/gemma3:27b/instruction/1095857 create mode 100644 results/classifier/gemma3:27b/instruction/1128 create mode 100644 results/classifier/gemma3:27b/instruction/1129571 create mode 100644 results/classifier/gemma3:27b/instruction/1143 create mode 100644 results/classifier/gemma3:27b/instruction/1156 create mode 100644 results/classifier/gemma3:27b/instruction/1156313 create mode 100644 results/classifier/gemma3:27b/instruction/1178 create mode 100644 results/classifier/gemma3:27b/instruction/1245543 create mode 100644 results/classifier/gemma3:27b/instruction/1248 create mode 100644 results/classifier/gemma3:27b/instruction/1248168 create mode 100644 results/classifier/gemma3:27b/instruction/1251 create mode 100644 results/classifier/gemma3:27b/instruction/1254786 create mode 100644 results/classifier/gemma3:27b/instruction/1267 create mode 100644 results/classifier/gemma3:27b/instruction/1267955 create mode 100644 results/classifier/gemma3:27b/instruction/1283519 create mode 100644 results/classifier/gemma3:27b/instruction/1308381 create mode 100644 results/classifier/gemma3:27b/instruction/1328996 create mode 100644 results/classifier/gemma3:27b/instruction/1339 create mode 100644 results/classifier/gemma3:27b/instruction/1368 create mode 100644 results/classifier/gemma3:27b/instruction/1370 create mode 100644 results/classifier/gemma3:27b/instruction/1371 create mode 100644 results/classifier/gemma3:27b/instruction/1372 create mode 100644 results/classifier/gemma3:27b/instruction/1373 create mode 100644 results/classifier/gemma3:27b/instruction/1374 create mode 100644 results/classifier/gemma3:27b/instruction/1375 create mode 100644 results/classifier/gemma3:27b/instruction/1376 create mode 100644 results/classifier/gemma3:27b/instruction/1377 create mode 100644 results/classifier/gemma3:27b/instruction/1397 create mode 100644 results/classifier/gemma3:27b/instruction/1404690 create mode 100644 results/classifier/gemma3:27b/instruction/1412 create mode 100644 results/classifier/gemma3:27b/instruction/1428352 create mode 100644 results/classifier/gemma3:27b/instruction/1435 create mode 100644 results/classifier/gemma3:27b/instruction/1441 create mode 100644 results/classifier/gemma3:27b/instruction/1452 create mode 100644 results/classifier/gemma3:27b/instruction/1469342 create mode 100644 results/classifier/gemma3:27b/instruction/1531 create mode 100644 results/classifier/gemma3:27b/instruction/1536 create mode 100644 results/classifier/gemma3:27b/instruction/1574346 create mode 100644 results/classifier/gemma3:27b/instruction/1590336 create mode 100644 results/classifier/gemma3:27b/instruction/1594069 create mode 100644 results/classifier/gemma3:27b/instruction/1605123 create mode 100644 results/classifier/gemma3:27b/instruction/1606 create mode 100644 results/classifier/gemma3:27b/instruction/1611394 create mode 100644 results/classifier/gemma3:27b/instruction/1612 create mode 100644 results/classifier/gemma3:27b/instruction/1613817 create mode 100644 results/classifier/gemma3:27b/instruction/1614348 create mode 100644 results/classifier/gemma3:27b/instruction/1620 create mode 100644 results/classifier/gemma3:27b/instruction/1637 create mode 100644 results/classifier/gemma3:27b/instruction/1641637 create mode 100644 results/classifier/gemma3:27b/instruction/1642 create mode 100644 results/classifier/gemma3:27b/instruction/1701821 create mode 100644 results/classifier/gemma3:27b/instruction/1713066 create mode 100644 results/classifier/gemma3:27b/instruction/1722 create mode 100644 results/classifier/gemma3:27b/instruction/1724485 create mode 100644 results/classifier/gemma3:27b/instruction/1727737 create mode 100644 results/classifier/gemma3:27b/instruction/1736 create mode 100644 results/classifier/gemma3:27b/instruction/1737 create mode 100644 results/classifier/gemma3:27b/instruction/1738434 create mode 100644 results/classifier/gemma3:27b/instruction/1748296 create mode 100644 results/classifier/gemma3:27b/instruction/1751422 create mode 100644 results/classifier/gemma3:27b/instruction/1751494 create mode 100644 results/classifier/gemma3:27b/instruction/1756927 create mode 100644 results/classifier/gemma3:27b/instruction/1761401 create mode 100644 results/classifier/gemma3:27b/instruction/1771 create mode 100644 results/classifier/gemma3:27b/instruction/1779 create mode 100644 results/classifier/gemma3:27b/instruction/1780 create mode 100644 results/classifier/gemma3:27b/instruction/1781281 create mode 100644 results/classifier/gemma3:27b/instruction/1785734 create mode 100644 results/classifier/gemma3:27b/instruction/1790 create mode 100644 results/classifier/gemma3:27b/instruction/1793119 create mode 100644 results/classifier/gemma3:27b/instruction/1793608 create mode 100644 results/classifier/gemma3:27b/instruction/1796520 create mode 100644 results/classifier/gemma3:27b/instruction/1806243 create mode 100644 results/classifier/gemma3:27b/instruction/1809546 create mode 100644 results/classifier/gemma3:27b/instruction/1815024 create mode 100644 results/classifier/gemma3:27b/instruction/1818075 create mode 100644 results/classifier/gemma3:27b/instruction/1820686 create mode 100644 results/classifier/gemma3:27b/instruction/1821430 create mode 100644 results/classifier/gemma3:27b/instruction/1821444 create mode 100644 results/classifier/gemma3:27b/instruction/1824344 create mode 100644 results/classifier/gemma3:27b/instruction/1824778 create mode 100644 results/classifier/gemma3:27b/instruction/1826568 create mode 100644 results/classifier/gemma3:27b/instruction/1828867 create mode 100644 results/classifier/gemma3:27b/instruction/1832422 create mode 100644 results/classifier/gemma3:27b/instruction/1833 create mode 100644 results/classifier/gemma3:27b/instruction/1841990 create mode 100644 results/classifier/gemma3:27b/instruction/1847467 create mode 100644 results/classifier/gemma3:27b/instruction/1854738 create mode 100644 results/classifier/gemma3:27b/instruction/1859713 create mode 100644 results/classifier/gemma3:27b/instruction/1861404 create mode 100644 results/classifier/gemma3:27b/instruction/1861605 create mode 100644 results/classifier/gemma3:27b/instruction/1862167 create mode 100644 results/classifier/gemma3:27b/instruction/1863247 create mode 100644 results/classifier/gemma3:27b/instruction/1873898 create mode 100644 results/classifier/gemma3:27b/instruction/1874888 create mode 100644 results/classifier/gemma3:27b/instruction/1877794 create mode 100644 results/classifier/gemma3:27b/instruction/1881450 create mode 100644 results/classifier/gemma3:27b/instruction/1885350 create mode 100644 results/classifier/gemma3:27b/instruction/1889288 create mode 100644 results/classifier/gemma3:27b/instruction/1892081 create mode 100644 results/classifier/gemma3:27b/instruction/1898954 create mode 100644 results/classifier/gemma3:27b/instruction/1901 create mode 100644 results/classifier/gemma3:27b/instruction/1904210 create mode 100644 results/classifier/gemma3:27b/instruction/1905356 create mode 100644 results/classifier/gemma3:27b/instruction/1906536 create mode 100644 results/classifier/gemma3:27b/instruction/1908626 create mode 100644 results/classifier/gemma3:27b/instruction/1910 create mode 100644 results/classifier/gemma3:27b/instruction/1912934 create mode 100644 results/classifier/gemma3:27b/instruction/1913913 create mode 100644 results/classifier/gemma3:27b/instruction/1914021 create mode 100644 results/classifier/gemma3:27b/instruction/1914870 create mode 100644 results/classifier/gemma3:27b/instruction/1915327 create mode 100644 results/classifier/gemma3:27b/instruction/1916269 create mode 100644 results/classifier/gemma3:27b/instruction/1918026 create mode 100644 results/classifier/gemma3:27b/instruction/1922887 create mode 100644 results/classifier/gemma3:27b/instruction/1925512 create mode 100644 results/classifier/gemma3:27b/instruction/1926202 create mode 100644 results/classifier/gemma3:27b/instruction/1926759 create mode 100644 results/classifier/gemma3:27b/instruction/1941 create mode 100644 results/classifier/gemma3:27b/instruction/1955 create mode 100644 results/classifier/gemma3:27b/instruction/1967248 create mode 100644 results/classifier/gemma3:27b/instruction/2078 create mode 100644 results/classifier/gemma3:27b/instruction/2083 create mode 100644 results/classifier/gemma3:27b/instruction/2089 create mode 100644 results/classifier/gemma3:27b/instruction/2136 create mode 100644 results/classifier/gemma3:27b/instruction/2175 create mode 100644 results/classifier/gemma3:27b/instruction/2203 create mode 100644 results/classifier/gemma3:27b/instruction/2208 create mode 100644 results/classifier/gemma3:27b/instruction/2248 create mode 100644 results/classifier/gemma3:27b/instruction/2302 create mode 100644 results/classifier/gemma3:27b/instruction/2317 create mode 100644 results/classifier/gemma3:27b/instruction/2318 create mode 100644 results/classifier/gemma3:27b/instruction/2319 create mode 100644 results/classifier/gemma3:27b/instruction/2371 create mode 100644 results/classifier/gemma3:27b/instruction/2372 create mode 100644 results/classifier/gemma3:27b/instruction/2373 create mode 100644 results/classifier/gemma3:27b/instruction/2374 create mode 100644 results/classifier/gemma3:27b/instruction/2375 create mode 100644 results/classifier/gemma3:27b/instruction/2376 create mode 100644 results/classifier/gemma3:27b/instruction/2386 create mode 100644 results/classifier/gemma3:27b/instruction/2419 create mode 100644 results/classifier/gemma3:27b/instruction/2422 create mode 100644 results/classifier/gemma3:27b/instruction/2474 create mode 100644 results/classifier/gemma3:27b/instruction/2483 create mode 100644 results/classifier/gemma3:27b/instruction/2487 create mode 100644 results/classifier/gemma3:27b/instruction/2495 create mode 100644 results/classifier/gemma3:27b/instruction/2497 create mode 100644 results/classifier/gemma3:27b/instruction/2498 create mode 100644 results/classifier/gemma3:27b/instruction/2499 create mode 100644 results/classifier/gemma3:27b/instruction/2500 create mode 100644 results/classifier/gemma3:27b/instruction/2536 create mode 100644 results/classifier/gemma3:27b/instruction/2595 create mode 100644 results/classifier/gemma3:27b/instruction/2604 create mode 100644 results/classifier/gemma3:27b/instruction/2632 create mode 100644 results/classifier/gemma3:27b/instruction/266 create mode 100644 results/classifier/gemma3:27b/instruction/2672 create mode 100644 results/classifier/gemma3:27b/instruction/2696 create mode 100644 results/classifier/gemma3:27b/instruction/2730 create mode 100644 results/classifier/gemma3:27b/instruction/2775 create mode 100644 results/classifier/gemma3:27b/instruction/2802 create mode 100644 results/classifier/gemma3:27b/instruction/2865 create mode 100644 results/classifier/gemma3:27b/instruction/2878 create mode 100644 results/classifier/gemma3:27b/instruction/2971 create mode 100644 results/classifier/gemma3:27b/instruction/312 create mode 100644 results/classifier/gemma3:27b/instruction/361 create mode 100644 results/classifier/gemma3:27b/instruction/364 create mode 100644 results/classifier/gemma3:27b/instruction/381 create mode 100644 results/classifier/gemma3:27b/instruction/385 create mode 100644 results/classifier/gemma3:27b/instruction/390 create mode 100644 results/classifier/gemma3:27b/instruction/422 create mode 100644 results/classifier/gemma3:27b/instruction/427 create mode 100644 results/classifier/gemma3:27b/instruction/449 create mode 100644 results/classifier/gemma3:27b/instruction/494 create mode 100644 results/classifier/gemma3:27b/instruction/508 create mode 100644 results/classifier/gemma3:27b/instruction/514 create mode 100644 results/classifier/gemma3:27b/instruction/616 create mode 100644 results/classifier/gemma3:27b/instruction/618 create mode 100644 results/classifier/gemma3:27b/instruction/754 create mode 100644 results/classifier/gemma3:27b/instruction/796480 create mode 100644 results/classifier/gemma3:27b/instruction/799 create mode 100644 results/classifier/gemma3:27b/instruction/824 create mode 100644 results/classifier/gemma3:27b/instruction/826 create mode 100644 results/classifier/gemma3:27b/instruction/837 create mode 100644 results/classifier/gemma3:27b/instruction/890 create mode 100644 results/classifier/gemma3:27b/instruction/904308 create mode 100644 results/classifier/gemma3:27b/instruction/947 create mode 100644 results/classifier/gemma3:27b/instruction/952 create mode 100644 results/classifier/gemma3:27b/instruction/979 create mode 100644 results/classifier/gemma3:27b/instruction/984 create mode 100644 results/classifier/gemma3:27b/instruction/993 create mode 100644 results/classifier/gemma3:27b/instruction/998 create mode 100644 results/classifier/gemma3:27b/manual-review/1533141 create mode 100644 results/classifier/gemma3:27b/performance/1895703 create mode 100644 results/classifier/gemma3:27b/runtime/1010484 create mode 100644 results/classifier/gemma3:27b/runtime/1027 create mode 100644 results/classifier/gemma3:27b/runtime/1031920 create mode 100644 results/classifier/gemma3:27b/runtime/1034 create mode 100644 results/classifier/gemma3:27b/runtime/1041 create mode 100644 results/classifier/gemma3:27b/runtime/1044 create mode 100644 results/classifier/gemma3:27b/runtime/1052857 create mode 100644 results/classifier/gemma3:27b/runtime/1054812 create mode 100644 results/classifier/gemma3:27b/runtime/1059 create mode 100644 results/classifier/gemma3:27b/runtime/1068900 create mode 100644 results/classifier/gemma3:27b/runtime/1070 create mode 100644 results/classifier/gemma3:27b/runtime/1072 create mode 100644 results/classifier/gemma3:27b/runtime/1075 create mode 100644 results/classifier/gemma3:27b/runtime/1086 create mode 100644 results/classifier/gemma3:27b/runtime/1093 create mode 100644 results/classifier/gemma3:27b/runtime/1098729 create mode 100644 results/classifier/gemma3:27b/runtime/1102 create mode 100644 results/classifier/gemma3:27b/runtime/1147 create mode 100644 results/classifier/gemma3:27b/runtime/1165383 create mode 100644 results/classifier/gemma3:27b/runtime/1172613 create mode 100644 results/classifier/gemma3:27b/runtime/1182490 create mode 100644 results/classifier/gemma3:27b/runtime/1187319 create mode 100644 results/classifier/gemma3:27b/runtime/1207896 create mode 100644 results/classifier/gemma3:27b/runtime/1209 create mode 100644 results/classifier/gemma3:27b/runtime/121 create mode 100644 results/classifier/gemma3:27b/runtime/1211 create mode 100644 results/classifier/gemma3:27b/runtime/122 create mode 100644 results/classifier/gemma3:27b/runtime/1221966 create mode 100644 results/classifier/gemma3:27b/runtime/1228 create mode 100644 results/classifier/gemma3:27b/runtime/1233225 create mode 100644 results/classifier/gemma3:27b/runtime/1245703 create mode 100644 results/classifier/gemma3:27b/runtime/1246990 create mode 100644 results/classifier/gemma3:27b/runtime/1254672 create mode 100644 results/classifier/gemma3:27b/runtime/1254828 create mode 100644 results/classifier/gemma3:27b/runtime/1255 create mode 100644 results/classifier/gemma3:27b/runtime/1261743 create mode 100644 results/classifier/gemma3:27b/runtime/1263747 create mode 100644 results/classifier/gemma3:27b/runtime/1285363 create mode 100644 results/classifier/gemma3:27b/runtime/1287195 create mode 100644 results/classifier/gemma3:27b/runtime/1294898 create mode 100644 results/classifier/gemma3:27b/runtime/1311614 create mode 100644 results/classifier/gemma3:27b/runtime/1346784 create mode 100644 results/classifier/gemma3:27b/runtime/1357206 create mode 100644 results/classifier/gemma3:27b/runtime/1357226 create mode 100644 results/classifier/gemma3:27b/runtime/1361912 create mode 100644 results/classifier/gemma3:27b/runtime/1362635 create mode 100644 results/classifier/gemma3:27b/runtime/1388 create mode 100644 results/classifier/gemma3:27b/runtime/1429313 create mode 100644 results/classifier/gemma3:27b/runtime/1471 create mode 100644 results/classifier/gemma3:27b/runtime/1478 create mode 100644 results/classifier/gemma3:27b/runtime/1494 create mode 100644 results/classifier/gemma3:27b/runtime/1495 create mode 100644 results/classifier/gemma3:27b/runtime/1512 create mode 100644 results/classifier/gemma3:27b/runtime/1519037 create mode 100644 results/classifier/gemma3:27b/runtime/1527765 create mode 100644 results/classifier/gemma3:27b/runtime/1528 create mode 100644 results/classifier/gemma3:27b/runtime/1528239 create mode 100644 results/classifier/gemma3:27b/runtime/1541 create mode 100644 results/classifier/gemma3:27b/runtime/1547 create mode 100644 results/classifier/gemma3:27b/runtime/1550503 create mode 100644 results/classifier/gemma3:27b/runtime/1553 create mode 100644 results/classifier/gemma3:27b/runtime/1568107 create mode 100644 results/classifier/gemma3:27b/runtime/1585840 create mode 100644 results/classifier/gemma3:27b/runtime/1591611 create mode 100644 results/classifier/gemma3:27b/runtime/1593 create mode 100644 results/classifier/gemma3:27b/runtime/1603734 create mode 100644 results/classifier/gemma3:27b/runtime/1623020 create mode 100644 results/classifier/gemma3:27b/runtime/1641861 create mode 100644 results/classifier/gemma3:27b/runtime/1648 create mode 100644 results/classifier/gemma3:27b/runtime/1654137 create mode 100644 results/classifier/gemma3:27b/runtime/1659901 create mode 100644 results/classifier/gemma3:27b/runtime/1661815 create mode 100644 results/classifier/gemma3:27b/runtime/1671 create mode 100644 results/classifier/gemma3:27b/runtime/1696353 create mode 100644 results/classifier/gemma3:27b/runtime/1696773 create mode 100644 results/classifier/gemma3:27b/runtime/1697 create mode 100644 results/classifier/gemma3:27b/runtime/1704638 create mode 100644 results/classifier/gemma3:27b/runtime/1715162 create mode 100644 results/classifier/gemma3:27b/runtime/1716767 create mode 100644 results/classifier/gemma3:27b/runtime/1725267 create mode 100644 results/classifier/gemma3:27b/runtime/1735384 create mode 100644 results/classifier/gemma3:27b/runtime/1737444 create mode 100644 results/classifier/gemma3:27b/runtime/1740219 create mode 100644 results/classifier/gemma3:27b/runtime/1741 create mode 100644 results/classifier/gemma3:27b/runtime/1748612 create mode 100644 results/classifier/gemma3:27b/runtime/1755 create mode 100644 results/classifier/gemma3:27b/runtime/1756519 create mode 100644 results/classifier/gemma3:27b/runtime/1756807 create mode 100644 results/classifier/gemma3:27b/runtime/1761535 create mode 100644 results/classifier/gemma3:27b/runtime/1763 create mode 100644 results/classifier/gemma3:27b/runtime/1763536 create mode 100644 results/classifier/gemma3:27b/runtime/1765970 create mode 100644 results/classifier/gemma3:27b/runtime/1768 create mode 100644 results/classifier/gemma3:27b/runtime/1768246 create mode 100644 results/classifier/gemma3:27b/runtime/1773743 create mode 100644 results/classifier/gemma3:27b/runtime/1774149 create mode 100644 results/classifier/gemma3:27b/runtime/1776478 create mode 100644 results/classifier/gemma3:27b/runtime/1779634 create mode 100644 results/classifier/gemma3:27b/runtime/1793539 create mode 100644 results/classifier/gemma3:27b/runtime/1798 create mode 100644 results/classifier/gemma3:27b/runtime/1799200 create mode 100644 results/classifier/gemma3:27b/runtime/1805 create mode 100644 results/classifier/gemma3:27b/runtime/1807 create mode 100644 results/classifier/gemma3:27b/runtime/1808565 create mode 100644 results/classifier/gemma3:27b/runtime/1812 create mode 100644 results/classifier/gemma3:27b/runtime/1812451 create mode 100644 results/classifier/gemma3:27b/runtime/1812861 create mode 100644 results/classifier/gemma3:27b/runtime/1813307 create mode 100644 results/classifier/gemma3:27b/runtime/1813398 create mode 100644 results/classifier/gemma3:27b/runtime/1814128 create mode 100644 results/classifier/gemma3:27b/runtime/1818483 create mode 100644 results/classifier/gemma3:27b/runtime/1819 create mode 100644 results/classifier/gemma3:27b/runtime/1821515 create mode 100644 results/classifier/gemma3:27b/runtime/1830 create mode 100644 results/classifier/gemma3:27b/runtime/1832353 create mode 100644 results/classifier/gemma3:27b/runtime/1832916 create mode 100644 results/classifier/gemma3:27b/runtime/1833668 create mode 100644 results/classifier/gemma3:27b/runtime/1834496 create mode 100644 results/classifier/gemma3:27b/runtime/1835693 create mode 100644 results/classifier/gemma3:27b/runtime/1835839 create mode 100644 results/classifier/gemma3:27b/runtime/1836078 create mode 100644 results/classifier/gemma3:27b/runtime/1836192 create mode 100644 results/classifier/gemma3:27b/runtime/1836558 create mode 100644 results/classifier/gemma3:27b/runtime/1840922 create mode 100644 results/classifier/gemma3:27b/runtime/1854 create mode 100644 results/classifier/gemma3:27b/runtime/1857 create mode 100644 results/classifier/gemma3:27b/runtime/1858415 create mode 100644 results/classifier/gemma3:27b/runtime/1860056 create mode 100644 results/classifier/gemma3:27b/runtime/1860610 create mode 100644 results/classifier/gemma3:27b/runtime/1862986 create mode 100644 results/classifier/gemma3:27b/runtime/1863445 create mode 100644 results/classifier/gemma3:27b/runtime/1869073 create mode 100644 results/classifier/gemma3:27b/runtime/1869241 create mode 100644 results/classifier/gemma3:27b/runtime/1869782 create mode 100644 results/classifier/gemma3:27b/runtime/1870477 create mode 100644 results/classifier/gemma3:27b/runtime/1878501 create mode 100644 results/classifier/gemma3:27b/runtime/1880225 create mode 100644 results/classifier/gemma3:27b/runtime/1880332 create mode 100644 results/classifier/gemma3:27b/runtime/1880722 create mode 100644 results/classifier/gemma3:27b/runtime/1883268 create mode 100644 results/classifier/gemma3:27b/runtime/1883784 create mode 100644 results/classifier/gemma3:27b/runtime/1888303 create mode 100644 results/classifier/gemma3:27b/runtime/1888728 create mode 100644 results/classifier/gemma3:27b/runtime/1889411 create mode 100644 results/classifier/gemma3:27b/runtime/1890 create mode 100644 results/classifier/gemma3:27b/runtime/1894029 create mode 100644 results/classifier/gemma3:27b/runtime/1895 create mode 100644 results/classifier/gemma3:27b/runtime/1895080 create mode 100644 results/classifier/gemma3:27b/runtime/1895471 create mode 100644 results/classifier/gemma3:27b/runtime/1904259 create mode 100644 results/classifier/gemma3:27b/runtime/1907817 create mode 100644 results/classifier/gemma3:27b/runtime/1907969 create mode 100644 results/classifier/gemma3:27b/runtime/1908 create mode 100644 results/classifier/gemma3:27b/runtime/1908551 create mode 100644 results/classifier/gemma3:27b/runtime/1909 create mode 100644 results/classifier/gemma3:27b/runtime/1909921 create mode 100644 results/classifier/gemma3:27b/runtime/1913 create mode 100644 results/classifier/gemma3:27b/runtime/1915531 create mode 100644 results/classifier/gemma3:27b/runtime/1916344 create mode 100644 results/classifier/gemma3:27b/runtime/1917184 create mode 100644 results/classifier/gemma3:27b/runtime/1927530 create mode 100644 results/classifier/gemma3:27b/runtime/1930 create mode 100644 results/classifier/gemma3:27b/runtime/1936977 create mode 100644 results/classifier/gemma3:27b/runtime/1952 create mode 100644 results/classifier/gemma3:27b/runtime/1953 create mode 100644 results/classifier/gemma3:27b/runtime/2027 create mode 100644 results/classifier/gemma3:27b/runtime/2035 create mode 100644 results/classifier/gemma3:27b/runtime/2072564 create mode 100644 results/classifier/gemma3:27b/runtime/2082 create mode 100644 results/classifier/gemma3:27b/runtime/2101 create mode 100644 results/classifier/gemma3:27b/runtime/2119 create mode 100644 results/classifier/gemma3:27b/runtime/2122 create mode 100644 results/classifier/gemma3:27b/runtime/2123 create mode 100644 results/classifier/gemma3:27b/runtime/2127 create mode 100644 results/classifier/gemma3:27b/runtime/2156 create mode 100644 results/classifier/gemma3:27b/runtime/2157 create mode 100644 results/classifier/gemma3:27b/runtime/2223 create mode 100644 results/classifier/gemma3:27b/runtime/2304 create mode 100644 results/classifier/gemma3:27b/runtime/2309 create mode 100644 results/classifier/gemma3:27b/runtime/2336 create mode 100644 results/classifier/gemma3:27b/runtime/2448 create mode 100644 results/classifier/gemma3:27b/runtime/2460 create mode 100644 results/classifier/gemma3:27b/runtime/2486 create mode 100644 results/classifier/gemma3:27b/runtime/2505 create mode 100644 results/classifier/gemma3:27b/runtime/2525 create mode 100644 results/classifier/gemma3:27b/runtime/2560 create mode 100644 results/classifier/gemma3:27b/runtime/2569 create mode 100644 results/classifier/gemma3:27b/runtime/2580 create mode 100644 results/classifier/gemma3:27b/runtime/2590 create mode 100644 results/classifier/gemma3:27b/runtime/2592 create mode 100644 results/classifier/gemma3:27b/runtime/2596 create mode 100644 results/classifier/gemma3:27b/runtime/2598 create mode 100644 results/classifier/gemma3:27b/runtime/261 create mode 100644 results/classifier/gemma3:27b/runtime/2619 create mode 100644 results/classifier/gemma3:27b/runtime/2628 create mode 100644 results/classifier/gemma3:27b/runtime/2647 create mode 100644 results/classifier/gemma3:27b/runtime/2655 create mode 100644 results/classifier/gemma3:27b/runtime/2683 create mode 100644 results/classifier/gemma3:27b/runtime/2738 create mode 100644 results/classifier/gemma3:27b/runtime/275 create mode 100644 results/classifier/gemma3:27b/runtime/276 create mode 100644 results/classifier/gemma3:27b/runtime/2761 create mode 100644 results/classifier/gemma3:27b/runtime/280 create mode 100644 results/classifier/gemma3:27b/runtime/2815 create mode 100644 results/classifier/gemma3:27b/runtime/2846 create mode 100644 results/classifier/gemma3:27b/runtime/311 create mode 100644 results/classifier/gemma3:27b/runtime/324 create mode 100644 results/classifier/gemma3:27b/runtime/326 create mode 100644 results/classifier/gemma3:27b/runtime/333 create mode 100644 results/classifier/gemma3:27b/runtime/355 create mode 100644 results/classifier/gemma3:27b/runtime/419 create mode 100644 results/classifier/gemma3:27b/runtime/442 create mode 100644 results/classifier/gemma3:27b/runtime/447 create mode 100644 results/classifier/gemma3:27b/runtime/562107 create mode 100644 results/classifier/gemma3:27b/runtime/625 create mode 100644 results/classifier/gemma3:27b/runtime/645662 create mode 100644 results/classifier/gemma3:27b/runtime/690 create mode 100644 results/classifier/gemma3:27b/runtime/693 create mode 100644 results/classifier/gemma3:27b/runtime/695 create mode 100644 results/classifier/gemma3:27b/runtime/697 create mode 100644 results/classifier/gemma3:27b/runtime/698 create mode 100644 results/classifier/gemma3:27b/runtime/704 create mode 100644 results/classifier/gemma3:27b/runtime/739785 create mode 100644 results/classifier/gemma3:27b/runtime/754635 create mode 100644 results/classifier/gemma3:27b/runtime/805 create mode 100644 results/classifier/gemma3:27b/runtime/866 create mode 100644 results/classifier/gemma3:27b/runtime/886621 create mode 100644 results/classifier/gemma3:27b/runtime/909 create mode 100644 results/classifier/gemma3:27b/runtime/922 create mode 100644 results/classifier/gemma3:27b/runtime/939 create mode 100644 results/classifier/gemma3:27b/runtime/967 create mode 100644 results/classifier/gemma3:27b/syscall/1007 create mode 100644 results/classifier/gemma3:27b/syscall/1010 create mode 100644 results/classifier/gemma3:27b/syscall/1012 create mode 100644 results/classifier/gemma3:27b/syscall/1033 create mode 100644 results/classifier/gemma3:27b/syscall/1054831 create mode 100644 results/classifier/gemma3:27b/syscall/1066909 create mode 100644 results/classifier/gemma3:27b/syscall/1075272 create mode 100644 results/classifier/gemma3:27b/syscall/1075339 create mode 100644 results/classifier/gemma3:27b/syscall/1076445 create mode 100644 results/classifier/gemma3:27b/syscall/1111 create mode 100644 results/classifier/gemma3:27b/syscall/1238 create mode 100644 results/classifier/gemma3:27b/syscall/1261 create mode 100644 results/classifier/gemma3:27b/syscall/127 create mode 100644 results/classifier/gemma3:27b/syscall/1319100 create mode 100644 results/classifier/gemma3:27b/syscall/1346769 create mode 100644 results/classifier/gemma3:27b/syscall/1356916 create mode 100644 results/classifier/gemma3:27b/syscall/1361 create mode 100644 results/classifier/gemma3:27b/syscall/1394 create mode 100644 results/classifier/gemma3:27b/syscall/140 create mode 100644 results/classifier/gemma3:27b/syscall/1416988 create mode 100644 results/classifier/gemma3:27b/syscall/1457275 create mode 100644 results/classifier/gemma3:27b/syscall/1462640 create mode 100644 results/classifier/gemma3:27b/syscall/1470170 create mode 100644 results/classifier/gemma3:27b/syscall/1516408 create mode 100644 results/classifier/gemma3:27b/syscall/1563612 create mode 100644 results/classifier/gemma3:27b/syscall/1594394 create mode 100644 results/classifier/gemma3:27b/syscall/1605443 create mode 100644 results/classifier/gemma3:27b/syscall/1617929 create mode 100644 results/classifier/gemma3:27b/syscall/1619896 create mode 100644 results/classifier/gemma3:27b/syscall/1643619 create mode 100644 results/classifier/gemma3:27b/syscall/1650 create mode 100644 results/classifier/gemma3:27b/syscall/1667401 create mode 100644 results/classifier/gemma3:27b/syscall/1673976 create mode 100644 results/classifier/gemma3:27b/syscall/1689367 create mode 100644 results/classifier/gemma3:27b/syscall/1701808 create mode 100644 results/classifier/gemma3:27b/syscall/1701971 create mode 100644 results/classifier/gemma3:27b/syscall/1701973 create mode 100644 results/classifier/gemma3:27b/syscall/1701974 create mode 100644 results/classifier/gemma3:27b/syscall/1707 create mode 100644 results/classifier/gemma3:27b/syscall/1716292 create mode 100644 results/classifier/gemma3:27b/syscall/1726394 create mode 100644 results/classifier/gemma3:27b/syscall/1728116 create mode 100644 results/classifier/gemma3:27b/syscall/1729 create mode 100644 results/classifier/gemma3:27b/syscall/1734 create mode 100644 results/classifier/gemma3:27b/syscall/1734792 create mode 100644 results/classifier/gemma3:27b/syscall/1738545 create mode 100644 results/classifier/gemma3:27b/syscall/1749393 create mode 100644 results/classifier/gemma3:27b/syscall/1756 create mode 100644 results/classifier/gemma3:27b/syscall/1760 create mode 100644 results/classifier/gemma3:27b/syscall/1761153 create mode 100644 results/classifier/gemma3:27b/syscall/1770 create mode 100644 results/classifier/gemma3:27b/syscall/1777226 create mode 100644 results/classifier/gemma3:27b/syscall/1783362 create mode 100644 results/classifier/gemma3:27b/syscall/1785203 create mode 100644 results/classifier/gemma3:27b/syscall/1791763 create mode 100644 results/classifier/gemma3:27b/syscall/1791796 create mode 100644 results/classifier/gemma3:27b/syscall/1805913 create mode 100644 results/classifier/gemma3:27b/syscall/1808563 create mode 100644 results/classifier/gemma3:27b/syscall/1810433 create mode 100644 results/classifier/gemma3:27b/syscall/1821006 create mode 100644 results/classifier/gemma3:27b/syscall/1829459 create mode 100644 results/classifier/gemma3:27b/syscall/1837 create mode 100644 results/classifier/gemma3:27b/syscall/1857811 create mode 100644 results/classifier/gemma3:27b/syscall/1858461 create mode 100644 results/classifier/gemma3:27b/syscall/1860053 create mode 100644 results/classifier/gemma3:27b/syscall/1861341 create mode 100644 results/classifier/gemma3:27b/syscall/1876373 create mode 100644 results/classifier/gemma3:27b/syscall/1884719 create mode 100644 results/classifier/gemma3:27b/syscall/1886097 create mode 100644 results/classifier/gemma3:27b/syscall/1887306 create mode 100644 results/classifier/gemma3:27b/syscall/1893010 create mode 100644 results/classifier/gemma3:27b/syscall/1894361 create mode 100644 results/classifier/gemma3:27b/syscall/1895305 create mode 100644 results/classifier/gemma3:27b/syscall/1906193 create mode 100644 results/classifier/gemma3:27b/syscall/1910605 create mode 100644 results/classifier/gemma3:27b/syscall/1915925 create mode 100644 results/classifier/gemma3:27b/syscall/1926044 create mode 100644 results/classifier/gemma3:27b/syscall/1926246 create mode 100644 results/classifier/gemma3:27b/syscall/1926521 create mode 100644 results/classifier/gemma3:27b/syscall/1926996 create mode 100644 results/classifier/gemma3:27b/syscall/2112 create mode 100644 results/classifier/gemma3:27b/syscall/2168 create mode 100644 results/classifier/gemma3:27b/syscall/2170 create mode 100644 results/classifier/gemma3:27b/syscall/2197 create mode 100644 results/classifier/gemma3:27b/syscall/2262 create mode 100644 results/classifier/gemma3:27b/syscall/2333 create mode 100644 results/classifier/gemma3:27b/syscall/2353 create mode 100644 results/classifier/gemma3:27b/syscall/2390 create mode 100644 results/classifier/gemma3:27b/syscall/2410 create mode 100644 results/classifier/gemma3:27b/syscall/2446 create mode 100644 results/classifier/gemma3:27b/syscall/2485 create mode 100644 results/classifier/gemma3:27b/syscall/2504 create mode 100644 results/classifier/gemma3:27b/syscall/2553 create mode 100644 results/classifier/gemma3:27b/syscall/2606 create mode 100644 results/classifier/gemma3:27b/syscall/263 create mode 100644 results/classifier/gemma3:27b/syscall/2825 create mode 100644 results/classifier/gemma3:27b/syscall/306 create mode 100644 results/classifier/gemma3:27b/syscall/356 create mode 100644 results/classifier/gemma3:27b/syscall/456 create mode 100644 results/classifier/gemma3:27b/syscall/470 create mode 100644 results/classifier/gemma3:27b/syscall/570 create mode 100644 results/classifier/gemma3:27b/syscall/577 create mode 100644 results/classifier/gemma3:27b/syscall/578 create mode 100644 results/classifier/gemma3:27b/syscall/579 create mode 100644 results/classifier/gemma3:27b/syscall/602 create mode 100644 results/classifier/gemma3:27b/syscall/633 create mode 100644 results/classifier/gemma3:27b/syscall/654 create mode 100644 results/classifier/gemma3:27b/syscall/714 create mode 100644 results/classifier/gemma3:27b/syscall/817 create mode 100644 results/classifier/gemma3:27b/syscall/829 create mode 100644 results/classifier/gemma3:27b/syscall/833 create mode 100644 results/classifier/gemma3:27b/syscall/834 create mode 100644 results/classifier/gemma3:27b/syscall/836 create mode 100644 results/classifier/gemma3:27b/syscall/856 create mode 100644 results/classifier/gemma3:27b/syscall/871 create mode 100644 results/classifier/gemma3:27b/syscall/885 create mode 100644 results/classifier/gemma3:27b/syscall/911 create mode 100644 results/classifier/gemma3:27b/syscall/927 create mode 100644 results/classifier/gemma3:27b/syscall/95 create mode 100644 results/classifier/gemma3:27b/syscall/957 create mode 100644 results/classifier/gemma3:27b/syscall/982 create mode 100644 results/classifier/user-mode-bugs/1079080 create mode 100644 results/classifier/user-mode-bugs/1156313 create mode 100644 results/classifier/user-mode-bugs/1377 create mode 100644 results/classifier/user-mode-bugs/1751494 create mode 100644 results/classifier/user-mode-bugs/1809546 create mode 100644 results/classifier/user-mode-bugs/1824778 create mode 100644 results/classifier/user-mode-bugs/1898954 create mode 100644 results/classifier/user-mode-bugs/1955 diff --git a/classification/preambel-user-mode b/classification/preambel-user-mode new file mode 100644 index 000000000..1f9de3061 --- /dev/null +++ b/classification/preambel-user-mode @@ -0,0 +1,10 @@ + +Classify the given bug report. It is part of qemu. + +These are the possible categories: + +instruction: A faulty instruction is described in the bug report +syscall: An error with a syscall +runtime: Other errors which happen in runtime. + +Respond only with a single word, the name of the category. diff --git a/results/classifier/accel-gemma3:12b/analysis.csv b/results/classifier/accel-gemma3:12b/analysis.csv index 873129d78..46cdfafd5 100644 --- a/results/classifier/accel-gemma3:12b/analysis.csv +++ b/results/classifier/accel-gemma3:12b/analysis.csv @@ -1,3 +1,3 @@ category, count -tcg, 71 +tcg, 114 kvm, 5 diff --git a/results/classifier/deepseek-r1:32b/analysis.csv b/results/classifier/deepseek-r1:32b/analysis.csv new file mode 100644 index 000000000..25b59b6d7 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/analysis.csv @@ -0,0 +1,4 @@ +category, count +instruction, 98 +runtime, 12 +manual-review, 1 diff --git a/results/classifier/deepseek-r1:32b/categories.csv b/results/classifier/deepseek-r1:32b/categories.csv new file mode 100644 index 000000000..a66597726 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/categories.csv @@ -0,0 +1,5 @@ +category, count +instruction, 156 +manual-review, 40 +runtime, 273 +syscall, 74 diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1022 b/results/classifier/deepseek-r1:32b/output/instruction/1022 new file mode 100644 index 000000000..d81aef62e --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1022 @@ -0,0 +1,36 @@ + + + +RISC-V: Simulation terminated with seg fault when encountering `vsra.vx` +Description of problem: +QEMU simulation terminated with segmentation fault. Here is the backtrace of the simulation + +``` +(gdb) r +Starting program: qemu/build/qemu-riscv64 -cpu rv64,vext_spec=v1.0,v=true,Zfh=true,Zve32f=true,Zve64f=true,vlen=128 -B 0x100000 a.out +Missing separate debuginfos, use: yum debuginfo-install glibc-2.28-164.el8_5.3.x86_64 +[Thread debugging using libthread_db enabled] +Using host libthread_db library "/lib64/libthread_db.so.1". +[New Thread 0x7ffff4edd700 (LWP 3239772)] + +Thread 1 "qemu-riscv64" received signal SIGSEGV, Segmentation fault. +0x00007fffe8004fad in code_gen_buffer () +Missing separate debuginfos, use: yum debuginfo-install glib2-2.56.4-156.el8.x86_64 gmp-6.1.2-10.el8.x86_64 gnutls-3.6.16-4.el8.x86_64 libffi-3.1-22.el8.x86_64 libidn2-2.2.0-1.el8.x86_64 libtasn1-4.13-3.el8.x86_64 libunistring-0.9.9-3.el8.x86_64 p11-kit-0.23.22-1.el8.x86_64 pcre-8.42-6.el8.x86_64 +(gdb) bt +#0 0x00007fffe8004fad in code_gen_buffer () +#1 0x00005555556a0b9b in cpu_tb_exec (tb_exit=, itb=, cpu=0x7fffe8005000 ) at ../accel/tcg/cpu-exec.c:358 +#2 cpu_loop_exec_tb (tb_exit=, last_tb=, tb=, cpu=0x7fffe8005000 ) at ../accel/tcg/cpu-exec.c:848 +#3 cpu_exec (cpu=cpu@entry=0x555555eed3d0) at ../accel/tcg/cpu-exec.c:1007 +#4 0x00005555555e6d30 in cpu_loop (env=0x555555ef56f0) at ../linux-user/riscv/cpu_loop.c:37 +#5 0x00005555555df9f7 in main (argc=, argv=, envp=) at ../linux-user/main.c:909 +``` +Steps to reproduce: +1. Checkout to QEMU's latest master (`ec11dc41eec5142b4776db1296972c6323ba5847`) +2. `mkdir build ; cd build ; ../configure ; make -j24` +3. `qemu-riscv64 -cpu rv64,vext_spec=v1.0,v=true,Zfh=true,Zve32f=true,Zve64f=true,vlen=128 -B 0x100000 ./a.out` +Additional information: +Attaching code (output.c) and binary (a.out) + +[a.out](/uploads/0ecfb436a439619527ef645bdc781a48/a.out) + +[output.c](/uploads/cd492b4c9468f0b48412e76e7f6fcf91/output.c) diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1028 b/results/classifier/deepseek-r1:32b/output/instruction/1028 new file mode 100644 index 000000000..cda1046c4 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1028 @@ -0,0 +1,37 @@ + + + +Assert fail for RISC-V RVV vmv.v.x for e64, vl == vl_max on RV32 guest +Description of problem: +assert message: +qemu/tcg/tcg-op-gvec.c:1714: tcg_gen_gvec_dup_i32: Assertion `vece <= MO_32' failed. + +For a e64 vmv.v.x, in the file trans_rvv.c.inc, function "trans_vmv_v_x", when s->vl_eq_vlmax is true, then "tcg_gen_gvec_dup_tl" (it's defined to tcg_gen_gvec_dup_i32 for RV32) is called. In "tcg_gen_gvec_dup_i32" the assert "tcg_debug_assert(vece <= MO_32) will be triggered, since vece == MO_64 for e64. +Steps to reproduce: +1.enable cfg.Zve64f + +2.Prepare a problem as set e64, vl == vl_max and use vmv.v.x, maybe as below +``` + li t0, -1, + vsetvli x0, t0, e64,m1,tu,mu + li t1, -1 + vmv.v.x v0, t1 +``` +Additional information: +Below is a possible solution if it's appropriate. +``` +#if TARGET_LONG_BITS == 32 + if (s->sew == 3) { + TCGv_i64 s1_i64 = tcg_temp_new_i64(); + tcg_gen_ext_tl_i64(s1_i64, s1); + tcg_gen_gvec_dup_i64(s->sew, vreg_ofs(s, a->rd), + MAXSZ(s), MAXSZ(s), s1_i64); + tcg_temp_free_i64(s1_i64); + } else { +#endif + tcg_gen_gvec_dup_tl(s->sew, vreg_ofs(s, a->rd), + MAXSZ(s), MAXSZ(s), s1); +#if TARGET_LONG_BITS == 32 + } +#endif +``` diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1051 b/results/classifier/deepseek-r1:32b/output/instruction/1051 new file mode 100644 index 000000000..fcbf675c2 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1051 @@ -0,0 +1,4 @@ + + + +or1k tcg SIGILL diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1054812 b/results/classifier/deepseek-r1:32b/output/instruction/1054812 new file mode 100644 index 000000000..8b1d033c4 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1054812 @@ -0,0 +1,8 @@ + + + +Configure uses wrong libtool on Darwin + +On Darwin/OS X, there are two versions of libtool: the GNU libtool, and Apple's libtool. Both are installed, but Apple's libtool (libtool) won't build libcacard that Qemu uses, but Gnu's libtool (glibtool) does. I get around using Apple's libtool by passing LIBTOOL=glibtool when configuring; unfortunately this variable isn't preserved so when Qemu's configure changes it's not passed. A simple switch in the configure script could check for Darwin, then if present, use glibtool. Or configure could check the features of libtool, see if they can build libcacard, then look for alternatives like glibtool. + +This bug was probably introduced when libcacard was added to Qemu, and is present in commit 93b6599734f81328ee3d608f57667742cafeea72. \ No newline at end of file diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1086 b/results/classifier/deepseek-r1:32b/output/instruction/1086 new file mode 100644 index 000000000..56d70a3b2 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1086 @@ -0,0 +1,72 @@ + + + +Numpy/scipy test suites fails in QEMU on ppc64le (but not on aarch64) +Description of problem: +I'm not really qualified to report this problem, but after being affected by it for ~2 years (and QEMU 7 not fixing things), I decided to give it a shot. Please excuse reporting deficiencies, I'll endeavour to fix them as best I can once pointed out. + +In my spare time, I help out for the packaging effort in the [conda-forge](https://conda-forge.org/) ecosystem, which is mostly associated/attached to the python world, but - in contrast to the vanilla python tools - also deals with non-python dependencies, and in particular has strong enough abstractions to deal with ABI-issues and generally provides much better integration than the packages on PyPI. + +This strength of abstraction has also allowed conda-forge to publish artefacts for many more architectures than most projects are commonly able to provide precompiled binaries for. Due to the lack of (reliable) public CI for aarch64 & ppc64le, these packages are mostly cross-compiled from linux-x86. Where cross compilation is not possible, the packages are compiled in emulation through QEMU, coming through https://github.com/multiarch/qemu-user-static (this is the part of the infrastructure I don't fully understand myself...). The full infrastructure is somewhat involved, but should not be relevant (hopefully) to the issue at hand (see instructions below) - and even if that turns out to be the case, that would be a great information gain as well. + +In either case, the tests for the package (ideally comprising the entire upstream test suite) are then run in emulation. + +Two of the so-called "feedstocks" I co-maintain are for [numpy](https://github.com/conda-forge/numpy-feedstock) and [scipy](https://github.com/conda-forge/scipy-feedstock), and there have been persistent issues with running the test suite in emulation on PPC (interestingly, the same setup on a different architecture - aarch64 - has no problems). However, the compiled artefacts on PPC run fine on native hardware. + +Said otherwise, it appears numpy/scipy are exercising QEMU enough to uncover some bugs. I've seen similar problems also in other packages (e.g. the cvxpy-stack), reinforcing the impression that this is a QEMU issue, and not one on the level of the individual packages. + +Depending on the exact combination of python version, the result of the numpy test suite might be as follows: +``` +320 failed, 18900 passed, 361 skipped, 36 xfailed, 9 xpassed, 144 warnings in 2516.49s (0:41:56) +``` + +Looking at the test failures, sometimes the results are garbage +``` +> assert_array_max_ulp(x, x+eps, maxulp=20) +E AssertionError: Arrays are not almost equal up to 20 ULP (max difference is 8.55554e+08 ULP) + +eps = 1.1920929e-07 +self = +x = array([ 2.3744986e-38, nan, 2.2482052e-15, 7.5780330e+28, + nan, nan, 5.8310814e+29, -5.6511531e+24, + 1.0010809e+00, 1.0101526e+00], dtype=float32) +``` +sometimes the values are permuted +``` +> assert_array_equal(actual, desired) +E AssertionError: +E Arrays are not equal +E +E x and y nan location mismatch: +E x: array([0.000000e+00, 6.704092e-39, 9.000000e+00, 2.350989e-38, +E 0.000000e+00, 0.000000e+00, 0.000000e+00, 0.000000e+00, +E 6.772341e-39, nan], dtype=float32) +E y: array([6.704092e-39, 6.772341e-39, 0.000000e+00, 0.000000e+00, +E 0.000000e+00, 0.000000e+00, nan, 2.350989e-38, +E 2.000000e+00, 7.000000e+00], dtype=float32) +``` +sometimes the results are fundamentally different (zero vs. non-zero) +``` +> raise AssertionError(msg) +E AssertionError: +E Arrays are not almost equal to 6 decimals +E +E Mismatched elements: 72 / 216 (33.3%) +E Max absolute difference: 1. +E Max relative difference: 1. +E x: array([[[[[0., 0., 0.], +E [0., 0., 0.], +E [0., 0., 0.]],... +E y: array([[[[[1., 0., 0.], +E [0., 1., 0.], +E [0., 0., 1.]],... +``` + +I don't know where it goes wrong, but it's not just a little tolerance violation. One PR that illustrates this is [here](https://github.com/conda-forge/numpy-feedstock/pull/274) and the respective CI run is [here](https://dev.azure.com/conda-forge/feedstock-builds/_build/results?buildId=526218&view=results) (ignore the errors for osx-arm64, those are unrelated). +Steps to reproduce: +1. In an emulated ppc64 machine, install miniforge from [here](https://github.com/conda-forge/miniforge/releases/latest/download/Miniforge3-Linux-ppc64le.sh) +2. Run `conda create -n test_env numpy pytest cython hypothesis typing_extensions` and then `conda activate test_env` +3. Run `python -c "import numpy; numpy.test()"` +4. Pick any test that fails and run it as `python -c "import numpy; numpy.test(tests='x.y.z')"` +Additional information: + diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1092 b/results/classifier/deepseek-r1:32b/output/instruction/1092 new file mode 100644 index 000000000..60c305b43 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1092 @@ -0,0 +1,17 @@ + + + +PPC: `sraw` instructions does not set `ca` and `ca32` flags. +Description of problem: +The translation of Power PC instruction `sraw` and `sraw.` don't set the `ca` or `ca32` flags although, according to +[PowerISA 3.1b](https://files.openpower.foundation/s/dAYSdGzTfW4j2r2) (page 140), they should. +Additional information: +This gets particular apparent if compared to `srawi` (which does set `ca`, `ca32`). + +**sraw** + +https://gitlab.com/qemu-project/qemu/-/blob/master/target/ppc/translate.c#L2914 + +**srawi** + +https://gitlab.com/qemu-project/qemu/-/blob/master/target/ppc/translate.c#L2924 diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1095857 b/results/classifier/deepseek-r1:32b/output/instruction/1095857 new file mode 100644 index 000000000..c82825908 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1095857 @@ -0,0 +1,14 @@ + + + +incorrect handling of [r32] address (long mode) + +while executing in Long Mode (x86-64) instructions such as + +mov eax,[r15d] + +end up executing as + +mov eax,[r15] + +according to x86 programmer manuals the behavior of using the Address-Size override (in long mode) is supposed to ignore the high 32bits of the register. I use this fact in my operating system to reduce register usage (the high 32 bits of r15 holds other data). consequently a general protection exception occurs since the memory address isn't "canonical". this error doesn't always appear since the high 32 bits might not be zero in those conditions. \ No newline at end of file diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1129571 b/results/classifier/deepseek-r1:32b/output/instruction/1129571 new file mode 100644 index 000000000..dc66da3bb --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1129571 @@ -0,0 +1,17 @@ + + + +libreoffice armhf FTBFS + +We have been experiencing FTBFS of LibreOffice 3.5.7, 12.04, armhf in the launchpad buildds. We believe this is likely due to an error in qemu. + +While we do not have a small test case yet, we do have a build log (attaching here). + +The relevant snippet from the build log is: + +3.5.7/solver/unxlngr.pro/bin/jaxp.jar:/build/buildd/libreoffice-3.5.7/solver/unxlngr.pro/bin/juh.jar:/build/buildd/libreoffice-3.5.7/solver/unxlngr.pro/bin/parser.jar:/build/buildd/libreoffice-3.5.7/solver/unxlngr.pro/bin/xt.jar:/build/buildd/libreoffice-3.5.7/solver/unxlngr.pro/bin/unoil.jar:/build/buildd/libreoffice-3.5.7/solver/unxlngr.pro/bin/ridl.jar:/build/buildd/libreoffice-3.5.7/solver/unxlngr.pro/bin/jurt.jar:/build/buildd/libreoffice-3.5.7/solver/unxlngr.pro/bin/xmlsearch.jar:/build/buildd/libreoffice-3.5.7/solver/unxlngr.pro/bin/LuceneHelpWrapper.jar:/build/buildd/libreoffice-3.5.7/solver/unxlngr.pro/bin/HelpIndexerTool.jar:/build/buildd/libreoffice-3.5.7/solver/unxlngr.pro/bin/lucene-core-2.3.jar:/build/buildd/libreoffice-3.5.7/solver/unxlngr.pro/bin/lucene-analyzers-2.3.jar" com.sun.star.help.HelpIndexerTool -lang cs -mod swriter -zipdir ../../unxlngr.pro/misc/ziptmpswriter_cs -o ../../unxlngr.pro/bin/swriter_cs.zip.unxlngr.pro +dmake: Error code 132, while making '../../unxlngr.pro/bin/swriter_cs.zip' + +We believe this is from bash error code 128 + 4, where 4 is illegal instruction, thus leading us to suspect qemu. + +Any help in tracking this down would be appreciated. \ No newline at end of file diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1156 b/results/classifier/deepseek-r1:32b/output/instruction/1156 new file mode 100644 index 000000000..76296a60b --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1156 @@ -0,0 +1,4 @@ + + + +Incorrect implementation of vmsumudm instruction diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1178 b/results/classifier/deepseek-r1:32b/output/instruction/1178 new file mode 100644 index 000000000..ffe5b226f --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1178 @@ -0,0 +1,4 @@ + + + +is that riscv64 `feq.s` only should consider the lowest 32-bits. diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1245543 b/results/classifier/deepseek-r1:32b/output/instruction/1245543 new file mode 100644 index 000000000..6ca865d95 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1245543 @@ -0,0 +1,26 @@ + + + +Wrong implementation of SSE4.1 pmovzxbw and similar instructions + +QEMU 1.5.0 (and git version, as far as I can tell from the source code) has incorrect implementation of pmovzxbw and similar SSE4.1 instructions. The instruction zero-extends the first 8 8-bit elements of a vector to 16bit vector and puts them to another vector. The current implementation applies this operation only to the first element and zeros out the rest. + +To verify, compile the attached program for SSE4.1 (g++ -msse4.1 cvtint.cc). On real hardware, it produces the following output: + +$ ./a.out +1 0 2 0 3 0 4 0 5 0 6 0 7 0 8 0 + +On QEMU, the output is as follows: + +$ ./a.out +1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 + +QEMU is invoked as: + +qemu-system-x86_64 \ + -M pc -cpu Haswell,+sse4.1,+avx,+avx2,+fma,enforce -m 512 \ + -serial stdio -no-reboot \ + -kernel vmlinuz -initrd initrd.img \ + -netdev user,id=user.0 -device rtl8139,netdev=user.0 -redir tcp:2222::22 \ + -hda ubuntu-amd64.ext3 \ + --append "rw console=tty root=/dev/sda" \ No newline at end of file diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1248168 b/results/classifier/deepseek-r1:32b/output/instruction/1248168 new file mode 100644 index 000000000..06b56a323 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1248168 @@ -0,0 +1,27 @@ + + + +MIPS, self-modifying code and uncached memory + +Self-modifying code does not work properly in MIPS in uncached and unmapped kseg1 memory region. + +For example, when running this code I get unexpected behavior: + + 0: e3000010 b 0x390 + 4: 00000000 nop + ... + 380: 00701f40 mfc0 ra,c0_epc + 384: 0400e0bb swr zero,4(ra) + 388: 18000042 eret + 38c: 00000000 nop + 390: 25500000 move t2,zero + 394: 02000b34 li t3,0x2 + 398: 23504b01 subu t2,t2,t3 + 39c: e9003c0b j 0xcf003a4 + 3a0: 0a004a21 addi t2,t2,10 + 3a4: ffff0010 b 0x3a4 + 3a8: 00000000 nop + 3ac: 00000000 nop + + I expect that swr instruction in line 384 would change `addi t2,t2,1`0 to `nop` +This should work because no cache is used for this memory region. \ No newline at end of file diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1251 b/results/classifier/deepseek-r1:32b/output/instruction/1251 new file mode 100644 index 000000000..a9c329191 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1251 @@ -0,0 +1,18 @@ + + + +Octeon Instruction BBIT Bug +Steps to reproduce: +1. Compile 64bit binary for Octeon with Octeon instructions +`mips64-octeon-linux-gnu-gcc -o hello hello.c` +2. Run with `qemu-mips64` +`qemu-mips64 -cpu Octeon68XX hello` +3. Get the output below: +``` +qemu: uncaught target signal 4 (Illegal instruction) - core dumped +Illegal instruction +``` +Additional information: +I have a patch for this that I will be submitting to trivial-patches. This is not enough to emulate Octeon specific binaries alone. For small binaries mapping the `CVMSEG_LM = 0xFFFFFFFFFFFF8000 - 0xFFFFFFFFFFFF9FFF` to empty RAM and using this patch is enough. There are additional support issues for `N32` binaries that will require a separate issue. + +[hello](/uploads/d8b5e631508fd232b4a7b3a40f7e08f6/hello) diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1254786 b/results/classifier/deepseek-r1:32b/output/instruction/1254786 new file mode 100644 index 000000000..315e2004c --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1254786 @@ -0,0 +1,45 @@ + + + +qemu-m68k-static: illegal instruction ebc0 during debootstrap second stage + +Host: Ubuntu Precise amd64 +Guest: Debian (ports) sid m68k + +$ sudo qemu-debootstrap --no-check-gpg --arch=m68k sid m68k http://ftp.debian-ports.org/debian +I: Running command: debootstrap --arch m68k --foreign --no-check-gpg sid m68k http://ftp.debian-ports.org/debian +[...] +I: Running command: chroot m68k /debootstrap/debootstrap --second-stage +qemu: fatal: Illegal instruction: ebc0 @ f67e5662 +D0 = 6ffffef5 A0 = f67fbf58 F0 = 0000000000000000 ( 0) +D1 = 0000010a A1 = 00000000 F1 = 0000000000000000 ( 0) +D2 = 0000000f A2 = 00000000 F2 = 0000000000000000 ( 0) +D3 = 00000000 A3 = f67e0000 F3 = 0000000000000000 ( 0) +D4 = 00000000 A4 = 00000000 F4 = 0000000000000000 ( 0) +D5 = 00000000 A5 = f67fc000 F5 = 0000000000000000 ( 0) +D6 = 00000000 A6 = f6fff7cc F6 = 0000000000000000 ( 0) +D7 = 00000000 A7 = f6fff580 F7 = 0000000000000000 ( 0) +PC = f67e5662 SR = 0000 ----- FPRESULT = 0 +Aborted (core dumped) + +ProblemType: Bug +DistroRelease: Ubuntu 12.04 +Package: qemu-user-static 1.0.50-2012.03-0ubuntu2.1 +ProcVersionSignature: Ubuntu 3.8.0-33.48~precise1-generic 3.8.13.11 +Uname: Linux 3.8.0-33-generic x86_64 +NonfreeKernelModules: wl +ApportVersion: 2.0.1-0ubuntu17.6 +Architecture: amd64 +Date: Mon Nov 25 16:08:26 2013 +Dependencies: + +InstallationMedia: Ubuntu 12.04.3 LTS "Precise Pangolin" - Release amd64 (20130820.1) +MarkForUpload: True +ProcEnviron: + LANGUAGE=en_GB:en + TERM=xterm + PATH=(custom, no user) + LANG=en_GB.UTF-8 + SHELL=/bin/bash +SourcePackage: qemu-linaro +UpgradeStatus: No upgrade log present (probably fresh install) \ No newline at end of file diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1267955 b/results/classifier/deepseek-r1:32b/output/instruction/1267955 new file mode 100644 index 000000000..f18f0c665 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1267955 @@ -0,0 +1,45 @@ + + + +[i386] Parity Flag Not Set On xor %eax,%eax + +Tested against qemu-1.7.0 as well as qemu-1.7.50 on Debian Sid + +Steps To Reproduce + +$ cat > prog.hex << EOF + +7f 45 4c 46 01 01 01 00 00 00 00 00 00 00 00 00 +02 00 03 00 01 00 00 00 54 80 04 08 34 00 00 00 +00 00 00 00 00 00 00 00 34 00 20 00 01 00 28 00 +00 00 00 00 01 00 00 00 00 00 00 00 00 80 04 08 +00 80 04 08 76 00 00 00 76 00 00 00 05 00 00 00 +00 10 00 00 + +31 c0 +9c + +b8 04 00 00 00 +bb 01 00 00 00 +89 e1 +ba 04 00 00 00 +cd 80 + +b8 01 00 00 00 +bb 00 00 00 00 +cd 80 + +EOF + +$ xxd -p -r prog.hex > prog +$ chmod 700 prog + +$ ./prog | hexdump -vC +00000000 46 02 00 00 |F...| +00000004 + +$ qemu-i386 ./prog | hexdump -vC +00000000 42 02 00 00 |B...| +00000004 + +On the other hand if [xor %eax, %eax] (31 c0) is replaced with sub %eax,%eax (29 c0), then the parity flag is set correctly. \ No newline at end of file diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1283519 b/results/classifier/deepseek-r1:32b/output/instruction/1283519 new file mode 100644 index 000000000..c65570d39 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1283519 @@ -0,0 +1,13 @@ + + + +PowerPC altivec rounding instructions vrfi(m|n|z)not correctly mapped + +When using ppc-linux-user/qemu-ppc on a ppc ELF executable, I see that QEMU wrongly recognizes the vrfim, vrfin and vrfiz instructions: + +If the binary contains vrfim QEMU sees vrfiz +If the binary contains vrfin QEMU sees vrfim +If the binary contains vrfiz QEMU sees vrfin +The vrfip instruction is correctly recognized. + +Those instructions normally round a floating-point altivec vector to zero (z), infinity (p), minus infinity (m) or nearest (n). \ No newline at end of file diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1308381 b/results/classifier/deepseek-r1:32b/output/instruction/1308381 new file mode 100644 index 000000000..0560b9306 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1308381 @@ -0,0 +1,17 @@ + + + +illegal instructions for AArch64 ARMv8 + +The test case is in the attachment. To reproduce as following (I tried both GCC and Clang): +$aarch64-linux-gnu-gcc qemu.c -o test +$./test +qemu: uncaught target signal 4 (Illegal instruction) - core dumped +Illegal instruction (core dumped) + +There are 3 intrinsics are tested in the test case: vqmovunh_s16, vqmovuns_s32, vqmovund_s64. They will be compiled into instructions: +SQXTUN Bd, Hn +SQXTUN Hd, Sn +SQXTUN Sd, Dn. + +It seems that these instructions are not supported in QEMU. Is this a bug? \ No newline at end of file diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1328996 b/results/classifier/deepseek-r1:32b/output/instruction/1328996 new file mode 100644 index 000000000..418a4353a --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1328996 @@ -0,0 +1,6 @@ + + + +[AArch64] - blr x30 is handled incorrectly + +Whenever x30 is used as the operand for blr, the result will be incorrect. There is no restriction on using x30 (LR) with the blr instruction in the ARMv8 manual. There are two statically linked 64-bit executables in files.tar.gz: good and bad. The executable "good" uses "blr x9", and the output is what is expected: "func". The executable "bad" uses "blr x30" and nothing is printed out. It prints "func" on the actual device. \ No newline at end of file diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1339 b/results/classifier/deepseek-r1:32b/output/instruction/1339 new file mode 100644 index 000000000..dd0af5e85 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1339 @@ -0,0 +1,19 @@ + + + +RVV vfncvt.rtz.x.f.w Assertion failed +Description of problem: +when execute +``` +vsetvli t0, x0, e16, m1 +vfncvt.rtz.x.f.w v0, v4 +``` +report error: + +qemu-riscv64: ../target/riscv/translate.c:212: decode_save_opc: Assertion \`ctx->insn_start != NULL' failed. Segmentation fault (core dumped) +Steps to reproduce: +1. write the code +2. compile +3. excute +Additional information: + diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1370 b/results/classifier/deepseek-r1:32b/output/instruction/1370 new file mode 100644 index 000000000..da284a401 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1370 @@ -0,0 +1,16 @@ + + + +x86 BLSI and BLSR semantic bug +Description of problem: +The result of instruction BLSI and BLSR is different from the CPU. The value of CF is different. +Steps to reproduce: +1. Compile this code +``` +void main() { + asm("blsi rax, rbx"); +} +``` +2. Execute and compare the result with the CPU. The value of `CF` is exactly the opposite. This problem happens with BLSR, too. +Additional information: +This bug is discovered by research conducted by KAIST SoftSec. diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1371 b/results/classifier/deepseek-r1:32b/output/instruction/1371 new file mode 100644 index 000000000..a77c0b1fe --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1371 @@ -0,0 +1,22 @@ + + + +x86 BLSMSK semantic bug +Description of problem: +The result of instruction BLSMSK is different with from the CPU. The value of CF is different. +Steps to reproduce: +1. Compile this code +``` +void main() { + asm("mov rax, 0x65b2e276ad27c67"); + asm("mov rbx, 0x62f34955226b2b5d"); + asm("blsmsk eax, ebx"); +} +``` +2. Execute and compare the result with the CPU. + - CPU + - CF = 0 + - QEMU + - CF = 1 +Additional information: +This bug is discovered by research conducted by KAIST SoftSec. diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1372 b/results/classifier/deepseek-r1:32b/output/instruction/1372 new file mode 100644 index 000000000..ded24a388 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1372 @@ -0,0 +1,23 @@ + + + +x86 BEXTR semantic bug +Description of problem: +The result of instruction BEXTR is different with from the CPU. The value of destination register is different. I think QEMU does not consider the operand size limit. +Steps to reproduce: +1. Compile this code +``` +void main() { + asm("mov rax, 0x17b3693f77fb6e9"); + asm("mov rbx, 0x8f635a775ad3b9b4"); + asm("mov rcx, 0xb717b75da9983018"); + asm("bextr eax, ebx, ecx"); +} +``` +2. Execute and compare the result with the CPU. + - CPU + - RAX = 0x5a + - QEMU + - RAX = 0x635a775a +Additional information: +This bug is discovered by research conducted by KAIST SoftSec. diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1373 b/results/classifier/deepseek-r1:32b/output/instruction/1373 new file mode 100644 index 000000000..d31e7a28e --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1373 @@ -0,0 +1,23 @@ + + + +x86 ADOX and ADCX semantic bug +Description of problem: +The result of instruction ADOX and ADCX are different from the CPU. The value of one of EFLAGS is different. +Steps to reproduce: +1. Compile this code +``` +void main() { + asm("push 512; popfq;"); + asm("mov rax, 0xffffffff84fdbf24"); + asm("mov rbx, 0xb197d26043bec15d"); + asm("adox eax, ebx"); +} +``` +2. Execute and compare the result with the CPU. This problem happens with ADCX, too (with CF). + - CPU + - OF = 0 + - QEMU + - OF = 1 +Additional information: +This bug is discovered by research conducted by KAIST SoftSec. diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1374 b/results/classifier/deepseek-r1:32b/output/instruction/1374 new file mode 100644 index 000000000..f23fdda34 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1374 @@ -0,0 +1,25 @@ + + + +x86 BZHI semantic bug +Description of problem: +The result of instruction BZHI is different from the CPU. The value of destination register and SF of EFLAGS are different. +Steps to reproduce: +1. Compile this code +``` +void main() { + asm("mov rax, 0xb1aa9da2fe33fe3"); + asm("mov rbx, 0x80000000ffffffff"); + asm("mov rcx, 0xf3fce8829b99a5c6"); + asm("bzhi rax, rbx, rcx"); +} +``` +2. Execute and compare the result with the CPU. + - CPU + - RAX = 0x0x80000000ffffffff + - SF = 1 + - QEMU + - RAX = 0xffffffff + - SF = 0 +Additional information: +This bug is discovered by research conducted by KAIST SoftSec. diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1375 b/results/classifier/deepseek-r1:32b/output/instruction/1375 new file mode 100644 index 000000000..62948b02a --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1375 @@ -0,0 +1,22 @@ + + + +x86 SSE/SSE2/SSE3 instruction semantic bugs with NaN +Description of problem: +The result of SSE/SSE2/SSE3 instructions with NaN is different from the CPU. From Intel manual Volume 1 Appendix D.4.2.2, they defined the behavior of such instructions with NaN. But I think QEMU did not implement this semantic exactly because the byte result is different. +Steps to reproduce: +1. Compile this code +``` +void main() { + asm("mov rax, 0x000000007fffffff; push rax; mov rax, 0x00000000ffffffff; push rax; movdqu XMM1, [rsp];"); + asm("mov rax, 0x2e711de7aa46af1a; push rax; mov rax, 0x7fffffff7fffffff; push rax; movdqu XMM2, [rsp];"); + asm("addsubps xmm1, xmm2"); +} +``` +2. Execute and compare the result with the CPU. This problem happens with other SSE/SSE2/SSE3 instructions specified in the manual, Volume 1 Appendix D.4.2.2. + - CPU + - xmm1[3] = 0xffffffff + - QEMU + - xmm1[3] = 0x7fffffff +Additional information: +This bug is discovered by research conducted by KAIST SoftSec. diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1376 b/results/classifier/deepseek-r1:32b/output/instruction/1376 new file mode 100644 index 000000000..f6ab4a57f --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1376 @@ -0,0 +1,18 @@ + + + +x86 LSL and LAR fault +Description of problem: +From the description of LSL and LAR instructions in manual, `If the segment descriptor cannot be accessed or is an invalid type for the instruction, the ZF flag is cleared and no value is loaded in the destination operand.`. When it happens at the CPU, it seems they do nothing (nop). However, in QEMU, it crashes. +Steps to reproduce: +1. Compile this code +``` +void main() { + asm("mov rax, 0xa02e698e741f5a6a"); + asm("mov rbx, 0x20959ddd7a0aef"); + asm("lsl ax, bx"); +} +``` +2. Execute. QEMU crashes but CPU does not. This problem happens with LAR, too. +Additional information: +This bug is discovered by research conducted by KAIST SoftSec. diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1404690 b/results/classifier/deepseek-r1:32b/output/instruction/1404690 new file mode 100644 index 000000000..7ef62aa3c --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1404690 @@ -0,0 +1,41 @@ + + + +Qemu crashes with chrooted m68k + +I'm using qemu-m68k 2.2.0 to chroot into a m68k coldfire linux, which works fine on the coldfire machine. + +I've been able to use binfmt_msc and used the above code to use qemu with strace: + +#include +#include + +int main(int argc, char **argv, char **envp) { + char *newargv[argc + 4]; + + newargv[0] = argv[0]; + newargv[1] = "-cpu"; + newargv[2] = "cfv4e"; + newargv[3] = "-strace"; + + memcpy(&newargv[4], &argv[1], sizeof(*argv) * (argc - 1)); + newargv[argc + 3] = NULL; + return execve("/usr/bin/qemu-m68k", newargv, envp); +} + +Everything works fine. I can run bash, busybox, ash, but when I try to run a ls or just type an invalid command, I got the attached sequence of messages, which end like so: + +11351 waitpid(-1,0xf6fffa00,0x3) = -1 errno=10 (No child processes) +qemu: fatal: Illegal instruction: 0000 @ f6fffa30 +D0 = ffffffff A0 = f67dcf50 F0 = 0000000000000000 ( 0) +D1 = 0000000a A1 = f66e0898 F1 = 0000000000000000 ( 0) +D2 = f6fffaa8 A2 = f67df268 F2 = 0000000000000000 ( 0) +D3 = 00000000 A3 = 00000000 F3 = 0000000000000000 ( 0) +D4 = 00000008 A4 = 800026c4 F4 = 0000000000000000 ( 0) +D5 = 00000000 A5 = f67d98e0 F5 = 0000000000000000 ( 0) +D6 = f6fffaa8 A6 = f6fffa7c F6 = 0000000000000000 ( 0) +D7 = 00000002 A7 = f6fffa24 F7 = 0000000000000000 ( 0) +PC = f6fffa30 SR = 0000 ----- FPRESULT = 0 +Aborted + +How can I debug it further to try to figure out if this is a qemu issue or not? Thanks \ No newline at end of file diff --git a/results/classifier/deepseek-r1:32b/output/instruction/1428352 b/results/classifier/deepseek-r1:32b/output/instruction/1428352 new file mode 100644 index 000000000..860650387 --- /dev/null +++ b/results/classifier/deepseek-r1:32b/output/instruction/1428352 @@ -0,0 +1,47 @@ + + + +SYSRET instruction incorrectly implemented + +The Intel architecture manual states that when returning to user mode, the SYSRET instruction will re-load the stack selector (%ss) from the IA32_STAR model specific register using the following logic: + +SS.Selector <-- (IA32_STAR[63:48]+8) OR 3; (* RPL forced to 3 *) + +Another description of the instruction behavior which shows the same logic in a slightly different form can also be found here: + +http://tptp.cc/mirrors/siyobik.info/instruction/SYSRET.html + +[...] + SS(SEL) = IA32_STAR[63:48] + 8; + SS(PL) = 0x3; +[...] + +In other words, the value of the %ss register is supposed to be loaded from bits 63:48 of the IA32_STAR model-specific register, incremented by 8, and then ORed with 3. ORing in the 3 sets the privilege level to 3 (user). This is done since SYSRET returns to user mode after a system call. + +However, helper_sysret() in target-i386/seg_helper.c does not do the "OR 3" step. The code looks like this: + + cpu_x86_load_seg_cache(env, R_SS, selector + 8, + 0, 0xffffffff, + DESC_G_MASK | DESC_B_MASK | DESC_P_MASK | + DESC_S_MASK | (3 << DESC_DPL_SHIFT) | + DESC_W_MASK | DESC_A_MASK); + +It should look like this: + + cpu_x86_load_seg_cache(env, R_SS, (selector + 8) | 3, + 0, 0xffffffff, + DESC_G_MASK | DESC_B_MASK | DESC_P_MASK | + DESC_S_MASK | (3 << DESC_DPL_SHIFT) | + DESC_W_MASK | DESC_A_MASK); + +The code does correctly set the privilege level bits for the code selector register (%cs) but not for the stack selector (%ss). + +The effect of this is that when SYSRET returns control to the user-mode caller, %ss will be have the privilege level bits cleared. In my case, it went from 0x2b to 0x28. This caused a crash later: when the user-mode code was preempted by an interrupt, and the interrupt handler would do an IRET, a general protection fault would occur because the %ss value being loaded from the exception frame was not valid for user mode. (At least, I think that's what happened.) + +This behavior seems inconsistent with real hardware, and also appears to be wrong with respect to the Intel documentation, so I'm pretty confident in calling this a bug. :) + +Note that this issue seems to have been around for a long time. I discovered it while using QEMU 2.2.0, but I happened to have the sources for QEMU 0.10.5, and the problem is there too (in os_helper.c). I am using FreeBSD/amd64 9.1-RELEASE as my host system, without KVM. + +The fix is fairly simple. I'm attaching a patch which worked for me. Using this fix, the code that I'm testing now behaves the same on the QEMU virtual machine as on real hardware. + +- Bill (