[
    {
        "id": 1,
        "question": "Ben Milo is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Ben Milo with grounding.",
        "prediction": "<ref>Ben Milo</ref><box>(646,219),(958,985)</box>"
    },
    {
        "id": 2,
        "question": "Caption Mitch is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Caption Mitch doing?",
        "prediction": "Caption Mitch is standing with his arms crossed."
    },
    {
        "id": 3,
        "question": "Caption Mitch is <img>image_1</img>\n Vern is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Mitch is standing on the left, facing Vern, who is on the right holding a horse's reins. Both are in military attire, in a rugged outdoor setting with rocky terrain and vegetation in the background. There are other soldiers and horses under the sky."
    },
    {
        "id": 4,
        "question": "Caption Mitch is <img>image_1</img>\n Gen. Dostum is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is holding an interphone?",
        "prediction": "Gen. Dostum is holding an interphone."
    },
    {
        "id": 5,
        "question": "Hal Spencer is <img>image_1</img>\n In the image: <img>image_2</img>\n How is his state?",
        "prediction": "Hal Spencer is lying on the ground, seemingly injured."
    },
    {
        "id": 6,
        "question": "Vern is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Vern doing?",
        "prediction": "Vern is aiming a rifle."
    },
    {
        "id": 7,
        "question": "Kyle is <img>image_1</img>\n Adam is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Kyle's state?",
        "prediction": "Kyle is in a conversation with another person."
    },
    {
        "id": 8,
        "question": "Diane is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Diane with grounding.",
        "prediction": "<ref>Diane</ref><box>(10,11),(417,992)</box>"
    },
    {
        "id": 9,
        "question": "Adam is <img>image_1</img>\n Diane is <img>image_2</img>\n Richard is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Richard with grounding.",
        "prediction": "<ref>Richard</ref><box>(0,205),(341,976)</box>"
    },
    {
        "id": 10,
        "question": "Alan is <img>image_1</img>\n Mitch is <img>image_2</img>\n Please describe the image: <img>image_3</img>\n",
        "prediction": "Alan is seated on the left, holding a bowl of chips, while Mitch is on the right with his arm resting on a chair. They appear to be in a medical setting."
    },
    {
        "id": 11,
        "question": "Mitch is <img>image_1</img>\n In the image: <img>image_2</img>\n What color is Mitch's clothes?",
        "prediction": "Mitch is wearing a beige coat and blue jeans."
    },
    {
        "id": 12,
        "question": "Diane is <img>image_1</img>\n Richard is <img>image_2</img>\n In the image: <img>image_3</img>\n What is the spatial relation between Diane and Richard?",
        "prediction": "Diane is in front of Richard."
    },
    {
        "id": 13,
        "question": "Jackson is <img>image_1</img>\n Lilly is <img>image_2</img>\n Noah is <img>image_3</img>\n Please describe the photo: <img>image_4</img>\n",
        "prediction": "Yes\n\nJackson is walking holding hands with a young girl, Lilly. Noah seems to be following them, standing to the side with his hand in his pocket. There are soldiers in uniform nearby, one of them holding a gun."
    },
    {
        "id": 14,
        "question": "Kate is <img>image_1</img>\n Lilly is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Kate doing?",
        "prediction": "Kate is embracing Lilly."
    },
    {
        "id": 15,
        "question": "Jackson is <img>image_1</img>\n Gordon is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Gordon with grounding.",
        "prediction": "<ref>Gordon</ref><box>(535,128),(916,996)</box>"
    },
    {
        "id": 16,
        "question": "Kate is <img>image_1</img>\n Noah is <img>image_2</img>\n In the image: <img>image_3</img>\n From our view, which side of Kate is Noah at?",
        "prediction": "Noah is on the right side of Kate."
    },
    {
        "id": 17,
        "question": "Jackson is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Jackson with grounding.",
        "prediction": "<ref>Jackson</ref><box>(421,104),(733,945)</box>"
    },
    {
        "id": 18,
        "question": "Dukchoon is <img>image_1</img>\n Kim is <img>image_2</img>\n In the image: <img>image_3</img>\n Please give a description.",
        "prediction": "Dukchoon is kneeling beside Kim, who appears to be in a firefighter's uniform crouching on the ground."
    },
    {
        "id": 19,
        "question": "Kim is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Kim with grounding",
        "prediction": "<ref>Kim</ref><box>(199,226),(505,996)</box>"
    },
    {
        "id": 20,
        "question": "Kim is <img>image_1</img>\n Kangrim is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is pushing the bamboo raft nearby?",
        "prediction": "Kim is pushing the bamboo raft."
    },
    {
        "id": 21,
        "question": "Dukchoon is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Dukchoon's expression?",
        "prediction": "Dukchoon looks concerned or thoughtful."
    },
    {
        "id": 22,
        "question": "Ron is <img>image_1</img>\n Brian is <img>image_2</img>\n Champ is <img>image_3</img>\n In the image: <img>image_4</img>\n Who is driving the car?",
        "prediction": "Ron is driving the car."
    },
    {
        "id": 23,
        "question": "Brian is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Brian with grounding.",
        "prediction": "<ref>Brian</ref><box>(625,300),(975,988)</box>"
    },
    {
        "id": 24,
        "question": "Ron is <img>image_1</img>\n Brick is <img>image_2</img>\n Explain the content of this image: <img>image_3</img>\n",
        "prediction": "Ron, on the left, is gesturing with his right hand while holding a can, and Brick, on the right, is sitting with his hands on his lap. They appear to be in a room with blinds partially closed in the background."
    },
    {
        "id": 25,
        "question": "Veronica is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Veronica with grounding.",
        "prediction": "<ref>Veronica</ref><box>(1,176),(317,979)</box>"
    },
    {
        "id": 26,
        "question": "Ron is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Ron doing?",
        "prediction": "Ron is sitting on a chair."
    },
    {
        "id": 27,
        "question": "Ron is <img>image_1</img>\n Veronica is <img>image_2</img>\n In the image: <img>image_3</img>\n Describe the image.",
        "prediction": "Veronica, wearing a yellow coat and pink hat, stands to the left with her hand on a child's shoulder. Ron, to the right, stands in a brown coat and scarf, looking straight ahead. They are all outdoors by the water."
    },
    {
        "id": 28,
        "question": "Bill is <img>image_1</img>\n Barbara is <img>image_2</img>\n Violet is <img>image_3</img>\n In the image: <img>image_4</img>\n Who is sitting at the middle of the table?",
        "prediction": "Bill is sitting at the middle of the table."
    },
    {
        "id": 29,
        "question": "Barbara is <img>image_1</img>\n Violet is <img>image_2</img>\n Mattie is <img>image_3</img>\n Explain the content of the image: <img>image_4</img>\n",
        "prediction": "Barbara stands to the left holding a book, facing towards Mattie, who sits in the center reading a document. Violet sits to the right, also looking at the document. They seem to be in a discussion."
    },
    {
        "id": 30,
        "question": "Barbara is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Barbara with grounding.",
        "prediction": "<ref>Barbara</ref><box>(665,90),(941,972)</box>"
    },
    {
        "id": 31,
        "question": "Barbara is <img>image_1</img>\n Mattie is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Mattie doing?",
        "prediction": "Mattie is sitting at a table with a glass in her hand."
    },
    {
        "id": 32,
        "question": "Bill is <img>image_1</img>\n Charlie is <img>image_2</img>\n Barbara is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Bill with grounding.",
        "prediction": "<ref>Bill</ref><box>(0,25),(353,990)</box>"
    },
    {
        "id": 33,
        "question": "Lynn is <img>image_1</img>\n Lynn's father is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Lynn's father with grounding.",
        "prediction": "<ref>Lynn's father</ref><box>(477,75),(723,733)</box>"
    },
    {
        "id": 34,
        "question": "Grace is <img>image_1</img>\n In the image: <img>image_2</img>\n Describe Grace's state.",
        "prediction": "Grace is a student wearing a school uniform."
    },
    {
        "id": 35,
        "question": "Bank is <img>image_1</img>\n Lynn is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Bank stands on the left with his arm resting on a desk, wearing a white shirt and carrying a bag. Lynn is to the right, also in a white shirt and carrying a smaller bag. They appear to be in a room with glass walls."
    },
    {
        "id": 36,
        "question": "Lynn is <img>image_1</img>\n Grace is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Lynn with grounding.",
        "prediction": "<ref>Lynn</ref><box>(432,45),(853,988)</box>"
    },
    {
        "id": 37,
        "question": "Tong is <img>image_1</img>\n In the image: <img>image_2</img>\n What color is Tong's clothes?",
        "prediction": "Tong is wearing a blue shirt."
    },
    {
        "id": 38,
        "question": "Grace is <img>image_1</img>\n Pat is <img>image_2</img>\n Please describe the image: <img>image_3</img>\n",
        "prediction": "Grace is sitting on a table looking away, while Pat is lounging with his arm extended, both seem to be in a workshop."
    },
    {
        "id": 39,
        "question": "Scott is <img>image_1</img>\n Elizabeth is <img>image_2</img>\n In the image: <img>image_3</img>\n What does Scott wear?",
        "prediction": "Scott wears a beige blazer with a checked tie."
    },
    {
        "id": 40,
        "question": "Amy is <img>image_1</img>\n Lynn is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Amy with grounding.",
        "prediction": "<ref>Amy</ref><box>(110,43),(522,947)</box>"
    },
    {
        "id": 41,
        "question": "Wally is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Wally with grounding.",
        "prediction": "<ref>Wally</ref><box>(140,294),(586,973)</box>"
    },
    {
        "id": 42,
        "question": "Bei Zhai is <img>image_1</img>\n Shen Lian is <img>image_2</img>\n In the image: <img>image_3</img>\n Describe the visual content.",
        "prediction": "Bei Zhai stands on the left, wearing a light blue outfit, partially shaded by a bamboo structure. Shen Lian is on the right, facing her, dressed in black with a traditional hat. They appear to be in a bamboo forest."
    },
    {
        "id": 43,
        "question": "Shen lian is <img>image_1</img>\n Lu Wenzhao is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Lu Wenzhao doing?",
        "prediction": "Lu Wenzhao is extending his hand towards Shen Lian."
    },
    {
        "id": 44,
        "question": "Bei Zhai is <img>image_1</img>\n Shen Lian is <img>image_2</img>\n In the image: <img>image_3</img>\n Describe the state of Bei Zhai.",
        "prediction": "Bei Zhai is seated and looking downward."
    },
    {
        "id": 45,
        "question": "Bei Zhai is <img>image_1</img>\n Shen Lian is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Shen Lian with grounding.",
        "prediction": "<ref>Shen Lian</ref><box>(735,29),(917,990)</box>"
    },
    {
        "id": 46,
        "question": "Bei Zhai is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Bei Zhai with grounding.",
        "prediction": "<ref>Bei Zhai</ref><box>(453,268),(755,992)</box>"
    },
    {
        "id": 47,
        "question": "Lu Wenzhao is <img>image_1</img>\n Ding Baiying is <img>image_2</img>\n Describe the image: <img>image_3</img>\n",
        "prediction": "Lu Wenzhao is seen holding a sword, standing closely behind Ding Baiying. They appear to be in a tense moment, surrounded by fallen soldiers in a battle scene."
    },
    {
        "id": 48,
        "question": "Haskell is <img>image_1</img>\n In the image: <img>image_2</img>\n What is he doing?",
        "prediction": "He is sitting at a dining table."
    },
    {
        "id": 49,
        "question": "Molyneux is <img>image_1</img>\n In the image: <img>image_2</img>\n Describe the content.",
        "prediction": "Molyneux is sitting in the driver's seat of a car, looking straight ahead. There's an elderly couple in the back seat, appearing to be in a conversation."
    },
    {
        "id": 50,
        "question": "Henry Goose is <img>image_1</img>\n Native Woman is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Henry Goose with grounding.",
        "prediction": "<ref>Henry Goose</ref><box>(110,13),(531,987)</box>"
    },
    {
        "id": 51,
        "question": "Molyneux is <img>image_1</img>\n In the image: <img>image_2</img>\n Describe the state of Molyneux.",
        "prediction": "Molyneux is seated and appears to be embracing someone."
    },
    {
        "id": 52,
        "question": "Mr. Griffith is <img>image_1</img>\n Mrs. Griffith is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Mr. Griffith doing?",
        "prediction": "Mr. Griffith is sitting and leaning forward with his hand on his cheek."
    },
    {
        "id": 53,
        "question": "Gibbons is <img>image_1</img>\n Describe the image: <img>image_2</img>\n",
        "prediction": "Gibbons, an elderly man in a suit and tie, is dancing with a young woman in a cheerleading outfit on a basketball court, surrounded by a crowd of onlookers."
    },
    {
        "id": 54,
        "question": "Micah is <img>image_1</img>\n In the image: <img>image_2</img>\n Describe the image.",
        "prediction": "Two young men are sitting together; one, dressed in a plaid shirt, is strumming a guitar, while the other, in a black shirt, is playing the acoustic guitar. They seem to be enjoying a casual outdoor gathering."
    },
    {
        "id": 55,
        "question": "Rhiannon is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Rhiannon with grounding.",
        "prediction": "<ref>Rhiannon</ref><box>(10,125),(414,997)</box>"
    },
    {
        "id": 56,
        "question": "Olive is <img>image_1</img>\n Rhiannon is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is carrying a handbag?",
        "prediction": "Olive is carrying a handbag."
    },
    {
        "id": 57,
        "question": "Brandon is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Brandon with grounding.",
        "prediction": "<ref>Brandon</ref><box>(111,34),(625,991)</box>"
    },
    {
        "id": 58,
        "question": "Walt is <img>image_1</img>\n Sue is <img>image_2</img>\n In the image: <img>image_3</img>\n Explain the content.",
        "prediction": "Walt, an older man, stands on a porch with a woman, Sue, to his left, touching his shoulder. Both are facing the camera."
    },
    {
        "id": 59,
        "question": "Karen is <img>image_1</img>\n Mitch is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Karen with grounding.",
        "prediction": "<ref>Karen</ref><box>(571,125),(966,988)</box>"
    },
    {
        "id": 60,
        "question": "Walt is <img>image_1</img>\n Thao is <img>image_2</img>\n In the image: <img>image_3</img>\n Please reason out what role of Thao is in this scene.",
        "prediction": "Thao is standing on the right, facing and seemingly conversing with Walt on the left."
    },
    {
        "id": 61,
        "question": "Mitch is <img>image_1</img>\n Walt is <img>image_2</img>\n In the image: <img>image_3</img>\n What color is Walt's clothes?",
        "prediction": "Walt's clothes are white."
    },
    {
        "id": 62,
        "question": "Thao is <img>image_1</img>\n Sue is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Sue with grounding.",
        "prediction": "<ref>Sue</ref><box>(714,108),(998,985)</box>"
    },
    {
        "id": 63,
        "question": "Harry is <img>image_1</img>\n Hermione is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Hermione is standing to the left, slightly behind Harry, who is in the center. Both are in a dimly lit room, looking towards the right."
    },
    {
        "id": 64,
        "question": "Hermione is <img>image_1</img>\n Ron is <img>image_2</img>\n In the image: <img>image_3</img>\n What is the spatial relation between Hermione and Ron?",
        "prediction": "Hermione is standing to the right of Ron."
    },
    {
        "id": 65,
        "question": "Harry is <img>image_1</img>\n Albus is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Albus with grounding.",
        "prediction": "<ref>Albus</ref><box>(587,43),(828,986)</box>"
    },
    {
        "id": 66,
        "question": "Hermione is <img>image_1</img>\n Harry is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Hermione with grounding.",
        "prediction": "<ref>Hermione</ref><box>(285,15),(757,986)</box>"
    },
    {
        "id": 67,
        "question": "Ron is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Ron with grounding.",
        "prediction": "<ref>Ron</ref><box>(6,159),(273,995)</box>"
    },
    {
        "id": 68,
        "question": "Ariadne is <img>image_1</img>\n Mal is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Mal with grounding.",
        "prediction": "<ref>Mal</ref><box>(713,29),(896,985)</box>"
    },
    {
        "id": 69,
        "question": "Eames is <img>image_1</img>\n Cobb is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Cobb with grounding.",
        "prediction": "<ref>Cobb</ref><box>(63,230),(408,995)</box>"
    },
    {
        "id": 70,
        "question": "Ariadne is <img>image_1</img>\n Yusuf is <img>image_2</img>\n In the image: <img>image_3</img>\n Please describe the content.",
        "prediction": "Ariadne is lying on a chair with her arm resting on a table, looking off to the side. Yusuf is reclining on another chair with his arm extended, appearing to be in a relaxed state. They are in a room with a dark, moody atmosphere."
    },
    {
        "id": 71,
        "question": "Ariadne is <img>image_1</img>\n In the image: <img>image_2</img>\n What is she doing?",
        "prediction": "Ariadne appears to be walking alongside a man."
    },
    {
        "id": 72,
        "question": "Cobb is <img>image_1</img>\n Ariadne is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is walking in front?",
        "prediction": "Cobb is walking in front."
    },
    {
        "id": 73,
        "question": "Cobb is <img>image_1</img>\n Saito is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Cobb is seated on a train, looking intently at Saito, who is reclined in a seat facing away from the camera. Saito has his hand on Cobb's shoulder."
    },
    {
        "id": 74,
        "question": "Tony is <img>image_1</img>\n Pepper is <img>image_2</img>\n Natalie is <img>image_3</img>\n In the image: <img>image_4</img>\n Who is the person standing?",
        "prediction": "Pepper Potts is the person standing."
    },
    {
        "id": 75,
        "question": "Agent is <img>image_1</img>\n Justin is <img>image_2</img>\n Please provide an overview of the visual information in this image: <img>image_3</img>\n",
        "prediction": "Agent is standing on the left, wearing a black suit and tie, carrying a suitcase. Justin is on the right, also in a black suit, standing beside the Agent. They are in a factory setting with machinery in the background."
    },
    {
        "id": 76,
        "question": "Nick is <img>image_1</img>\n Tony is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Nick with grounding.",
        "prediction": "<ref>Nick</ref><box>(268,18),(569,797)</box>"
    },
    {
        "id": 77,
        "question": "Tony is <img>image_1</img>\n Justin is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Justin with grounding.",
        "prediction": "<ref>Justin</ref><box>(460,106),(698,995)</box>"
    },
    {
        "id": 78,
        "question": "Pepper is <img>image_1</img>\n Natalie is <img>image_2</img>\n In the image: <img>image_3</img>\n Whose clothes color is gray?",
        "prediction": "Pepper is wearing a gray outfit."
    },
    {
        "id": 79,
        "question": "Tony is <img>image_1</img>\n In the image: <img>image_2</img>\n Give a description.",
        "prediction": "Tony, wearing a suit and tie, is sitting at a table with his hands clasped in front of him. There are other individuals seated around the table, appearing to be engaged in a meeting or a public hearing."
    },
    {
        "id": 80,
        "question": "Mrs. Fairfax is <img>image_1</img>\n Jane is <img>image_2</img>\n In the image: <img>image_3</img>\n What color is Jane's dress?",
        "prediction": "Jane's dress is blue."
    },
    {
        "id": 81,
        "question": "Rochester is <img>image_1</img>\n Jane is <img>image_2</img>\n In the image: <img>image_3</img>\n Describe Rochester's clothes.",
        "prediction": "Rochester is wearing a dark coat with a waistcoat."
    },
    {
        "id": 82,
        "question": "Jane is <img>image_1</img>\n Varens is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Varens with grounding.",
        "prediction": "<ref>Varens</ref><box>(578,175),(928,997)</box>"
    },
    {
        "id": 83,
        "question": "Jane is <img>image_1</img>\n Mrs. Fairfax is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Mrs. Fairfax with grounding.",
        "prediction": "<ref>Mrs. Fairfax</ref><box>(575,119),(998,995)</box>"
    },
    {
        "id": 84,
        "question": "Bai Letian is <img>image_1</img>\n Kukai is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Bai Letian is seated to the left, looking towards Kukai, who is seated to the right holding a paper. Both are against a brick wall backdrop, sitting on what appears to be a bench."
    },
    {
        "id": 85,
        "question": "Bai Letian is <img>image_1</img>\n Kukai is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Bai Letian with grounding.",
        "prediction": "<ref>Bai Letian</ref><box>(510,230),(719,997)</box>"
    },
    {
        "id": 86,
        "question": "Bai Letian is <img>image_1</img>\n Kukai is <img>image_2</img>\n In the image: <img>image_3</img>\n Please provide an overview of the visual information.",
        "prediction": "Bai Letian is seated on the left, gesturing with his right hand, while Kukai is seated on the right with his hands on the table. There's a standing individual in the middle, between Bai and Kukai."
    },
    {
        "id": 87,
        "question": "Gennady is <img>image_1</img>\n Eddie is <img>image_2</img>\n In the image: <img>image_3</img>\n Please provide an image caption.",
        "prediction": "Gennady is seated to the right on a bench, looking down at an object in his hands, while Eddie sits to the left facing him. They are in a park setting with trees and a bench behind them."
    },
    {
        "id": 88,
        "question": "Eddie is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Eddie with grounding.",
        "prediction": "<ref>Eddie</ref><box>(586,168),(935,996)</box>"
    },
    {
        "id": 89,
        "question": "Lindy is <img>image_1</img>\n Eddie is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Lindy is holding a newspaper with the headline \" Poll Vault, \" facing to her right, while Eddie, wearing a black leather jacket, is turned towards his right with his hands in his pockets. They appear to be on a city street with shops in the background."
    },
    {
        "id": 90,
        "question": "Gennady is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Gennady doing?",
        "prediction": "Gennady is walking towards the camera with his back turned."
    },
    {
        "id": 91,
        "question": "Eddie is <img>image_1</img>\n Carl is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Carl with grounding.",
        "prediction": "<ref>Carl</ref><box>(63,211),(309,995)</box>"
    },
    {
        "id": 92,
        "question": "Old Joe is <img>image_1</img>\n Joe is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is on the left side of the table?",
        "prediction": "Old Joe is on the left side of the table."
    },
    {
        "id": 93,
        "question": "Sara is <img>image_1</img>\n Cid is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Cid with grounding.",
        "prediction": "<ref>Cid</ref><box>(254,409),(522,992)</box>"
    },
    {
        "id": 94,
        "question": "Ye Jin is <img>image_1</img>\n Vivian is <img>image_2</img>\n In the image: <img>image_3</img>\n Who just got out of the car?",
        "prediction": "Ye Jin just got out of the car."
    },
    {
        "id": 95,
        "question": "Vivian is <img>image_1</img>\n Ye Jin is <img>image_2</img>\n In the image: <img>image_3</img>\n What does Vivian wear?",
        "prediction": "Vivian wears a black and white striped blazer with a black top and skirt."
    },
    {
        "id": 96,
        "question": "Vivian is <img>image_1</img>\n Ye Jin is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Vivian with grounding.",
        "prediction": "<ref>Vivian</ref><box>(417,197),(696,996)</box>"
    },
    {
        "id": 97,
        "question": "Vivian is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Vivian doing?",
        "prediction": "Vivian is holding up a phone."
    },
    {
        "id": 98,
        "question": "Ye Jin is <img>image_1</img>\n Please describe the image: <img>image_2</img>\n",
        "prediction": "Ye Jin in a white suit walks away from the camera, carrying a bouquet of flowers. A man in a grey suit follows behind her. They are in a bright, modern living room with white furniture and a red chair."
    },
    {
        "id": 99,
        "question": "Boris' Girlfriend is <img>image_1</img>\n Boris is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is carrying the cake?",
        "prediction": "Boris' girlfriend is carrying the cake."
    },
    {
        "id": 100,
        "question": "Agent K is <img>image_1</img>\n Agent J is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Agent K with grounding.",
        "prediction": "<ref>Agent K</ref><box>(586,126),(950,732)</box>"
    },
    {
        "id": 101,
        "question": "Agent J is <img>image_1</img>\n Agent O is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Agent O doing?",
        "prediction": "Agent O is standing and facing Agent J."
    },
    {
        "id": 102,
        "question": "Young Agent K is <img>image_1</img>\n Young Agent O is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Young Agent K is seated to the left, wearing a black suit, while Young Agent O stands to the right, dressed in a black dress with a white collar. They are in a modern office setting with other agents and office elements in the background."
    },
    {
        "id": 103,
        "question": "Young Agent K is <img>image_1</img>\n Agent J is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Agent J with grounding.",
        "prediction": "<ref>Agent J</ref><box>(646,52),(905,981)</box>"
    },
    {
        "id": 104,
        "question": "Agent J is <img>image_1</img>\n Young Agent K is <img>image_2</img>\n Griffin is <img>image_3</img>\n In the image: <img>image_4</img>\n Please provide a description.",
        "prediction": "Agent J is standing on the left in a formal suit, facing Young Agent K who is on the right in a similar suit. Griffin, wearing a green jacket and a red knit hat, is serving food in the center. They are in a room with other guests at a party."
    },
    {
        "id": 105,
        "question": "Boris is <img>image_1</img>\n Griffin is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Griffin with grounding.",
        "prediction": "<ref>Griffin</ref><box>(509,231),(983,986)</box>"
    },
    {
        "id": 106,
        "question": "Billy Beane is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Billy Beane with grounding.",
        "prediction": "<ref>Billy Beane</ref><box>(480,44),(848,913)</box>"
    },
    {
        "id": 107,
        "question": "Peter Brand is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Peter Brand with grounding.",
        "prediction": "<ref>Peter Brand</ref><box>(10,37),(592,991)</box>"
    },
    {
        "id": 108,
        "question": "Billy is <img>image_1</img>\n Peter is <img>image_2</img>\n In the image: <img>image_3</img>\n Please give a description.",
        "prediction": "Peter is seated to the left, resting his chin on his hand, facing slightly right. Billy is sitting to the right, with one hand on a desk and the other near his face, in a thoughtful pose. A poster on the wall reads \"CLASH\". There's a desk with various items, including a computer, books, and a coffee cup."
    },
    {
        "id": 109,
        "question": "Sharp is <img>image_1</img>\n Scout is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Scout with grounding.",
        "prediction": "<ref>Scout</ref><box>(542,191),(832,843)</box>"
    },
    {
        "id": 110,
        "question": "Sam is <img>image_1</img>\n Suzy is <img>image_2</img>\n Please describe the image: <img>image_3</img>\n",
        "prediction": "Sam is sitting outside in a scout uniform, holding a frying pan, while Suzy sits beside her, partially visible, with a camera in her lap. There's a campsite setup with a tent and decorations in the background."
    },
    {
        "id": 111,
        "question": "Suzy is <img>image_1</img>\n Sam is <img>image_2</img>\n In the image: <img>image_3</img>\n Describe Suzy's state.",
        "prediction": "Suzy is sitting on the ground, looking at something in her hands."
    },
    {
        "id": 112,
        "question": "Sharp is <img>image_1</img>\n Scout is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Sharp with grounding.",
        "prediction": "<ref>Sharp</ref><box>(665,79),(932,992)</box>"
    },
    {
        "id": 113,
        "question": "Sam is <img>image_1</img>\n Suzy is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Suzy with grounding.",
        "prediction": "<ref>Suzy</ref><box>(137,248),(400,993)</box>"
    },
    {
        "id": 114,
        "question": "Sam is <img>image_1</img>\n Suzy is <img>image_2</img>\n In the image: <img>image_3</img>\n Describe the visual content.",
        "prediction": "Sam is sitting on the left, wearing a yellow scout uniform with badges, looking towards the right. Suzy, also in a yellow scout uniform, is seated on the right, with her hands clasped in front of her. They appear to be in a room with other scouts and a leader gesturing with one hand."
    },
    {
        "id": 115,
        "question": "Sharp is <img>image_1</img>\n Scout is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Sharp doing?",
        "prediction": "Sharp is wiping his face with a handkerchief."
    },
    {
        "id": 116,
        "question": "Social Services is <img>image_1</img>\n Scout is <img>image_2</img>\n Sharp is <img>image_3</img>\n In the image: <img>image_4</img>\n Give a description.",
        "prediction": "Social Services, holding a book, is gesturing while talking to Scout, who is standing with his hands by his sides. Sharp is looking towards their direction, partially visible on the right side. There are other people in the background."
    },
    {
        "id": 117,
        "question": "Chigusa is <img>image_1</img>\n Shimada is <img>image_2</img>\n Kawai is <img>image_3</img>\n In the image: <img>image_4</img>\n Please give a description.",
        "prediction": "Chigusa is sitting on the left, wearing a school uniform with a bow tie. Shimada is seated in the center, also in a school uniform, slightly turned towards Chigusa. Kawai is on the right, leaning forward while holding a piece of paper."
    },
    {
        "id": 118,
        "question": "Shimada is <img>image_1</img>\n Chigusa is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Chigusa with grounding.",
        "prediction": "<ref>Chigusa</ref><box>(391,202),(602,998)</box>"
    },
    {
        "id": 119,
        "question": "Chigusa is <img>image_1</img>\n In the image: <img>image_2</img>\n What is her emotion?",
        "prediction": "Chigusa is surprised."
    },
    {
        "id": 120,
        "question": "Kosaku is <img>image_1</img>\n Shimada is <img>image_2</img>\n For the image: <img>image_3</img>\n Give a description.",
        "prediction": "Kosaku stands to the right, extending a drink to Shimada who is seated to the left, both are in a dimly lit waiting area with red seating."
    },
    {
        "id": 121,
        "question": "Kosuke is <img>image_1</img>\n Sachiko is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Sachiko with grounding.",
        "prediction": "<ref>Sachiko</ref><box>(546,562),(757,997)</box>"
    },
    {
        "id": 122,
        "question": "Chigusa is <img>image_1</img>\n Kosuke is <img>image_2</img>\n Shimada is <img>image_3</img>\n In the image: <img>image_4</img>\n Who is raising hand?",
        "prediction": "Shimada is raising hand."
    },
    {
        "id": 123,
        "question": "Sachiko is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Sachiko with grounding.",
        "prediction": "<ref>Sachiko</ref><box>(578,150),(842,996)</box>"
    },
    {
        "id": 124,
        "question": "Chigusa is <img>image_1</img>\n Shimada is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Shimada doing?",
        "prediction": "Shimada is writing in a notebook."
    },
    {
        "id": 125,
        "question": "Kosaku is <img>image_1</img>\n Shimada is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Kosaku, a man in a suit, stands in a crowded train car, holding onto a pole. Next to him, Shimada, wearing a school uniform, looks downward. Other passengers are visible in the background."
    },
    {
        "id": 126,
        "question": "Kosaku is <img>image_1</img>\n Shimada is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Kosaku doing?",
        "prediction": "Kosaku is sitting and reading a book on a bench."
    },
    {
        "id": 127,
        "question": "Kosaku is <img>image_1</img>\n Shimada is <img>image_2</img>\n Please describe the image: <img>image_3</img>\n",
        "prediction": "Kosaku is seated on a bench, looking down, while Shimada stands behind him, appearing to adjust his hair or clothing."
    },
    {
        "id": 128,
        "question": "Shimada is <img>image_1</img>\n Kosaku is <img>image_2</img>\n In the image: <img>image_3</img>\n Give an image caption.",
        "prediction": "Shimada, wearing a school uniform with a camera, stands under an umbrella next to Kosaku, dressed in a suit. It's raining outside."
    },
    {
        "id": 129,
        "question": "Shimada is <img>image_1</img>\n Yusuke is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Shimada with grounding.",
        "prediction": "<ref>Shimada</ref><box>(106,12),(520,981)</box>"
    },
    {
        "id": 130,
        "question": "Kosaku Ito is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Kosaku Ito with grounding.",
        "prediction": "<ref>Kosaku Ito</ref><box>(34,179),(450,996)</box>"
    },
    {
        "id": 131,
        "question": "Hibiki Shimada is <img>image_1</img>\n Megumi Chigusa is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is holding a cylinder object?",
        "prediction": "Megumi Chigusa."
    },
    {
        "id": 132,
        "question": "Kosuke is <img>image_1</img>\n Shimada is <img>image_2</img>\n Describe the image: <img>image_3</img>\n",
        "prediction": "Kosuke, in a black school uniform with a red flower on his lapel, is walking alongside Shimada, who is wearing a similar uniform with a red bow tie, under cherry blossom trees in full bloom."
    },
    {
        "id": 133,
        "question": "Kosaku Ito is <img>image_1</img>\n Hibiki Shimada is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Kosaku Ito, dressed in a suit, stands facing Hibiki Shimada, who is in a school uniform with a bouquet of flowers in hand, on a sunny street with a car and houses in the background."
    },
    {
        "id": 134,
        "question": "Emma is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Emma with grounding.",
        "prediction": "<ref>Emma</ref><box>(383,41),(785,988)</box>"
    },
    {
        "id": 135,
        "question": "Adam is <img>image_1</img>\n Alvin is <img>image_2</img>\n In the image: <img>image_3</img>\n Is Adam standing?",
        "prediction": "No, Adam is sitting."
    },
    {
        "id": 136,
        "question": "Emma is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Emma with grounding.",
        "prediction": "<ref>Emma</ref><box>(279,226),(497,996)</box>"
    },
    {
        "id": 137,
        "question": "Adam is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Adam doing?",
        "prediction": "Adam is standing and appears to be in motion."
    },
    {
        "id": 138,
        "question": "Emma is <img>image_1</img>\n Adam is <img>image_2</img>\n Please describe the image: <img>image_3</img>\n",
        "prediction": "Emma, wearing a doctor's coat, is standing to the left, while Adam is lying in a hospital bed to the right, with his arm resting on a pillow. There's medical equipment in the background."
    },
    {
        "id": 139,
        "question": "Alvin is <img>image_1</img>\n Lucy is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Alvin, wearing a suit, sits at a table with a drink in his hand. Lucy, in a dress, is seated across from him, also with a drink. They appear to be engaged in a conversation at a bustling restaurant or bar setting."
    },
    {
        "id": 140,
        "question": "Gao Jian Xiang is <img>image_1</img>\n Liu Hao is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Gao Jian Xiang with grounding.",
        "prediction": "<ref>Gao Jian Xiang</ref><box>(220,87),(562,863)</box>"
    },
    {
        "id": 141,
        "question": "Chen is <img>image_1</img>\n Gao is <img>image_2</img>\n In the image: <img>image_3</img>\n Describe the scene.",
        "prediction": "Chen is being shoved by Gao in a tense confrontation outside, with another person watching."
    },
    {
        "id": 142,
        "question": "Gao is <img>image_1</img>\n Lin is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Lin with grounding.",
        "prediction": "<ref>Lin</ref><box>(26,99),(317,998)</box>"
    },
    {
        "id": 143,
        "question": "Chen is <img>image_1</img>\n Gao is <img>image_2</img>\n In the image: <img>image_3</img>\n What is the expression of Chen?",
        "prediction": "Chen looks surprised or in distress."
    },
    {
        "id": 144,
        "question": "Jack is <img>image_1</img>\n Gibbs is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Jack with grounding.",
        "prediction": "<ref>Jack</ref><box>(175,72),(533,985)</box>"
    },
    {
        "id": 145,
        "question": "Jack Sparrow is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Jack with grounding.",
        "prediction": "<ref>Jack</ref><box>(347,19),(723,987)</box>"
    },
    {
        "id": 146,
        "question": "Barbossa is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Barbossa with grounding.",
        "prediction": "<ref>Barbossa</ref><box>(612,137),(810,995)</box>"
    },
    {
        "id": 147,
        "question": "Barbossa is <img>image_1</img>\n In the image: <img>image_2</img>\n Please give a description.",
        "prediction": "Barbossa, dressed in elaborate pirate attire, stands on a ship with a sword in hand, while two other pirates stand beside him."
    },
    {
        "id": 148,
        "question": "Angelica is <img>image_1</img>\n Jack is <img>image_2</img>\n Describe the image: <img>image_3</img>\n",
        "prediction": "Angelica, wearing a dark hat, is extending her hand towards Jack, who is shirtless and appears to be in the jungle. There are other indistinct figures in the background."
    },
    {
        "id": 149,
        "question": "Syrena is <img>image_1</img>\n Philip is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Syrena is sitting on the lap of Philip, who is shirtless among dense foliage. There is another person visible in the background."
    },
    {
        "id": 150,
        "question": "Angelica is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Angelica with grounding.",
        "prediction": "<ref>Angelica</ref><box>(78,52),(514,988)</box>"
    },
    {
        "id": 151,
        "question": "Elizabeth is <img>image_1</img>\n William is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Elizabeth is sitting on the steps, looking down at a book. William is sitting beside her, gazing towards the left side of the image. They appear to be in a contemplative mood."
    },
    {
        "id": 152,
        "question": "William is <img>image_1</img>\n Elizabeth is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Elizabeth with grounding.",
        "prediction": "<ref>Elizabeth</ref><box>(37,69),(331,988)</box>"
    },
    {
        "id": 153,
        "question": "Brant is <img>image_1</img>\n Olive is <img>image_2</img>\n In the image: <img>image_3</img>\n What color is Olive's hat?",
        "prediction": "Olive's hat is brown."
    },
    {
        "id": 154,
        "question": "Elizabeth is <img>image_1</img>\n Olive is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Elizabeth with grounding.",
        "prediction": "<ref>Elizabeth</ref><box>(607,67),(886,983)</box>"
    },
    {
        "id": 155,
        "question": "Elizabeth is <img>image_1</img>\n William is <img>image_2</img>\n In the image: <img>image_3</img>\n Describe William's state.",
        "prediction": "William is standing with his hands on his hips."
    },
    {
        "id": 156,
        "question": "William is <img>image_1</img>\n Elizabeth is <img>image_2</img>\n In the image: <img>image_3</img>\n Is William sitting or lying down?",
        "prediction": "William is lying down."
    },
    {
        "id": 157,
        "question": "Robert is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Robert with grounding.",
        "prediction": "<ref>Robert</ref><box>(578,65),(895,976)</box>"
    },
    {
        "id": 158,
        "question": "Charles Rodman is <img>image_1</img>\n Will Rodman is <img>image_2</img>\n In the image: <img>image_3</img>\n Explain the visual content.",
        "prediction": "Charles Rodman is standing and feeding a chimpanzee with a bottle, while Will Rodman is sitting at a table, looking at a book and having a drink."
    },
    {
        "id": 159,
        "question": "Will Rodman is <img>image_1</img>\n Caesar is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is sitting in the back seat of the car?",
        "prediction": "Caesar is sitting in the back seat of the car."
    },
    {
        "id": 160,
        "question": "Will Rodman is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Will Rodman with grounding.",
        "prediction": "<ref>Will Rodman</ref><box>(381,16),(635,987)</box>"
    },
    {
        "id": 161,
        "question": "David Norris is <img>image_1</img>\n Elise Sellas is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is sitting by the window?",
        "prediction": "Elise Sellas is sitting by the window."
    },
    {
        "id": 162,
        "question": "Charlie Traynor is <img>image_1</img>\n David Norris is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Charlie Traynor with grounding.",
        "prediction": "<ref>Charlie Traynor</ref><box>(97,85),(476,988)</box>"
    },
    {
        "id": 163,
        "question": "Charlie is <img>image_1</img>\n David is <img>image_2</img>\n Elise is <img>image_3</img>\n In the image: <img>image_4</img>\n Give a description.",
        "prediction": "David is seated at a table with food and drinks, facing Charlie who stands in a suit with a red tie. Elise is seated across from David, engaged in conversation with him. They are in an outdoor dining setting with other patrons in the background."
    },
    {
        "id": 164,
        "question": "David Norris is <img>image_1</img>\n Jon Stewart is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Jon Stewart with grounding.",
        "prediction": "<ref>Jon Stewart</ref><box>(171,61),(446,987)</box>"
    },
    {
        "id": 165,
        "question": "Bloom is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Bloom with grounding.",
        "prediction": "<ref>Bloom</ref><box>(625,69),(998,989)</box>"
    },
    {
        "id": 166,
        "question": "Romeo is <img>image_1</img>\n Murphy is <img>image_2</img>\n Please describe the image: <img>image_3</img>\n",
        "prediction": "Murphy, on the left, is holding a gun with both hands in a defensive stance, while Romeo stands on the right, dressed in a white chef's uniform, looking towards Murphy. They appear to be in a dimly lit kitchen or bar area."
    },
    {
        "id": 167,
        "question": "Romeo is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Romeo doing?",
        "prediction": "Romeo is drinking at the bar."
    },
    {
        "id": 168,
        "question": "Connor is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Connor doing?",
        "prediction": "Connor is sitting and holding a card."
    },
    {
        "id": 169,
        "question": "Romeo is <img>image_1</img>\n In the image: <img>image_2</img>\n Give a description.",
        "prediction": "Romeo, holding a gun, stands in a room next to a bed. Another person is partially visible on the right side, facing Romeo."
    },
    {
        "id": 170,
        "question": "Poppa is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Poppa with grounding.",
        "prediction": "<ref>Poppa</ref><box>(285,59),(546,981)</box>"
    },
    {
        "id": 171,
        "question": "Katsuro Matsuoka is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Katsuro Matsuoka with grounding.",
        "prediction": "<ref>Katsuro Matsuoka</ref><box>(300,170),(532,998)</box>"
    },
    {
        "id": 172,
        "question": "Zhu Bajie is <img>image_1</img>\n Tangseng is <img>image_2</img>\n Shaseng is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Tangseng with grounding.",
        "prediction": "<ref>Tangseng</ref><box>(25,247),(386,986)</box>"
    },
    {
        "id": 173,
        "question": "Shaseng is <img>image_1</img>\n Tangseng is <img>image_2</img>\n Sun Wukong is <img>image_3</img>\n In the image: <img>image_4</img>\n Give a description.",
        "prediction": "Sun Wukong, wearing a red robe, stands on the left with a sword. Shaseng is in the center, draped in white, with a sword and shield. Tangseng is on the right, armored blue, holding a spear. They are in a forest setting."
    },
    {
        "id": 174,
        "question": "Shaseng is <img>image_1</img>\n Tangseng is <img>image_2</img>\n Sun Wukong is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Sun Wukong with grounding.",
        "prediction": "<ref>Sun Wukong</ref><box>(409,280),(767,987)</box>"
    },
    {
        "id": 175,
        "question": "Shary is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Shary with grounding.",
        "prediction": "<ref>Shary</ref><box>(7,481),(276,998)</box>"
    },
    {
        "id": 176,
        "question": "Shaseng is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Shaseng doing?",
        "prediction": "Shaseng is standing and seems to be in a conversation."
    },
    {
        "id": 177,
        "question": "Shaseng is <img>image_1</img>\n Zhu Bajie is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is lying down?",
        "prediction": "Zhu Bajie is lying down."
    },
    {
        "id": 178,
        "question": "Benjam is <img>image_1</img>\n Irene is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Benjam with grounding.",
        "prediction": "<ref>Benjam</ref><box>(167,50),(535,972)</box>"
    },
    {
        "id": 179,
        "question": "Irene is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Irene doing?",
        "prediction": "Irene is standing and holding some papers."
    },
    {
        "id": 180,
        "question": "Isidoro is <img>image_1</img>\n Benjam is <img>image_2</img>\n Irene is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Isidoro with grounding.",
        "prediction": "<ref>Isidoro</ref><box>(12,65),(468,981)</box>"
    },
    {
        "id": 181,
        "question": "Sam Witwicky is <img>image_1</img>\n Bruce Brazos is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Bruce Brazos going to do?",
        "prediction": "Bruce Brazos is going to shake Sam Witwicky's hand."
    },
    {
        "id": 182,
        "question": "Sam Witwicky is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Sam Witwicky with grounding.",
        "prediction": "<ref>Sam Witwicky</ref><box>(1,0),(392,983)</box>"
    },
    {
        "id": 183,
        "question": "Lennox is <img>image_1</img>\n In the image: <img>image_2</img>\n Is Lennox the person on the left or the one on the right?",
        "prediction": "Lennox is the one on the right."
    },
    {
        "id": 184,
        "question": "Sam is <img>image_1</img>\n Carly is <img>image_2</img>\n In the image: <img>image_3</img>\n What color are Sam's clothes?",
        "prediction": "Sam's clothes are light blue."
    },
    {
        "id": 185,
        "question": "Dylan is <img>image_1</img>\n Carly is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Carly with grounding.",
        "prediction": "<ref>Carly</ref><box>(119,52),(381,887)</box>"
    },
    {
        "id": 186,
        "question": "Dylan is <img>image_1</img>\n Carly is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Dylan with grounding.",
        "prediction": "<ref>Dylan</ref><box>(467,137),(723,983)</box>"
    },
    {
        "id": 187,
        "question": "Rooster is <img>image_1</img>\n LaBoeuf is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Rooster doing?",
        "prediction": "Rooster is lighting a cigarette."
    },
    {
        "id": 188,
        "question": "Mattie Ross is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Mattie Ross with grounding.",
        "prediction": "<ref>Mattie Ross</ref><box>(179,125),(414,988)</box>"
    },
    {
        "id": 189,
        "question": "Mattie Ross is <img>image_1</img>\n LaBoeuf is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Mattie Ross, wearing a dark coat and hat, is standing in a forest, holding a metal cup. LaBoeuf is sitting beside her, also dressed in dark attire, with a hat on. They both seem to be in a tense conversation at night."
    },
    {
        "id": 190,
        "question": "Mattie Ross is <img>image_1</img>\n LaBoeuf is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is hiding beside a rock?",
        "prediction": "LaBoeuf is hiding beside a rock."
    },
    {
        "id": 191,
        "question": "Qin is <img>image_1</img>\n Lord Ximen is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Qin with grounding.",
        "prediction": "<ref>Qin</ref><box>(481,16),(906,988)</box>"
    },
    {
        "id": 192,
        "question": "Qin is <img>image_1</img>\n In the image: <img>image_2</img>\n Describe Qin's state.",
        "prediction": "Qin is standing in a relaxed pose with an arm extended out."
    },
    {
        "id": 193,
        "question": "Lord is <img>image_1</img>\n In the image: <img>image_2</img>\n What color are Lord's clothes?",
        "prediction": "Lord's clothes are red."
    },
    {
        "id": 194,
        "question": "Lord Ximen is <img>image_1</img>\n Ho Sum-on is <img>image_2</img>\n In the image: <img>image_3</img>\n What color are Ho Sum-on's clothes?",
        "prediction": "Ho Sum-on's clothes are yellow."
    },
    {
        "id": 195,
        "question": "Qin is <img>image_1</img>\n Big Brother is <img>image_2</img>\n Please describe the image: <img>image_3</img>\n",
        "prediction": "Big Brother, wearing a pink shirt and gray vest, is sitting on the left side, gesturing while speaking. Qin, dressed in a black suit, is seated on the right, listening attentively. They are surrounded by a variety of books and are in a room with a view of trees outside."
    },
    {
        "id": 196,
        "question": "Qin is <img>image_1</img>\n SoSo is <img>image_2</img>\n In the image: <img>image_3</img>\n What does SoSo wear?",
        "prediction": "SoSo wears a yellow coat."
    },
    {
        "id": 197,
        "question": "Qin Kwai is <img>image_1</img>\n Qin is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is waving hand?",
        "prediction": "Qin Kwai is waving hand."
    },
    {
        "id": 198,
        "question": "Qin is <img>image_1</img>\n Qin Kwai is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Qin Kwai with grounding.",
        "prediction": "<ref>Qin Kwai</ref><box>(602,219),(991,996)</box>"
    },
    {
        "id": 199,
        "question": "Big Brother is <img>image_1</img>\n In the image: <img>image_2</img>\n Give a description.",
        "prediction": "Big Brother, wearing a light blue shirt, is embracing a young boy in a blue shirt with checkered pants. They are in a room with traditional Chinese decor. In the background, two other individuals can be seen."
    },
    {
        "id": 200,
        "question": "Qin is <img>image_1</img>\n SoSo is <img>image_2</img>\n Provide a description for the image: <img>image_3</img>\n",
        "prediction": "Qin is on the right, wearing a white shirt and pointing to the left. SoSo is on the left, dressed in a white tank top, looking towards Qin's direction. They are surrounded by blue and green balloons."
    },
    {
        "id": 201,
        "question": "Dr. Martin Harris is <img>image_1</img>\n Herr Strauss is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Herr Strauss with grounding.",
        "prediction": "<ref>Herr Strauss</ref><box>(602,84),(889,994)</box>"
    },
    {
        "id": 202,
        "question": "Herr Strauss is <img>image_1</img>\n Dr. Martin Harris is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is leaning against the table?",
        "prediction": "Dr. Martin Harris is leaning against the table."
    },
    {
        "id": 203,
        "question": "Martin is <img>image_1</img>\n Gina is <img>image_2</img>\n In the image: <img>image_3</img>\n Does Martin wear a hat?",
        "prediction": "No, Martin does not wear a hat."
    },
    {
        "id": 204,
        "question": "Martin is <img>image_1</img>\n Ernst is <img>image_2</img>\n Gina is <img>image_3</img>\n In the image: <img>image_4</img>\n Give a description.",
        "prediction": "Martin is standing on the left, facing Ernst who is on the right. In front of them is Gina, standing slightly angled towards Ernst. They appear to be having a conversation on a bridge with a historic building in the background."
    },
    {
        "id": 205,
        "question": "Martin is <img>image_1</img>\n Elizabeth is <img>image_2</img>\n In the image: <img>image_3</img>\n What is on Elizabeth's neck?",
        "prediction": "Elizabeth is wearing a pearl necklace."
    },
    {
        "id": 206,
        "question": "Professor Bressler is <img>image_1</img>\n Provide a description for the image: <img>image_2</img>\n",
        "prediction": "Professor Bressler is embracing a girl in a corridor. There are blurred people in the background."
    },
    {
        "id": 207,
        "question": "Natalie is <img>image_1</img>\n Craig is <img>image_2</img>\n In the image: <img>image_3</img>\n Is Natalie the person standing closer to Craig or farther away?",
        "prediction": "Natalie is standing closer to Craig."
    },
    {
        "id": 208,
        "question": "Craig is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Craig with grounding.",
        "prediction": "<ref>Craig</ref><box>(497,43),(885,908)</box>"
    },
    {
        "id": 209,
        "question": "Ryan is <img>image_1</img>\n Natalie is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Ryan, in a dark suit, stands holding a jacket in an airport terminal next to Natalie, who is also in a suit with a coat in her hand. They appear to be waiting or lining up."
    },
    {
        "id": 210,
        "question": "Julie Bingham is <img>image_1</img>\n Kara Bingham is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Julie Bingham with grounding.",
        "prediction": "<ref>Julie Bingham</ref><box>(5,126),(407,991)</box>"
    },
    {
        "id": 211,
        "question": "Jim is <img>image_1</img>\n Julie is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Jim is on the right, wearing a black suit with a red vest and tie, standing next to Julie who is in a white bridal dress holding a bouquet. There are other people in the background."
    },
    {
        "id": 212,
        "question": "Ryan Bingham is <img>image_1</img>\n Alex Goran is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Alex Goran doing?",
        "prediction": "Alex Goran is holding a champagne glass."
    },
    {
        "id": 213,
        "question": "Brendan Conlon is <img>image_1</img>\n Frank Campana is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Frank Campana doing?",
        "prediction": "Frank Campana is pointing towards Brendan Conlon."
    },
    {
        "id": 214,
        "question": "Paddy Conlon is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Paddy Conlon with grounding.",
        "prediction": "<ref>Paddy Conlon</ref><box>(189,40),(617,990)</box>"
    },
    {
        "id": 215,
        "question": "Principal Zito is <img>image_1</img>\n In the image: <img>image_2</img>\n What does Principal Zito wear?",
        "prediction": "Principal Zito is wearing a striped shirt."
    },
    {
        "id": 216,
        "question": "Fenroy is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Fenroy with grounding.",
        "prediction": "<ref>Fenroy</ref><box>(546,198),(906,1016)</box>"
    },
    {
        "id": 217,
        "question": "Juan Lian is <img>image_1</img>\n Zi Xia is <img>image_2</img>\n Please describe the image: <img>image_3</img>\n",
        "prediction": "Juan Lian is on the left, wearing a white robe with a black chest emblem, his right hand covering his mouth. Zi Xia is on the right, dressed in white with a side braid, looking towards the left. They appear to be in a traditional setting with a lantern and shelves in the background."
    },
    {
        "id": 218,
        "question": "Sun Wukong is <img>image_1</img>\n In the image: <img>image_2</img>\n Give a description.",
        "prediction": "Sun Wukong is in the center of the image, engaged in a fierce battle with two opponents on each side, wielding a weapon with both hands."
    },
    {
        "id": 219,
        "question": "Juan Lian is <img>image_1</img>\n Zi Xia is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Zi Xia doing?",
        "prediction": "Zi Xia is standing with her hands clasped together."
    },
    {
        "id": 220,
        "question": "Sun Wukong is <img>image_1</img>\n Zi Xia is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is sitting on top?",
        "prediction": "Sun Wukong is sitting on top."
    },
    {
        "id": 221,
        "question": "Erlang Shen is <img>image_1</img>\n Juan Lian is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Erlang Shen with grounding.",
        "prediction": "<ref>Erlang Shen</ref><box>(29,5),(662,989)</box>"
    },
    {
        "id": 222,
        "question": "Sun Wukong is <img>image_1</img>\n Zi Xia is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Zi Xia with grounding.",
        "prediction": "<ref>Zi Xia</ref><box>(142,223),(453,985)</box>"
    },
    {
        "id": 223,
        "question": "Royce is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Royce with grounding.",
        "prediction": "<ref>Royce</ref><box>(606,0),(967,969)</box>"
    },
    {
        "id": 224,
        "question": "Edwin is <img>image_1</img>\n Isabelle is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Isabelle with grounding.",
        "prediction": "<ref>Isabelle</ref><box>(385,195),(570,995)</box>"
    },
    {
        "id": 225,
        "question": "Elizabeth Shaw is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Elizabeth Shaw with grounding.",
        "prediction": "<ref>Elizabeth Shaw</ref><box>(92,131),(307,996)</box>"
    },
    {
        "id": 226,
        "question": "David is <img>image_1</img>\n Elizabeth Shaw is <img>image_2</img>\n In the image: <img>image_3</img>\n What is David doing?",
        "prediction": "David is standing and looking at a wall screen."
    },
    {
        "id": 227,
        "question": "David is <img>image_1</img>\n Elizabeth Shaw is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "David is lying on a table, and Elizabeth Shaw is leaning over him, appearing to be in a conversation or interaction."
    },
    {
        "id": 228,
        "question": "Hilly Holbrook is <img>image_1</img>\n Jolene French is <img>image_2</img>\n Elizabeth Leefolt is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Elizabeth Leefolt with grounding.",
        "prediction": "<ref>Elizabeth Leefolt</ref><box>(546,112),(821,987)</box>"
    },
    {
        "id": 229,
        "question": "Hilly Holbrook is <img>image_1</img>\n Skeeter Phelan is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is holding the other person's arms?",
        "prediction": "Hilly Holbrook is holding Skeeter Phelan's arms."
    },
    {
        "id": 230,
        "question": "Skeeter Phelan is <img>image_1</img>\n Elizabeth Leefolt is <img>image_2</img>\n In the image: <img>image_3</img>\n Who has a drink in front of herself?",
        "prediction": "Skeeter Phelan has a drink in front of herself."
    },
    {
        "id": 231,
        "question": "Skeeter Phelan is <img>image_1</img>\n Aibileen Clark is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Skeeter Phelan is seated at a kitchen table, looking at Aibileen Clark who stands with her hand on the table, in a domestic kitchen setting."
    },
    {
        "id": 232,
        "question": "Hilly Holbrook is <img>image_1</img>\n Elizabeth Leefolt is <img>image_2</img>\n Jolene French is <img>image_3</img>\n In the image: <img>image_4</img>\n What does Jolene French wear?",
        "prediction": "Jolene French is wearing a black dress with lace details."
    },
    {
        "id": 233,
        "question": "Aibileen Clark is <img>image_1</img>\n Minny Jackson is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Minny Jackson with grounding.",
        "prediction": "<ref>Minny Jackson</ref><box>(645,0),(837,983)</box>"
    },
    {
        "id": 234,
        "question": "Aibileen is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Aibileen doing?",
        "prediction": "Aibileen is standing and holding a picture."
    },
    {
        "id": 235,
        "question": "Hilly Holbrook is <img>image_1</img>\n Elizabeth Leefolt is <img>image_2</img>\n Describe the content of the image: <img>image_3</img>\n",
        "prediction": "Hilly Holbrook stands with her hands on her hips facing forward, while Elizabeth Leefolt holds a baby, looking away to the side."
    },
    {
        "id": 236,
        "question": "Meiqin is <img>image_1</img>\n Heizi is <img>image_2</img>\n Leizi is <img>image_3</img>\n Chuchun is <img>image_4</img>\n In the image: <img>image_5</img>\n Identify Leizi with grounding.",
        "prediction": "<ref>Leizi</ref><box>(10,260),(346,995)</box>"
    },
    {
        "id": 237,
        "question": "Heizi is <img>image_1</img>\n Chuchun is <img>image_2</img>\n Leizi is <img>image_3</img>\n In the image: <img>image_4</img>\n Who is standing behind?",
        "prediction": "Heizi is standing behind."
    },
    {
        "id": 238,
        "question": "Meiqin is <img>image_1</img>\n Shifeng is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Meiqin is seated at a library table reading a book, while Shifeng stands behind her, leaning over as if to get a closer look at the book or discuss it with Meiqin. They are surrounded by other students in a spacious library."
    },
    {
        "id": 239,
        "question": "Meiqin is <img>image_1</img>\n Heizi is <img>image_2</img>\n Leizi is <img>image_3</img>\n Chuchun is <img>image_4</img>\n In the image: <img>image_5</img>\n Who is squatting?",
        "prediction": "Meiqin is squatting."
    },
    {
        "id": 240,
        "question": "Pei is <img>image_1</img>\n Heizi is <img>image_2</img>\n Leizi is <img>image_3</img>\n Chuchun is <img>image_4</img>\n In the image: <img>image_5</img>\n What is Chuchun doing?",
        "prediction": "Chuchun is standing and looking towards Heizi."
    },
    {
        "id": 241,
        "question": "Meiqin is <img>image_1</img>\n Heizi is <img>image_2</img>\n Pei is <img>image_3</img>\n Describe the image: <img>image_4</img>\n",
        "prediction": "Heizi stands confidently with her hands on her hips facing Meiqin, who is kneeling on the ground looking up at Heizi. Pei observes from behind, standing slightly apart from Heizi. They are on a rooftop with a clear sky above."
    },
    {
        "id": 242,
        "question": "Meiqin is <img>image_1</img>\n Pei is <img>image_2</img>\n Leizi is <img>image_3</img>\n Chuchun is <img>image_4</img>\n In the image: <img>image_5</img>\n Who is carrying Pei on her back?",
        "prediction": "Leizi is carrying Pei on her back."
    },
    {
        "id": 243,
        "question": "Leizi is <img>image_1</img>\n Chuchun is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Chuchun's expression?",
        "prediction": "Chuchun appears to be smiling."
    },
    {
        "id": 244,
        "question": "Meiqin is <img>image_1</img>\n Pei is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Pei, in a blue dress, is leaping over a railing, while Meiqin, in a brown shirt and skirt, appears to be dodging or falling."
    },
    {
        "id": 245,
        "question": "Meiqin is <img>image_1</img>\n Dangma is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Dangma with grounding.",
        "prediction": "<ref>Dangma</ref><box>(477,108),(734,994)</box>"
    },
    {
        "id": 246,
        "question": "Meiqin is <img>image_1</img>\n Heizi is <img>image_2</img>\n Leizi is <img>image_3</img>\n Chuchun is <img>image_4</img>\n In the image: <img>image_5</img>\n Identify Heizi with grounding.",
        "prediction": "<ref>Heizi</ref><box>(57,509),(276,997)</box>"
    },
    {
        "id": 247,
        "question": "Meiqin is <img>image_1</img>\n Heizi is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Meiqin is sitting at a table, holding a spoon near Heizi, who is leaning forward slightly towards the table. They are outdoors with greenery and flowers in the background."
    },
    {
        "id": 248,
        "question": "Meiqin is <img>image_1</img>\n Shifeng is <img>image_2</img>\n In the image: <img>image_3</img>\n What does Meiqin wear?",
        "prediction": "Meiqin wears a black tank top."
    },
    {
        "id": 249,
        "question": "Dangma is <img>image_1</img>\n Leizi is <img>image_2</img>\n Describe the content of the image: <img>image_3</img>\n",
        "prediction": "Dangma is on the left, facing a girl named Leizi on the right, who is holding a necklace towards Dangma in an outdoor setting with buildings in the background."
    },
    {
        "id": 250,
        "question": "Heizi is <img>image_1</img>\n Leizi is <img>image_2</img>\n Chuchun is <img>image_3</img>\n In the image: <img>image_4</img>\n What is Chuchun doing?",
        "prediction": "Chuchun is pushing Heizi in a wheelchair."
    },
    {
        "id": 251,
        "question": "Runzi is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Runzi with grounding.",
        "prediction": "<ref>Runzi</ref><box>(108,172),(354,995)</box>"
    },
    {
        "id": 252,
        "question": "Meiqin is <img>image_1</img>\n Heizi is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Heizi is on the left, touching her right eye, while Meiqin is on the right, facing Heizi. Both are in a corridor with railings on the sides."
    },
    {
        "id": 253,
        "question": "Meiqin is <img>image_1</img>\n Heizi is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is holding the umbrella?",
        "prediction": "Meiqin is holding the umbrella."
    },
    {
        "id": 254,
        "question": "Leizi is <img>image_1</img>\n In the image: <img>image_2</img>\n What color are Leizi's clothes?",
        "prediction": "Leizi's clothes are grey with a red belt."
    },
    {
        "id": 255,
        "question": "Meiqin is <img>image_1</img>\n Heizi is <img>image_2</img>\n Leizi is <img>image_3</img>\n Chuchun is <img>image_4</img>\n In the image: <img>image_5</img>\n Identify Heizi with grounding.",
        "prediction": "<ref>Heizi</ref><box>(557,145),(867,994)</box>"
    },
    {
        "id": 256,
        "question": "Tangsan is <img>image_1</img>\n Xiaowu is <img>image_2</img>\n Rongrong is <img>image_3</img>\n Zhuqing is <img>image_4</img>\n In the image: <img>image_5</img>\n Identify Rongrong with grounding.",
        "prediction": "<ref>Rongrong</ref><box>(664,187),(857,975)</box>"
    },
    {
        "id": 257,
        "question": "Daimubai is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Daimubai doing?",
        "prediction": "Daimubai is standing with his arms crossed."
    },
    {
        "id": 258,
        "question": "Tangsan is <img>image_1</img>\n Xiaowu is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Tangsan, dressed in a blue outfit with a gold trim, stands on the left, and Xiaowu, wearing a pink dress with bunny ears, is on the right. They appear to be in a forest at night with glowing plants."
    },
    {
        "id": 259,
        "question": "Tangsan is <img>image_1</img>\n Xiaowu is <img>image_2</img>\n Rongrong is <img>image_3</img>\n Aosika is <img>image_4</img>\n In the image: <img>image_5</img>\n Give a description.",
        "prediction": "Tangsan, dressed in a blue suit, is standing on the left. Xiaowu, in a pink outfit, is adjusting Tangsan's clothes from behind on the left. Rongrong, wearing green, is holding Tangsan from behind on the right. Aosika, in a brown leather outfit, is on the far right side. They are all against a backdrop with glowing lights."
    },
    {
        "id": 260,
        "question": "Xiaose is <img>image_1</img>\n Wuxin is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Wuxin doing?",
        "prediction": "Wuxin is engaged in a close combat with Xiaose."
    },
    {
        "id": 261,
        "question": "Xiaose is <img>image_1</img>\n Wuxin is <img>image_2</img>\n Leiwujie is <img>image_3</img>\n In the image: <img>image_4</img>\n Provide description of visual content.",
        "prediction": "Wuxin is sitting on the ground to the left, working on something by the fire. Leiwujie is standing on the right, in a relaxed pose with an arm behind his back. Xiaose is not visible in the image."
    },
    {
        "id": 262,
        "question": "Xiaose is <img>image_1</img>\n Wuxin is <img>image_2</img>\n Baifa is <img>image_3</img>\n Leiwujie is <img>image_4</img>\n In the image: <img>image_5</img>\n Identify Baifa with grounding.",
        "prediction": "<ref>Baifa</ref><box>(16,125),(229,988)</box>"
    },
    {
        "id": 263,
        "question": "Tanglian is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Tanglian with grounding.",
        "prediction": "<ref>Tanglian</ref><box>(229,302),(528,988)</box>"
    },
    {
        "id": 264,
        "question": "Qianluo is <img>image_1</img>\n Tanglian is <img>image_2</img>\n Rui is <img>image_3</img>\n In the image: <img>image_4</img>\n Give a description.",
        "prediction": "Qianluo is seated on the left, wearing a blue outfit with a white frontlet. Tanglian is seated on the right, dressed in black with a red frontlet. Rui is seated beside Tanglian, adorned in a golden outfit. They appear to be engaged in a conversation in a room with pink curtains."
    },
    {
        "id": 265,
        "question": "Xiaose is <img>image_1</img>\n Leiwujie is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Xiaose doing?",
        "prediction": "Xiaose is sitting with their hand on their chin."
    },
    {
        "id": 266,
        "question": "Xiaose is <img>image_1</img>\n Qianluo is <img>image_2</img>\n Tanglian is <img>image_3</img>\n Leiwujie is <img>image_4</img>\n In the image: <img>image_5</img>\n Identify Qianluo with grounding.",
        "prediction": "<ref>Qianluo</ref><box>(203,239),(371,861)</box>"
    },
    {
        "id": 267,
        "question": "Yuechu is <img>image_1</img>\n Susu is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Susu with grounding.",
        "prediction": "<ref>Susu</ref><box>(510,525),(903,995)</box>"
    },
    {
        "id": 268,
        "question": "Yuechu is <img>image_1</img>\n Susu is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Yuechu doing?",
        "prediction": "Yuechu is holding a speech bubble with Chinese characters."
    },
    {
        "id": 269,
        "question": "Yuechu is <img>image_1</img>\n Susu is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Yuechu is on the left, holding a red popsicle, and Susu is on the right, wearing a multicolored kimono with a pointed tail. They appear to be standing against a clear sky with greenery in the background."
    },
    {
        "id": 270,
        "question": "Rongrong is <img>image_1</img>\n Yaya is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Rongrong with grounding.",
        "prediction": "<ref>Rongrong</ref><box>(28,242),(478,981)</box>"
    },
    {
        "id": 271,
        "question": "Fugui is <img>image_1</img>\n Qingtong is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Fugui doing?",
        "prediction": "Fugui is embracing Qingtong."
    },
    {
        "id": 272,
        "question": "Yuechu is <img>image_1</img>\n Susu is <img>image_2</img>\n Describe the image: <img>image_3</img>\n",
        "prediction": "Yuechu is standing on the left, drawing on a sketchbook. Susu is on the right, watching Yuechu with a pole in her hand. They are indoors with a wooden background."
    },
    {
        "id": 273,
        "question": "Yuechu is <img>image_1</img>\n Susu is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Susu is sitting on the ground next to Yuechu, who is lying down with their eyes closed. They appear to be in a destroyed village setting under a reddish sky."
    },
    {
        "id": 274,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n Yueer is <img>image_3</img>\n In the image: <img>image_4</img>\n Who is sitting on the chair?",
        "prediction": "Fujie is sitting on the chair."
    },
    {
        "id": 275,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Aniya doing?",
        "prediction": "Aniya is hiding behind the door."
    },
    {
        "id": 276,
        "question": "Aniya is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Aniya with grounding.",
        "prediction": "<ref>Aniya</ref><box>(97,373),(659,983)</box>"
    },
    {
        "id": 277,
        "question": "Fujie is <img>image_1</img>\n Yueer is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Yueer doing?",
        "prediction": "Yueer is leaning towards Fujie's face with her hand raised."
    },
    {
        "id": 278,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n Yueer is <img>image_3</img>\n In the image: <img>image_4</img>\n What does Yueer hold?",
        "prediction": "Yueer holds a picnic basket."
    },
    {
        "id": 279,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n Yueer is <img>image_3</img>\n Describe the image: <img>image_4</img>\n",
        "prediction": "Aniya is a young girl in a red jacket, standing between Fujie, who is dressed in a suit holding a gift, and Yueer, wearing a pink dress with shopping bags. They appear to be after a successful shopping trip."
    },
    {
        "id": 280,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n Yueer is <img>image_3</img>\n In the image: <img>image_4</img>\n Who is near the window?",
        "prediction": "Fujie is near the window."
    },
    {
        "id": 281,
        "question": "Fujie is <img>image_1</img>\n Yueer is <img>image_2</img>\n In the image: <img>image_3</img>\n Describe Yueer's outfit.",
        "prediction": "Yueer is wearing a blue dress with a yellow flower on the side."
    },
    {
        "id": 282,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n Yueer is <img>image_3</img>\n In the image: <img>image_4</img>\n Give a description.",
        "prediction": "Aniya is sitting on the floor, looking at Fujie, who is sitting on a sofa reading a paper. Yueer is kneeling on the right, also looking at Fujie. They are in a living room with a fireplace and a chandelier."
    },
    {
        "id": 283,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n Yueer is <img>image_3</img>\n What do you see in the image: <img>image_4</img>\n",
        "prediction": "Aniya is sitting on Fujie's lap, leaning against a cloudy sky backdrop. Fujie is holding onto Yueer, who appears to be standing slightly behind with a focused expression."
    },
    {
        "id": 284,
        "question": "Fujie is <img>image_1</img>\n Yueer is <img>image_2</img>\n Beiji is <img>image_3</img>\n In the image: <img>image_4</img>\n Give a description.",
        "prediction": "Fujie is sitting on a sofa with a surprised expression, while Yueer is standing beside her holding a cat. Beiji is sitting on the floor, looking towards Fujie and Yueer. The room has a cozy Japanese style interior."
    },
    {
        "id": 285,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n Provide a description of the image: <img>image_3</img>\n",
        "prediction": "Aniya is riding a surfboard with a girl in the front, while Fujie is lying on a surfboard in the back, both are in the middle of the ocean with small waves."
    },
    {
        "id": 286,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n Yueer is <img>image_3</img>\n In the image: <img>image_4</img>\n What expression is on Yueer's face?",
        "prediction": "Yueer is smiling with her eyes closed."
    },
    {
        "id": 287,
        "question": "Aniya is <img>image_1</img>\n Cizi is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Cizi's emotion?",
        "prediction": "Cizi appears to be angry."
    },
    {
        "id": 288,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Aniya is lying in bed, while Fujie is leaning over, looking at a clock."
    },
    {
        "id": 289,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n Yueer is <img>image_3</img>\n In the image: <img>image_4</img>\n What is Fujie doing?",
        "prediction": "Fujie is sitting and holding a drink."
    },
    {
        "id": 290,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n In the image: <img>image_3</img>\n Describe the content.",
        "prediction": "Aniya is seated at a table, focused on writing on pieces of paper, while Fujie stands behind her, gesturing as if he's explaining something."
    },
    {
        "id": 291,
        "question": "Aniya is <img>image_1</img>\n Beiji is <img>image_2</img>\n Cizi is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Cizi with grounding.",
        "prediction": "<ref>Cizi</ref><box>(579,230),(748,1020)</box>"
    },
    {
        "id": 292,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n Yueer is <img>image_3</img>\n In the image: <img>image_4</img>\n Whose arms is Aniya in?",
        "prediction": "Aniya is in Fujie's arms."
    },
    {
        "id": 293,
        "question": "Aniya is <img>image_1</img>\n Fujie is <img>image_2</img>\n Yueer is <img>image_3</img>\n In the image: <img>image_4</img>\n Give a description.",
        "prediction": "Aniya is standing on one leg with a stretched arm, seemingly in a dance pose. Fujie is sitting on the couch, partially visible on the right side. Yueer is standing behind Aniya, with her body angled towards the left side of the image."
    },
    {
        "id": 294,
        "question": "Ranbing is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Ranbing with grounding.",
        "prediction": "<ref>Ranbing</ref><box>(150,274),(331,966)</box>"
    },
    {
        "id": 295,
        "question": "Make is <img>image_1</img>\n Ranbing is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Make stands on the right in a suit, facing Ranbing, who is on the left in a bridal gown with a veil. They are surrounded by guests in a wedding setting."
    },
    {
        "id": 296,
        "question": "Ailika is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Ailika doing?",
        "prediction": "Ailika is sitting in a vehicle with her arm raised."
    },
    {
        "id": 297,
        "question": "Baiyuekui is <img>image_1</img>\n Xiadou is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Xiadou with grounding.",
        "prediction": "<ref>Xiadou</ref><box>(450,209),(757,838)</box>"
    },
    {
        "id": 298,
        "question": "Hongkou is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Hongkou doing?",
        "prediction": "Hongkou is holding a gun and seems to be in a conversation or negotiation."
    },
    {
        "id": 299,
        "question": "Chengxiaoshi is <img>image_1</img>\n Luguang is <img>image_2</img>\n Qiaoling is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Chengxiaoshi with grounding.",
        "prediction": "<ref>Chengxiaoshi</ref><box>(267,198),(493,891)</box>"
    },
    {
        "id": 300,
        "question": "Chengxiaoshi is <img>image_1</img>\n Qiaoling is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Qiaoling stands to the left with her body facing slightly right, wearing a school uniform. Chengxiaoshi is on the right, raising his hands with a smile, wearing a jacket with leaves on it. They appear to be outside near a window and a brick wall."
    },
    {
        "id": 301,
        "question": "Chengxiaoshi is <img>image_1</img>\n Luguang is <img>image_2</img>\n Qiaoling is <img>image_3</img>\n In the image: <img>image_4</img>\n Who is opening the door?",
        "prediction": "Qiaoling is opening the door."
    },
    {
        "id": 302,
        "question": "Chengxiaoshi is <img>image_1</img>\n Luguang is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Chengxiaoshi doing?",
        "prediction": "Chengxiaoshi is standing next to Luguang."
    },
    {
        "id": 303,
        "question": "Luguang is <img>image_1</img>\n Qiaoling is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Luguang with grounding.",
        "prediction": "<ref>Luguang</ref><box>(43,103),(403,983)</box>"
    },
    {
        "id": 304,
        "question": "Cody is <img>image_1</img>\n May is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify May with grounding.",
        "prediction": "<ref>May</ref><box>(150,184),(407,937)</box>"
    },
    {
        "id": 305,
        "question": "Cody is <img>image_1</img>\n May is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Cody doing?",
        "prediction": "Cody is holding a large rock."
    },
    {
        "id": 306,
        "question": "Cody is <img>image_1</img>\n May is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is carrying the red magnet?",
        "prediction": "Cody is carrying the red magnet."
    },
    {
        "id": 307,
        "question": "Bob parr is <img>image_1</img>\n Elastigirl is <img>image_2</img>\n Dashiell is <img>image_3</img>\n Violet parr is <img>image_4</img>\n In the image: <img>image_5</img>\n Identify Elastigirl with grounding.",
        "prediction": "<ref>Elastigirl</ref><box>(645,129),(950,972)</box>"
    },
    {
        "id": 308,
        "question": "Elastigirl is <img>image_1</img>\n Dashiell is <img>image_2</img>\n Violet parr is <img>image_3</img>\n Jack-jack parr is <img>image_4</img>\n In the image: <img>image_5</img>\n Who is holding Jack-jack in arms?",
        "prediction": "Violet parr."
    },
    {
        "id": 309,
        "question": "Bob parr is <img>image_1</img>\n Dashiell is <img>image_2</img>\n Violet parr is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Violet parr with grounding.",
        "prediction": "<ref>Violet parr</ref><box>(735,338),(978,986)</box>"
    },
    {
        "id": 310,
        "question": "Elastigirl is <img>image_1</img>\n Jack-jack parr is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Elastigirl is standing in a kitchen, holding JackJack Parr's head above a sink filled with water and soap."
    },
    {
        "id": 311,
        "question": "Elastigirl is <img>image_1</img>\n Dashiell is <img>image_2</img>\n Jack-jack parr is <img>image_3</img>\n In the image: <img>image_4</img>\n What is Dashiell doing?",
        "prediction": "Dashiell is sitting at the dinner table."
    },
    {
        "id": 312,
        "question": "Bob parr is <img>image_1</img>\n Elastigirl is <img>image_2</img>\n Dashiell is <img>image_3</img>\n Violet parr is <img>image_4</img>\n Jack-jack parr is <img>image_5</img>\n In the image: <img>image_6</img>\n Identify Bob parr with grounding.",
        "prediction": "<ref>Bob parr</ref><box>(114,38),(367,951)</box>"
    },
    {
        "id": 313,
        "question": "Bob parr is <img>image_1</img>\n/img>\n In the image: <img>image_2</img>\n What is Bob parr doing?",
        "prediction": "Bob parr is pouring a drink from a pitcher."
    },
    {
        "id": 314,
        "question": "Bob parr is <img>image_1</img>\n Elastigirl is <img>image_2</img>\n Jack-jack parr is <img>image_3</img>\n In the image: <img>image_4</img>\n Give a description.",
        "prediction": "Elastigirl is sitting on a red motorcycle, looking to her right. Bob Parr is standing to her left, holding a small creature. JackJack Parr is standing between them, slightly behind. They are in a dimly lit room."
    },
    {
        "id": 315,
        "question": "Woody is <img>image_1</img>\n Jessie is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Jessie with grounding.",
        "prediction": "<ref>Jessie</ref><box>(7,35),(525,983)</box>"
    },
    {
        "id": 316,
        "question": "Buzz is <img>image_1</img>\n Jessie is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Buzz Lightyear is playfully tussling with Jessie on a wooden floor."
    },
    {
        "id": 317,
        "question": "Woody is <img>image_1</img>\n Jessie is <img>image_2</img>\n Describe the content of the image: <img>image_3</img>\n",
        "prediction": "Woody and Jessie are standing close to each other, surrounded by various characters from the Toy Story series."
    },
    {
        "id": 318,
        "question": "Woody is <img>image_1</img>\n Buzz is <img>image_2</img>\n Lotso is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Woody with grounding.",
        "prediction": "<ref>Woody</ref><box>(785,62),(951,820)</box>"
    },
    {
        "id": 319,
        "question": "Woody is <img>image_1</img>\n Buzz is <img>image_2</img>\n Jessie is <img>image_3</img>\n In the image: <img>image_4</img>\n Who is Buzz looking at?",
        "prediction": "Buzz is looking at Woody."
    },
    {
        "id": 320,
        "question": "Betty is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Betty with grounding.",
        "prediction": "<ref>Betty</ref><box>(489,105),(905,887)</box>"
    },
    {
        "id": 321,
        "question": "Woody is <img>image_1</img>\n Buzz is <img>image_2</img>\n In the image: <img>image_3</img>\n Which hand of the boy is Buzz in?",
        "prediction": "Buzz is on the left hand of the boy."
    },
    {
        "id": 322,
        "question": "Woody is <img>image_1</img>\n Buzz is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Woody doing?",
        "prediction": "Woody is standing with his back slightly turned."
    },
    {
        "id": 323,
        "question": "Buzz is <img>image_1</img>\n Jessie is <img>image_2</img>\n Rex is <img>image_3</img>\n In the image: <img>image_4</img>\n Who is being pressed at the bottom?",
        "prediction": "Rex is being pressed at the bottom."
    },
    {
        "id": 324,
        "question": "Woody is <img>image_1</img>\n Buzz is <img>image_2</img>\n Jessie is <img>image_3</img>\n In the image: <img>image_4</img>\n Give a description.",
        "prediction": "Woody stands on the left, Buzz in the center, and Jessie on the right, surrounded by various toy characters in a colorful toy room."
    },
    {
        "id": 325,
        "question": "Woody is <img>image_1</img>\n Buzz is <img>image_2</img>\n Describe the image: <img>image_3</img>\n",
        "prediction": "Woody on the left is holding up his right hand while Buzz on the right is holding up a purple object in a room filled with various toys."
    },
    {
        "id": 326,
        "question": "Woody is <img>image_1</img>\n Buzz is <img>image_2</img>\n Bo peep is <img>image_3</img>\n In the image: <img>image_4</img>\n What is Bo peep doing?",
        "prediction": "Bo peep is walking with a sheep."
    },
    {
        "id": 327,
        "question": "Woody is <img>image_1</img>\n Bo peep is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Woody is lying on his back on the grass, and Bo Peep is sitting on top of him, leaning closely towards him."
    },
    {
        "id": 328,
        "question": "Woody is <img>image_1</img>\n Buzz is <img>image_2</img>\n Please give a summary of the image: <img>image_3</img>\n",
        "prediction": "Woody is sitting on a red couch with his hands up in the air, while Buzz is standing with his arms spread out wide, both against a backdrop of a vibrant, colorful scene."
    },
    {
        "id": 329,
        "question": "Woody is <img>image_1</img>\n Buzz is <img>image_2</img>\n Jessie is <img>image_3</img>\n Rex is <img>image_4</img>\n In the image: <img>image_5</img>\n Identify Rex with grounding.",
        "prediction": "<ref>Rex</ref><box>(767,357),(983,826)</box>"
    },
    {
        "id": 330,
        "question": "Woody is <img>image_1</img>\n Buzz is <img>image_2</img>\n Jessie is <img>image_3</img>\n Bo peep is <img>image_4</img>\n In the image: <img>image_5</img>\n Who is on the right of Bo peep?",
        "prediction": "Woody is on the right of Bo peep."
    },
    {
        "id": 331,
        "question": "Fuheihui is <img>image_1</img>\n Qiangwei is <img>image_2</img>\n Wutiaowu is <img>image_3</img>\n Qihai is <img>image_4</img>\n In the image: <img>image_5</img>\n Identify Fuheihui with grounding.",
        "prediction": "<ref>Fuheihui</ref><box>(389,116),(935,978)</box>"
    },
    {
        "id": 332,
        "question": "Lizi is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Lizi doing?",
        "prediction": "Lizi is standing with her hands on her hips."
    },
    {
        "id": 333,
        "question": "Wutiaowu is <img>image_1</img>\n Xiayoujie is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Wutiaowu is seated facing away, looking out a cityscape, while Xiayoujie stands beside him facing towards the viewer."
    },
    {
        "id": 334,
        "question": "Wutiaowu is <img>image_1</img>\n Lizi is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Wutiaowu doing?",
        "prediction": "Wutiaowu is sitting and embracing Lizi."
    },
    {
        "id": 335,
        "question": "Wutiaowu is <img>image_1</img>\n Xiayoujie is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Xiayoujie's expression?",
        "prediction": "Xiayoujie looks surprised or in a state of excitement."
    },
    {
        "id": 336,
        "question": "Huzhang is <img>image_1</img>\n Fuheihui is <img>image_2</img>\n Qiangwei is <img>image_3</img>\n Wutiaowu is <img>image_4</img>\n In the image: <img>image_5</img>\n Who is sitting on the sofa?",
        "prediction": "Huzhang is sitting on the sofa."
    },
    {
        "id": 337,
        "question": "Dongtang is <img>image_1</img>\n Zhenren is <img>image_2</img>\n In the image: <img>image_3</img>\n What is Zhenren doing?",
        "prediction": "Zhenren is forcefully pushing Dongtang away."
    },
    {
        "id": 338,
        "question": "Huzhang is <img>image_1</img>\n Fuheihui is <img>image_2</img>\n Goujuan is <img>image_3</img>\n In the image: <img>image_4</img>\n Give a description.",
        "prediction": "Huzhang is on the right, making a peace sign with his fingers. Fuheihui is in the middle, slightly turned to his left, with one hand covering his face. Goujuan is on the left, gesturing with both hands towards his chest. They all appear to accompany their gestures with smiles."
    },
    {
        "id": 339,
        "question": "Xiayoujie is <img>image_1</img>\n In the image: <img>image_2</img>\n What is Xiayoujie doing?",
        "prediction": "Xiayoujie is sitting and reading a book."
    },
    {
        "id": 340,
        "question": "Qihai is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Qihai with grounding.",
        "prediction": "<ref>Qihai</ref><box>(153,239),(606,977)</box>"
    },
    {
        "id": 341,
        "question": "Huzhang is <img>image_1</img>\n Mingming is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Mingming with grounding.",
        "prediction": "<ref>Mingming</ref><box>(589,433),(767,988)</box>"
    },
    {
        "id": 342,
        "question": "Louhu is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Louhu with grounding.",
        "prediction": "<ref>Louhu</ref><box>(221,250),(560,968)</box>"
    },
    {
        "id": 343,
        "question": "Fuheihui is <img>image_1</img>\n Qiangwei is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Qiangwei with grounding.",
        "prediction": "<ref>Qiangwei</ref><box>(96,131),(443,988)</box>"
    },
    {
        "id": 344,
        "question": "Huzhang is <img>image_1</img>\n Louhu is <img>image_2</img>\n Give a content description of the image: <img>image_3</img>\n",
        "prediction": "Louhu, wearing a spotted outfit, is holding Huzhang, who appears to be in a relaxed position on the ground."
    },
    {
        "id": 345,
        "question": "Wutiaowu is <img>image_1</img>\n Xiayoujie is <img>image_2</img>\n In the image: <img>image_3</img>\n What does Wutiaowu wear?",
        "prediction": "Wutiaowu is wearing a red jacket over a white shirt."
    },
    {
        "id": 346,
        "question": "Wutiaowu is <img>image_1</img>\n Shener is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Wutiaowu is in a combat stance with a sword in hand, positioned as if ready to strike. Shener is in a defensive pose with a sword, facing Wutiaowu. They seem to be in a tense altercation under a red torii gate in a wooded area."
    },
    {
        "id": 347,
        "question": "Wutiaowu is <img>image_1</img>\n Xiayoujie is <img>image_2</img>\n Lizi is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Xiayoujie with grounding.",
        "prediction": "<ref>Xiayoujie</ref><box>(753,101),(998,978)</box>"
    },
    {
        "id": 348,
        "question": "Huzhang is <img>image_1</img>\n Qiangwei is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Qiangwei, wearing a blue outfit, is gesturing with her right hand towards Huzhang, who is dressed in a red outfit with spiky hair. They appear to be in a conversation."
    },
    {
        "id": 349,
        "question": "Huzhang is <img>image_1</img>\n Fuheihui is <img>image_2</img>\n Qiangwei is <img>image_3</img>\n Please explain what's happening in the photo: <img>image_4</img>\n",
        "prediction": "Huzhang is standing on the left, wearing a red scarf and a dark uniform with his fist clenched. Fuheihui is on the right, in a brown uniform, also with a fists clasped. Qiangwei is in the center, partially obscured, facing right, wearing a blue uniform."
    },
    {
        "id": 350,
        "question": "Huzhang is <img>image_1</img>\n Fuheihui is <img>image_2</img>\n Qiangwei is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Huzhang with grounding.",
        "prediction": "<ref>Huzhang</ref><box>(155,108),(596,972)</box>"
    },
    {
        "id": 351,
        "question": "Ningguang is <img>image_1</img>\n Beidou is <img>image_2</img>\n In the image: <img>image_3</img>\n Give a description.",
        "prediction": "Ningguang, on the left, is standing behind a table looking towards Beidou, who is leaning on the table with a relaxed posture, both surrounded by a lush forest setting."
    },
    {
        "id": 352,
        "question": "Zhongli is <img>image_1</img>\n Leishen is <img>image_2</img>\n Funingna is <img>image_3</img>\n In the image: <img>image_4</img>\n Identify Leishen with grounding.",
        "prediction": "<ref>Leishen</ref><box>(546,217),(839,987)</box>"
    },
    {
        "id": 353,
        "question": "Hutao is <img>image_1</img>\n Yelan is <img>image_2</img>\n Xiao is <img>image_3</img>\n In the image: <img>image_4</img>\n What is Xiao doing?",
        "prediction": "Xiao is walking to the right."
    },
    {
        "id": 354,
        "question": "Linghua is <img>image_1</img>\n Lingren is <img>image_2</img>\n In the image: <img>image_3</img>\n What is in Lingren's hand?",
        "prediction": "Lingren is holding a teapot."
    },
    {
        "id": 355,
        "question": "Wulang is <img>image_1</img>\n In the image: <img>image_2</img>\n Identify Wulang with grounding.",
        "prediction": "<ref>Wulang</ref><box>(256,190),(456,933)</box>"
    },
    {
        "id": 356,
        "question": "Abeiduo is <img>image_1</img>\n Kong is <img>image_2</img>\n In the image: <img>image_3</img>\n Who is holding a sword?",
        "prediction": "Kong is holding a sword."
    },
    {
        "id": 357,
        "question": "Diluke is <img>image_1</img>\n Qin is <img>image_2</img>\n In the image: <img>image_3</img>\n Identify Qin with grounding.",
        "prediction": "<ref>Qin</ref><box>(135,210),(359,995)</box>"
    },
    {
        "id": 358,
        "question": "Ningguang is <img>image_1</img>\n Kong is <img>image_2</img>\n Describe the image: <img>image_3</img>\n",
        "prediction": "Ningguang is seated to the left, roasting marshmallows on a campfire, while Kong is seated to the right, looking on. They are surrounded by a forest and clear sky."
    },
    {
        "id": 359,
        "question": "Keqing is <img>image_1</img>\n Ying is <img>image_2</img>\n Give a content description of the image: <img>image_3</img>\n",
        "prediction": "Keqing is on the left, wearing a purple dress with white cat ears, standing next to Ying, who is on the right wearing a blue gladiator-style outfit with a cape. They are in front of a mountainous backdrop."
    },
    {
        "id": 360,
        "question": "Regina Lampert is <img>image_0</img>\n In the following four images, Which is Regina Lampert? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 361,
        "question": "Peter Joshua is <img>image_0</img>\n In the following four images, Which is Peter Joshua? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 362,
        "question": "Peter Joshua is <img>image_0</img>\n In the following four images, Which is Peter Joshua? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 363,
        "question": "Hamilton Bartholomew is <img>image_0</img>\n In the following four images, Which is Hamilton Bartholomew? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 364,
        "question": "Kazan is <img>image_0</img>\n In the following four images, Which is Kazan? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 365,
        "question": "Worth is <img>image_0</img>\n In the following four images, Which is Worth? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 4"
    },
    {
        "id": 366,
        "question": "Worth is <img>image_0</img>\n In the following four images, Which is Worth? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 2"
    },
    {
        "id": 367,
        "question": "Worth is <img>image_0</img>\n In the following four images, Which is Worth? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 368,
        "question": "Quentin is <img>image_0</img>\n In the following four images, Which is Quentin? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 369,
        "question": "Holloway is <img>image_0</img>\n In the following four images, Which is Holloway? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 370,
        "question": "Leaven is <img>image_0</img>\n In the following four images, Which is Leaven? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 4"
    },
    {
        "id": 371,
        "question": "Leaven is <img>image_0</img>\n In the following four images, Which is Leaven? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 4"
    },
    {
        "id": 372,
        "question": "Agent Fox Mulder is <img>image_0</img>\n In the following four images, Which is Agent Fox Mulder? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 2"
    },
    {
        "id": 373,
        "question": "Agent Dana Scully is <img>image_0</img>\n In the following four images, Which is Agent Dana Scully? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 374,
        "question": "Reed Richards is <img>image_0</img>\n In the following four images, Which is Reed Richards? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 2"
    },
    {
        "id": 375,
        "question": "Johnny Storm is <img>image_0</img>\n In the following four images, Which is Johnny Storm? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 376,
        "question": "Sue Storm is <img>image_0</img>\n In the following four images, Which is Sue Storm? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 2"
    },
    {
        "id": 377,
        "question": "Victor Von Doom is <img>image_0</img>\n In the following four images, Which is Victor Von Doom? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 378,
        "question": "Sharon is <img>image_0</img>\n In the following four images, Which is Sharon? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 4"
    },
    {
        "id": 379,
        "question": "Burke Dennings is <img>image_0</img>\n In the following four images, Which is Burke Dennings? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 380,
        "question": "Jen Yu (Mandarin version) is <img>image_0</img>\n In the following four images, Which is Jen Yu (Mandarin version)? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 4"
    },
    {
        "id": 381,
        "question": "Master Li Mu Bai is <img>image_0</img>\n In the following four images, Which is Master Li Mu Bai? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 382,
        "question": "Yu Shu Lien is <img>image_0</img>\n In the following four images, Which is Yu Shu Lien? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 383,
        "question": "Lo 'Dark Cloud' is <img>image_0</img>\n In the following four images, Which is Lo 'Dark Cloud'? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 384,
        "question": "Evan is <img>image_0</img>\n In the following four images, Which is Evan? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 2"
    },
    {
        "id": 385,
        "question": "Andrea is <img>image_0</img>\n In the following four images, Which is Andrea? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 386,
        "question": "Lenny is <img>image_0</img>\n In the following four images, Which is Lenny? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 387,
        "question": "Kayleigh is <img>image_0</img>\n In the following four images, Which is Kayleigh? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 388,
        "question": "antelope is <img>image_0</img>\n In the following four images, Which is antelope? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 389,
        "question": "appaloosa is <img>image_0</img>\n In the following four images, Which is appaloosa? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 390,
        "question": "arabiancamel is <img>image_0</img>\n In the following four images, Which is arabiancamel? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 391,
        "question": "asiancrocodile is <img>image_0</img>\n In the following four images, Which is asiancrocodile? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 4"
    },
    {
        "id": 392,
        "question": "belgianhare is <img>image_0</img>\n In the following four images, Which is belgianhare? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 393,
        "question": "bighorn is <img>image_0</img>\n In the following four images, Which is bighorn? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 394,
        "question": "bovine is <img>image_0</img>\n In the following four images, Which is bovine? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 4"
    },
    {
        "id": 395,
        "question": "cheetah is <img>image_0</img>\n In the following four images, Which is cheetah? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 2"
    },
    {
        "id": 396,
        "question": "cheviot is <img>image_0</img>\n In the following four images, Which is cheviot? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 397,
        "question": "dallsheep is <img>image_0</img>\n In the following four images, Which is dallsheep? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 398,
        "question": "elk is <img>image_0</img>\n In the following four images, Which is elk? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 2"
    },
    {
        "id": 399,
        "question": "forestgoat is <img>image_0</img>\n In the following four images, Which is forestgoat? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 4"
    },
    {
        "id": 400,
        "question": "The reference vehicle is <img>image_0</img>\n In the following four images, Which is the reference vehicle? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 401,
        "question": "The reference vehicle is <img>image_0</img>\n In the following four images, Which is the reference vehicle? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 402,
        "question": "The reference vehicle is <img>image_0</img>\n In the following four images, Which is the reference vehicle? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 403,
        "question": "The reference vehicle is <img>image_0</img>\n In the following four images, Which is the reference vehicle? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 2"
    },
    {
        "id": 404,
        "question": "The reference vehicle is <img>image_0</img>\n In the following four images, Which is the reference vehicle? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 405,
        "question": "The reference vehicle is <img>image_0</img>\n In the following four images, Which is the reference vehicle? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 406,
        "question": "The reference vehicle is <img>image_0</img>\n In the following four images, Which is the reference vehicle? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 407,
        "question": "The reference vehicle is <img>image_0</img>\n In the following four images, Which is the reference vehicle? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 408,
        "question": "The reference vehicle is <img>image_0</img>\n In the following four images, Which is the reference vehicle? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 409,
        "question": "The reference vehicle is <img>image_0</img>\n In the following four images, Which is the reference vehicle? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 410,
        "question": "Willem_Merkxtuin is <img>image_0</img>\n In the following four images, Which is Willem_Merkxtuin? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 4"
    },
    {
        "id": 411,
        "question": "Ch%C3%A2teau_de_Chenonceau is <img>image_0</img>\n In the following four images, Which is Ch%C3%A2teau_de_Chenonceau? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 4"
    },
    {
        "id": 412,
        "question": "Munshi_Ghat is <img>image_0</img>\n In the following four images, Which is Munshi_Ghat? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 413,
        "question": "Burg_Heidenreichstein is <img>image_0</img>\n In the following four images, Which is Burg_Heidenreichstein? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 414,
        "question": "Santa_Maria_in_Cappella_(Rome) is <img>image_0</img>\n In the following four images, Which is Santa_Maria_in_Cappella_(Rome)? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 1"
    },
    {
        "id": 415,
        "question": "%C3%89glise_Saint-Martin_(Laon) is <img>image_0</img>\n In the following four images, Which is %C3%89glise_Saint-Martin_(Laon)? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 4"
    },
    {
        "id": 416,
        "question": "Bizzell_Memorial_Library is <img>image_0</img>\n In the following four images, Which is Bizzell_Memorial_Library? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 4"
    },
    {
        "id": 417,
        "question": "Hiyoshi_Taisha is <img>image_0</img>\n In the following four images, Which is Hiyoshi_Taisha? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 418,
        "question": "Michelangelo's_Moses is <img>image_0</img>\n In the following four images, Which is Michelangelo's_Moses? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 2"
    },
    {
        "id": 419,
        "question": "%C4%BDubov%C5%88a_Castle is <img>image_0</img>\n In the following four images, Which is %C4%BDubov%C5%88a_Castle? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 420,
        "question": "Coll%C3%A9giale_Notre-Dame_de_Vernon is <img>image_0</img>\n In the following four images, Which is Coll%C3%A9giale_Notre-Dame_de_Vernon? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 2"
    },
    {
        "id": 421,
        "question": "Cathedral_of_Guadalajara,_Mexico is <img>image_0</img>\n In the following four images, Which is Cathedral_of_Guadalajara,_Mexico? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 422,
        "question": "Kankalini_Temple,_Saptari is <img>image_0</img>\n In the following four images, Which is Kankalini_Temple,_Saptari? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 423,
        "question": "Wat_Thewarat_Kunchon is <img>image_0</img>\n In the following four images, Which is Wat_Thewarat_Kunchon? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 424,
        "question": "Moszna_Castle is <img>image_0</img>\n In the following four images, Which is Moszna_Castle? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 425,
        "question": "Cathedral-Basilica_of_Mary,_Queen_of_the_World is <img>image_0</img>\n In the following four images, Which is Cathedral-Basilica_of_Mary,_Queen_of_the_World? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 3"
    },
    {
        "id": 426,
        "question": "Helsinki_Lutheran_Cathedral is <img>image_0</img>\n In the following four images, Which is Helsinki_Lutheran_Cathedral? Image 1: <img>image_1</img>\n Image 2: <img>image_2</img>\n Image 3: <img>image_3</img>\n Image 4: <img>image_4</img>\n",
        "prediction": "Image 2"
    }
]