Scale AI SEAL

VISTA

Visual Language Understanding

Last updated: April 4, 2025

Performance Comparison

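| Rank | Model | Score |
|------|-------|-------|
| 1 | | 54.65 ± 1.46 |
| 1 | | 54.63 ± 0.55 |
| 3 | | 51.79 ± 0.63 |
| 3 | | 51.66 ± 1.08 |
| 3 | | 50.78 ± 0.57 |
| 3 | | 50.07 ± 1.14 |
| 5 | | 49.59 ± 0.66 |
| 6 | | 49.15 ± 0.36 |
| 6 | | 47.32 ± 1.78 |
| 7 | | 48.23 ± 0.70 |
| 9 | | 46.97 ± 1.29 |
| 9 | | 46.96 ± 0.95 |
| 10 | | 45.50 ± 1.20 |
| 10 | | 45.34 ± 0.91 |
| 11 | | 45.49 ± 0.21 |
| 12 | | 45.25 ± 0.40 |
| 15 | | 43.53 ± 1.24 |
| 15 | | 43.25 ± 1.26 |
| 17 | | 43.21 ± 0.52 |
| 17 | | 43.02 ± 1.14 |
| 17 | | 42.11 ± 1.39 |
| 21 | | 41.14 ± 0.58 |
| 21 | | 39.95 ± 0.80 |
| 22 | | 39.85 ± 0.71 |
| 23 | Claude 3.5 Sonnet (October 2024) | 38.72 ± 0.51 |
| 25 | Claude 3.5 Sonnet (June 2024) | 38.37 ± 0.70 |
| 25 | | 38.33 ± 0.55 |
| 25 | ChatGPT-4o-latest (November 2024) | 37.99 ± 0.48 |
| 25 | Gemini 1.5 Pro | 37.07 ± 1.34 |
| 30 | GPT-4o (August 2024) | 34.94 ± 0.23 |
| 30 | | 34.59 ± 1.12 |
| 30 | Gemini 1.5 Flash 002 | 34.03 ± 1.41 |
| 31 | Pixtral Large (November 2024) | 33.89 ± 0.69 |
| 31 | | 32.69 ± 1.40 |
| 35 | Qwen2-VL-72B-Instruct | 28.56 ± 1.37 |
| 35 | Claude 3 Opus | 27.82 ± 0.55 |
| 37 | | 26.55 ± 0.35 |
| 37 | Nova Pro | 26.27 ± 0.61 |
| 37 | Pixtral 12B (September 2024) | 25.97 ± 0.74 |
| 37 | Nova Lite | 25.50 ± 0.77 |
| 39 | Llama 3.2 90B Vision Instruct | 24.61 ± 0.80 |
| 42 | Llama 3.2 11B Vision-Instruct | 20.47 ± 0.15 |
| 43 | Phi 3.5 Vision-Instruct | 15.18 ± 0.81 |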