5 February 2024 MiDaS: a large-scale Minecraft dataset for non-natural image benchmarking
David Torpey, Max Parkin, Jonah Alter, Richard Klein, Steven James
Author Affiliations +
Abstract

Reinforcement learning (RL) has recently made several significant advances using video games as a testbed. While many of these games are relatively self-contained, there has been a recent push to develop agents capable of tackling massive, open-ended environments that are more reminiscent of the real world. One of the most popular of these platforms is Minecraft, but to attain human-level performance, agents must be able to learn, plan, and reason using high-dimensional image input. Commonly, an agent will attempt to extract lower-dimensional features that assist with downstream tasks. However, representation learning techniques have primarily been applied to real-world, natural image datasets, and it is unclear how these same methods might translate to an artificial world with non-natural images. We therefore present MiDaS, a novel large-scale Minecraft dataset featuring 36,000 labeled images across 60 classes. MiDaS contains information about both the blocks in the image, critical to solving the game, as well as auxiliary information such as time of day and biome. Further, we perform an evaluation of various models to benchmark performance on this new dataset. Since RL agents must be capable of learning features without labels, we include benchmarks of various self-supervised learning approaches on the dataset. Our results indicate that self-supervised methods perform best in the linear evaluation paradigm, particularly in low-label settings with a ResNet-based backbone, whereas ImageNet-pretraining assists more in the fine-tuning setting. The full dataset is available at https://github.com/MinecraftDataset/MiDaS.

© 2024 SPIE and IS&T
David Torpey, Max Parkin, Jonah Alter, Richard Klein, and Steven James "MiDaS: a large-scale Minecraft dataset for non-natural image benchmarking," Journal of Electronic Imaging 33(1), 013035 (5 February 2024). https://doi.org/10.1117/1.JEI.33.1.013035
Received: 8 August 2023; Accepted: 16 January 2024; Published: 5 February 2024
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Machine learning

Performance modeling

Data modeling

Education and training

Image processing

Visualization

Solid state lighting

Back to Top