MiDaS: a large-scale Minecraft dataset for non-natural image benchmarking

David Torpey; Max Parkin; Jonah Alter; Richard Klein; Steven James

doi:10.1117/1.JEI.33.1.013035

Journal of Electronic Imaging, Vol. 33, Issue 1, 013035 (February 2024). https://doi.org/10.1117/1.JEI.33.1.013035

Abstract

Reinforcement learning (RL) has recently made several significant advances using video games as a testbed. While many of these games are relatively self-contained, there has been a recent push to develop agents capable of tackling massive, open-ended environments that are more reminiscent of the real world. One of the most popular of these platforms is Minecraft, but to attain human-level performance, agents must be able to learn, plan, and reason using high-dimensional image input. Commonly, an agent will attempt to extract lower-dimensional features that assist with downstream tasks. However, representation learning techniques have primarily been applied to real-world, natural image datasets, and it is unclear how these same methods might translate to an artificial world with non-natural images. We therefore present MiDaS, a novel large-scale Minecraft dataset featuring 36,000 labeled images across 60 classes. MiDaS contains information about the blocks in the image, which are critical to solving the game, and auxiliary information such as time of day and biome. Further, we evaluate various models to benchmark performance on this new dataset. Since RL agents must be capable of learning features without labels, we include benchmarks of various self-supervised learning approaches on the dataset. Our results indicate that self-supervised methods perform best in the linear evaluation paradigm, particularly in low-label settings with a ResNet-based backbone, whereas ImageNet pretraining assists more in the fine-tuning setting. The full dataset is available at https://github.com/MinecraftDataset/MiDaS.
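For readers unfamiliar with the linear evaluation paradigm mentioned in the abstract, the following is a minimal sketch of the general idea: a feature extractor is kept frozen and only a linear classifier is trained on its outputs, whereas fine-tuning would also update the extractor. This is not the authors' code and does not load MiDaS. The hand-picked `ENCODER_WEIGHTS`, the two-class toy "images", and all function names are assumptions made purely for illustration, and a real experiment would use a pretrained ResNet-style backbone rather than three fixed filters.

```python
import math
import random

# Hypothetical frozen "backbone": three fixed filters over a 4-value toy image.
# These weights are never updated, which is the defining property of linear evaluation.
ENCODER_WEIGHTS = [
    [1.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 1.0],
    [1.0, -1.0, 0.0, 0.0],
]


def frozen_encoder(image, weights=ENCODER_WEIGHTS):
    """Map an image to features with fixed weights followed by a ReLU."""
    return [max(0.0, sum(w * x for w, x in zip(row, image))) for row in weights]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def scores(W, b, feats):
    """Class logits of the linear classifier for one feature vector."""
    return [sum(w * x for w, x in zip(row, feats)) + bias for row, bias in zip(W, b)]


def train_linear_probe(features, labels, num_classes, epochs=50, lr=0.1):
    """Train softmax regression on precomputed features with plain SGD."""
    dim = len(features[0])
    W = [[0.0] * dim for _ in range(num_classes)]
    b = [0.0] * num_classes
    for _ in range(epochs):
        for feats, label in zip(features, labels):
            probs = softmax(scores(W, b, feats))
            for c in range(num_classes):
                # Gradient of the cross-entropy loss with respect to logit c.
                grad = probs[c] - (1.0 if c == label else 0.0)
                b[c] -= lr * grad
                for j in range(dim):
                    W[c][j] -= lr * grad * feats[j]
    return W, b


def predict(W, b, feats):
    """Return the index of the highest-scoring class."""
    s = scores(W, b, feats)
    return s.index(max(s))


def make_toy_split(rng, n_per_class):
    """Synthetic two-class data (NOT MiDaS): noisy copies of two prototypes."""
    prototypes = [[1.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 1.0]]
    images, labels = [], []
    for label, proto in enumerate(prototypes):
        for _ in range(n_per_class):
            images.append([p + rng.gauss(0.0, 0.1) for p in proto])
            labels.append(label)
    return images, labels


def run_demo(seed=0):
    """Encode once with the frozen encoder, fit the probe, return test accuracy."""
    rng = random.Random(seed)
    train_x, train_y = make_toy_split(rng, 20)
    test_x, test_y = make_toy_split(rng, 20)
    train_f = [frozen_encoder(x) for x in train_x]
    test_f = [frozen_encoder(x) for x in test_x]
    W, b = train_linear_probe(train_f, train_y, num_classes=2)
    correct = sum(predict(W, b, f) == y for f, y in zip(test_f, test_y))
    return correct / len(test_y)


print(f"linear-probe accuracy on toy data: {run_demo():.2f}")
```

Because the encoder is frozen, features can be computed once and cached, so the only trainable parameters are the classifier's weights and biases. Under this protocol, accuracy is commonly read as a measure of how linearly separable the learned representation is.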

David Torpey, Max Parkin, Jonah Alter, Richard Klein, and Steven James "MiDaS: a large-scale Minecraft dataset for non-natural image benchmarking," Journal of Electronic Imaging 33(1), 013035 (5 February 2024). https://doi.org/10.1117/1.JEI.33.1.013035

Received: 8 August 2023; Accepted: 16 January 2024; Published: 5 February 2024

JOURNAL ARTICLE
20 PAGES

KEYWORDS

Machine learning

Performance modeling

Data modeling

Education and training

Image processing

Visualization

Solid state lighting
