Skip to content

Latest commit

 

History

History
67 lines (47 loc) · 2.2 KB

sunspot.md

File metadata and controls

67 lines (47 loc) · 2.2 KB
type title
article
SUNSPOT Dataset

We introduce a new dataset, SUN-Spot, for localizing objects using spatial referring expressions (REs). SUN-Spot is the only RE dataset which uses RGB-D images. It also contains a greater average number of spatial prepositions and more cluttered scenes than previous RE datasets. Using a simple baseline, we show that including a depth channel in RE models can improve performance on both generation and comprehension.

Paper

Preview of Pages 1 and 2 Download the paper here.

Citation

C. Mauceri, M. Palmer, and C. Heckman, “SUN-Spot: An RGB-D Dataset With Spatial Referring Expressions,” in International Conference on Computer Vision Workshop on Closing the Loop Between Vision and Language , 2019.

{% raw %}

@inproceedings{Mauceri2019,
author = {Mauceri, Cecilia and Palmer, Martha and Heckman, Christoffer},
booktitle = {International Conference on Computer Vision Workshop on Closing the Loop Between Vision and Language},
title = {{SUN-Spot: An RGB-D Dataset With Spatial Referring Expressions}},
year = {2019}
}

{% endraw %}

Downloads

Examples

To provide a taste of the images and annotations in SUN-Spot, here are 10 randomly selected objects from the dataset with all of their referring expressions.

{::options parse_block_html="true" /}

{% for object in site.data.sunspot_visualization %}

![{{object.object}}]({{object.url}}){:height="300px"}
{% for refexp in object.refexps %} - {{refexp}} {% endfor %}
{% endfor %}