Skip to content

This repository is for the paper "Learning Human-Human Interactions in Images from Weak Textual Supervision"

License

Notifications You must be signed in to change notification settings

TAU-VAILab/learning-interactions

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Learning Human-Human Interactions in Images from Weak Textual Supervision (ICCV 2023)

Project Page | Paper | Interactive Visualization

This is the official repository for the paper: Morris Alper and Hadar Averbuch-Elor (2023). Learning Human-Human Interactions in Images from Weak Textual Supervision. ICCV 2023

Accompanying Data

See the data documentation for information on using the accompanying data, including:

  • Waldo and Wenda benchmark for HHI understanding
  • IDs for imSitu-HHI subset of the imSitu dataset
  • pHHI (pseudo-labels indicating HHI) for the Who's Waldo dataset
  • Synthetic caption data for training summarization model

Creating Pseudo-Labels

See the pseudo-labeling documentation for information on training the summarization model and using it to generate pseudo-labels for the Who's Waldo dataset. Alternatively, you may use pre-computed pseudo-labels (pHHI) – see above.

Modeling (Training & Pretrained Checkpoint, Inference, Evaluation)

See the modeling documentation for information on training the HHI understanding model (or using a pretrained checkpoint), and running inference and evaluation.

Licence

We release our code under the MIT license. Please see the data documentation for licensing of accompanying data.

Citation

If you find this code or our data helpful in your research or work, please cite the following paper.

@InProceedings{alper2023learning,
    author    = {Morris Alper and Hadar Averbuch-Elor},
    title     = {Learning Human-Human Interactions in Images from Weak Textual Supervision},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    year      = {2023}
}

About

This repository is for the paper "Learning Human-Human Interactions in Images from Weak Textual Supervision"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages