Summary: 5 instances, 5 unique

Text                                                                   Count
# TODO make it support different-sized images                          1
# TODO make this more general                                          1
# FIXME h_boxes takes the last one computed, keep this in mind         1
# TODO use valid to mask invalid areas due to padding in loss          1
# TODO: currently we assume image and label has the same filename but  1