Highly scored unanswered questions - Artificial Intelligence Stack Exchange

12 votes

0 answers

436 views

Extending FaceNet’s triplet loss to object recognition

FaceNet uses a novel loss metric (triplet loss) to train a model to output embeddings (128-D from the paper), such that any two faces of the same identity will have a small Euclidean distance, and ...

CommunityBot

1

modified Mar 16, 2022 at 17:08

6 votes

0 answers

154 views

Are there any easy ways to create annotated training images for object detection?

For the purposes of object detection, are there any easy ways to create annotated training images? For example, if we have $10,000$ images and want to draw bounding boxes on 2 objects for each image, ...

nbro

42.4k

modified Dec 4, 2020 at 10:15

3 votes

0 answers

72 views

How is ground truth provided to each pyramid map in the RetinaNet or YOLOv3 paper? How are feature pyramids mapped to ground truth?

So YOLOv3 and RetinaNet both use feature pyramids, which look something like this: (except b and e which have one ...

Deshwal

263

asked Jan 31, 2021 at 11:14

3 votes

0 answers

59 views

If random rotations are included in the data augmentation process, how are the new bounding boxes calculated?

When studying bounding box-based detectors, it's not clear to me if data augmentation includes adding random rotations. If random rotations are added, how is the new bounding box calculated?

nbro

42.4k

modified Nov 15, 2020 at 22:43
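
For the random-rotation question above, one common way to obtain a new axis-aligned box is to rotate the four corners of the original box and take the smallest axis-aligned rectangle that encloses them. The sketch below assumes boxes are given as `(x_min, y_min, x_max, y_max)` and that the rotation center is supplied by the caller; the function name `rotate_box` is hypothetical and is not taken from any particular detector or library. The resulting box is generally looser than the object it encloses.

```python
import math


def rotate_box(box, angle_deg, center):
    """Rotate an axis-aligned box about `center` and return the enclosing axis-aligned box.

    box:       (x_min, y_min, x_max, y_max)
    angle_deg: rotation angle in degrees, applied with the standard 2D rotation matrix
    center:    (cx, cy) point the rotation is performed around (assumed, e.g. image center)
    """
    x_min, y_min, x_max, y_max = box
    cx, cy = center
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)

    # The four corners of the original box.
    corners = [(x_min, y_min), (x_max, y_min), (x_max, y_max), (x_min, y_max)]

    xs, ys = [], []
    for x, y in corners:
        dx, dy = x - cx, y - cy
        # Standard 2D rotation of the offset vector, then shift back.
        xs.append(cx + dx * cos_t - dy * sin_t)
        ys.append(cy + dx * sin_t + dy * cos_t)

    # Smallest axis-aligned rectangle containing all rotated corners.
    return (min(xs), min(ys), max(xs), max(ys))
```

Because only the corners are rotated, the enclosing box grows for angles that are not multiples of 90 degrees, which is one reason some pipelines restrict rotations to small angles.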

3 votes

0 answers

39 views

Defect Detection System using Deep Learning

What is the general approach to defect detection in deep learning? Would the approach be better if we try to learn the positive images (defects in images) as much as possible or we try to learn the ...

user1538798

131

asked Nov 11, 2019 at 8:38

3 votes

0 answers

125 views

Does RetinaNet's focal loss accomplish its goal?

Taking out the weighting factor, we can define focal loss as $$FL(p) = -(1-p)^\gamma \log(p)$$ where $p$ is the target probability. The idea being that single-stage object detectors have a huge ...

mshlis

2,399

modified Aug 6, 2019 at 23:05
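
The formula quoted in the focal loss question above can be evaluated directly. The sketch below is a minimal scalar version without the weighting factor, where `p` is the probability assigned to the target class; the default `gamma=2.0` is an assumed illustrative value, and the helper `cross_entropy` is added only for comparison.

```python
import math


def focal_loss(p, gamma=2.0):
    """FL(p) = -(1 - p)^gamma * log(p), with p the target-class probability."""
    if not 0.0 < p <= 1.0:
        raise ValueError("p must be in (0, 1]")
    return -((1.0 - p) ** gamma) * math.log(p)


def cross_entropy(p):
    """Plain cross-entropy for the target class, i.e. focal loss with gamma = 0."""
    return -math.log(p)


if __name__ == "__main__":
    # Compare how strongly each loss weights easy (high p) and hard (low p) examples.
    for p in (0.1, 0.5, 0.9):
        print(p, cross_entropy(p), focal_loss(p))
```

With `gamma = 0` the two functions agree, and for larger `gamma` the factor $(1-p)^\gamma$ shrinks the loss of well-classified examples much more than that of misclassified ones.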

3 votes

0 answers

65 views

How should I build an AI that quickly detects falling game assets on screen?

I want to build an AI that plays a simple Android game. The game is just one object falling at a time, sometimes at an angle. The AI needs to recognize the object and decide whether to swipe ...

nbro

42.4k

modified Jul 8, 2019 at 22:58

3 votes

0 answers

30 views

How to voxelize multiple frames at a time and append them together?

I'm trying to implement this approach for object detection and tracking. In this approach, the first step is to voxelize each frame to construct a 3D tensor, the second step is to append multiple voxels ...

nbro

42.4k

modified Jul 1, 2019 at 11:04

3 votes

3 answers

753 views

Face liveness detection using face landmark points

How can I detect the liveness of a face using face landmark points? I am getting face landmarks from Android camera frames, and I want to detect liveness using these landmark points. How to tell if a human ...

CommunityBot

1

modified Apr 21 at 11:06

2 votes

0 answers

146 views

Indoor element detection from floor image

I have a large collection of floor plans that I need to convert into indoor maps by extracting walls as lines, doors as lines, and rooms as polygons. Currently, I do this process manually. The floor ...

hguser

101

modified Aug 22, 2024 at 14:52

2 votes

2 answers

2k views

How does YOLO detect the object when the object is in multiple grid cells?

I have been reading various articles and watching videos on YouTube, but I can't seem to understand one thing. How does YOLO make a bounding box for an object if it is in multiple grid cells? For ...

CommunityBot

1

modified Mar 27 at 17:04

2 votes

0 answers

115 views

What would be a reasonable option for clustering with an unknown number of clusters and a lot of outliers?

I am implementing a CV detection pipeline using SIFT and a KNN matcher. Image keypoints matched to the query keypoints produce the following image: The matched objects have a lot of key ...

nbro

42.4k

modified Dec 4, 2021 at 0:31

2 votes

0 answers

60 views

Can a GIoU loss (generalized intersection over union) be used after an STN module (spatial transformer network)?

I have a model that uses an STN module for number detection and a mean squared error (MSE) loss. But I would like to replace it with GIoU, because MSE doesn't take into account how much of the target area has ...

hanugm

4,062

modified Nov 13, 2021 at 8:55

2 votes

0 answers

83 views

Which method can accurately detect circular/angular shapes? (attached example)

Is there a method to detect shapes like these accurately and efficiently? I have tried the OpenCV Haar cascade classifier, which does not work well. These shapes should all be the same class object ...

Boyd Werkman

21

asked Oct 3, 2021 at 18:15

2 votes

0 answers

116 views

Vector input to CNN for object detection

I am training a 3D object detection network (RetinaNet-based at the moment) for re-detecting tracked objects. I would like to be able to add the velocity vector of the tracked object as an input to ...

DerekG

121

asked Jun 29, 2021 at 14:01
