Skip to main content

Unanswered Questions

102 questions with no upvoted or accepted answers
12 votes
0 answers
436 views

Extending FaceNet’s triplet loss to object recognition

FaceNet uses a novel loss metric (triplet loss) to train a model to output embeddings (128-D from the paper), such that any two faces of the same identity will have a small Euclidean distance, and ...
6 votes
0 answers
154 views

Are there any easy ways to create annotated training images for object detection?

For the purposes of object detection, are there any easy ways to create annotated training images? For example, if we have $10,000$ images and want to draw bounding boxes on 2 objects for each image, ...
3 votes
0 answers
72 views

How are Ground truth provided to each Pyramid map in RetinaNet or YOLOv3 Paper? How is the mapping of Feature Pyramids done to Ground Truth

SO the YOLO V3 and RetinaNet both uses the Feature pyramids which look something like this: (except b and e which have one ...
3 votes
0 answers
59 views

If random rotations are included in the data augmentation process, how are the new bounding boxes calculated?

When studying bounding box-based detectors, it's not clear to me if data augmentation includes adding random rotations. If random rotations are added, how is the new bounding box calculated?
3 votes
0 answers
39 views

Defect Detection System using Deep Learning

What is the general approach to defect detection in deep learning? Would the approach be better if we try to learn the positive images (defects in images) as much as possible or we try to learn the ...
3 votes
0 answers
125 views

Does Retina-net's focal loss accomplish its goal?

Taking out the weighting factor we can define focal loss as $$FL(p) = -(1-p)^\gamma log(p) $$ Where $p$ is the target probability. The idea being that single stage object detectors have a huge ...
3 votes
0 answers
65 views

How should I build an AI that quickly detects falling game assets on screen?

I want to build an AI that plays a simple android game. The game is just a one at a time object falling, some times at an angle. The AI needs to recognize the object and to decide whether to swipe ...
3 votes
0 answers
30 views

How to voxelize multiple frames at the time and append them together?

I'm trying to implement this approach for object detection and tracking. In this approach, the first step is voxelize each frame to construct a 3D tensor, the second step is to append multiple voxels ...
3 votes
3 answers
753 views

Face liveness detection using face landmark points

How to detect liveness of face using face landmark points? I am getting face landmarks from android camera frames. And I want to detect liveness using these landmark points. How to tell if a human ...
2 votes
0 answers
146 views

Indoor elements detect from floor image

I have a large collection of floor plans that I need to convert into indoor maps by extracting walls as lines, doors as lines, and rooms as polygons. Currently, I do this process manually. The floor ...
2 votes
2 answers
2k views

How does YOLO detect the object when the object is in multiple grid cells?

I have been reading various articles and watching videos on YouTube, but I can't seem to understand one thing. How does YOLO make a bounding box for an object if it is in multiple grid cells? For ...
2 votes
0 answers
115 views

What would be a reasonable option for clustering for unknown number of clusters and a lot of outliers?

I am implementing the CV detection pipeline with the use of SIFT and KNN Matcher. Image keypoints matched to the query keypoints produce the following image: The matched objects have a lot of key ...
2 votes
0 answers
60 views

Can a GIoU loss (generalized intersection over union) be used after an STN module (spatial transformer network)?

I have a model that uses an STN module for number detection and Mean Squared Error loss. But I would like to replace it for GIoU, because MSE doesn't take into account how much of the target area has ...
2 votes
0 answers
83 views

Which method can accurately detect circular/angular shapes? (attached example)

Is there a method to detect shapes like these accurately and efficiently? I have tried the OpenCv Haar Casacade Classifier which does not work well. These shapes should all be the same class object ...
2 votes
0 answers
116 views

Vector input to CNN for object detection

I am training a 3D object detection network (Retinanet-based as of the moment) for re-detecting tracked objects. I would like to be able to add the velocity vector of the tracked object as an input to ...

15 30 50 per page
1
2 3 4 5
���
7