Highly scored unanswered questions - Data Science Stack Exchange

6 votes

0 answers

614 views

Adversarial Learning for Semantic Segmentation

I am incorporating Adversarial Training for Semantic Segmentation from Adversarial Learning for Semi-Supervised Semantic Segmentation. The idea is like this: The discriminator takes as input a ...

Pluviophile

4,153

modified Nov 17, 2019 at 15:40

3 votes

1 answer

364 views

How is padding masking considered in the Attention Head of a Transformer?

For purely educational purposes, my goal is to implement a basic Transformer architecture from scratch. So far I have focused on the encoder for classification tasks and assumed that all samples in a batch ...

CommunityBot

1

modified Apr 10 at 15:11
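
As a rough illustration of the padding-mask idea this question asks about (the excerpt is truncated, so this is an assumption about what is being asked, not the asker's code): one common approach is to set the attention scores at padded key positions to negative infinity before the softmax, so those positions receive zero weight. The function name `masked_softmax` and the sample numbers are hypothetical, and plain Python lists stand in for tensors.

```python
import math

def masked_softmax(scores, key_is_pad):
    """Softmax over one row of attention scores, ignoring padded keys.

    scores:      raw attention scores for one query against every key
    key_is_pad:  True where the corresponding key is a padding token
    """
    # Replace scores at padded positions with -inf so exp() yields 0 there.
    masked = [float("-inf") if pad else s for s, pad in zip(scores, key_is_pad)]
    # Subtract the max for numerical stability (assumes at least one real key).
    m = max(masked)
    exps = [math.exp(s - m) for s in masked]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical row: four keys, the last two are padding.
weights = masked_softmax([2.0, 1.0, 0.5, 3.0], [False, False, True, True])
```

In this sketch the padded keys end up with weight exactly zero and the remaining weights still sum to one, even though one padded key had the largest raw score.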

3 votes

0 answers

251 views

Cluster tabular data with text in some columns

Let's say I have the following features in my dataframe: user_id (integer), user_age (integer), is_student (binary), is_graduate (binary), salary (integer), resume (text, up to 1000 symbols). And also a few more ...

Mike

31

asked Mar 14, 2022 at 21:06

3 votes

0 answers

290 views

Struggling to understand/implement Transformer Decoder

I'm struggling to understand the decoder in a Transformer model, specifically with regards to some aspects of its architecture as well as how it actually handles the data during training. What I have ...

cuuupid

131

modified Jun 5, 2021 at 0:13

3 votes

0 answers

865 views

What exactly does a negative/positive value of Captum's Integrated Gradients mean?

I use Captum's Integrated Gradients to interpret my PyTorch neural network. I know that GitHub and the original paper mention that ... Positive attribution score means that the input in that ...

3ORZ

31

asked Jan 8, 2021 at 10:12

3 votes

0 answers

1k views

PyTorch: Train without dataloader (loop through dataframe instead)

I was wondering if it is bad practice to loop through each row in a pandas df instead of using built-in tools such as a dataloader. Let's say I am doing text classification and my training loop looks ...

Isbister

193

asked Oct 5, 2020 at 20:05

3 votes

1 answer

143 views

How to specify version for dependencies so that each one is compatible and stays within a size limit?

I am trying to deploy a web app to Heroku. The free tier is limited to 500 MB. I am using my resnet34 model as a .pkl file. I create the model with it using the fastai ...

CommunityBot

1

modified Jan 29 at 6:06

3 votes

0 answers

136 views

AlexNet Research Paper vs PyTorch and TensorFlow implementation

I'm making my way through Deep Learning research papers, starting with AlexNet, and I found differences in the PyTorch and TensorFlow implementations that I can't explain. In the research paper, ...

Begoodpy

233

modified Jul 9, 2020 at 17:00

3 votes

0 answers

742 views

Explain FastText model using SHAP values

I have trained a fastText model and a fully connected network built on its embeddings. I figured out how to use LIME on it: a complete example can be found in Natural Language Processing Is Fun Part 3: ...

desertnaut

2,154

modified Oct 22, 2020 at 22:12

3 votes

1 answer

346 views

Policy Gradient not "learning"

I'm attempting to implement the policy gradient taken from the "Hands-On Machine Learning" book by Géron. The notebook uses TensorFlow and I'm attempting to do it with PyTorch....

CommunityBot

1

modified Mar 11 at 9:03

3 votes

1 answer

272 views

Is it possible to solve Rubik's cube using DQN?

I'm trying to solve Rubik's cube using deep learning and I came across DQN, so I decided to give it a try. I developed all the code and started training but I got these results: Loss goes up and ...

CommunityBot

1

modified Mar 30 at 11:01

3 votes

0 answers

847 views

Understanding depthwise convolution vs convolution with group parameters in PyTorch

So in the MobileNet-v1 network, depthwise conv layers are used. And I understand that as follows. For an input feature map of (C_in, F_in, F_in), we take only 1 ...

lincr

91

modified Apr 7, 2020 at 6:12
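
To make the comparison in this question concrete, here is a small sketch of how the number of convolution weights changes with the group count. It assumes the usual grouped-convolution weight layout of (C_out, C_in / groups, k, k), which is the layout PyTorch's `Conv2d` uses, and it treats a depthwise convolution as the special case `groups == C_in`. The helper name `conv2d_weight_count` and the channel sizes are hypothetical, and bias terms are ignored.

```python
def conv2d_weight_count(c_in, c_out, kernel_size, groups=1):
    """Number of weights in a 2D convolution with the given group count.

    Each output channel only sees c_in // groups input channels,
    so the weight tensor has shape (c_out, c_in // groups, k, k).
    """
    if c_in % groups != 0 or c_out % groups != 0:
        raise ValueError("c_in and c_out must both be divisible by groups")
    return c_out * (c_in // groups) * kernel_size * kernel_size

# Standard 3x3 convolution: every output channel sees all 32 input channels.
standard = conv2d_weight_count(32, 32, 3)

# Depthwise 3x3 convolution: groups == c_in, so each filter sees one channel.
depthwise = conv2d_weight_count(32, 32, 3, groups=32)
```

With these example sizes the standard layer has 32 * 32 * 9 = 9216 weights and the depthwise layer has 32 * 1 * 9 = 288.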

3 votes

0 answers

897 views

How can I get testing accuracy using tensorboard for Detectron2?

I'm learning to use Detectron2. I've followed this link to create a custom object detector. My training code - ...

mefahimrahman

131

asked Mar 3, 2020 at 6:45

3 votes

1 answer

269 views

Transfer Learning Question: Extending the Functionality of a Multipose-Estimation Machine Learning Model?

I have experimented with a number of different machine learning models used for pose estimation. Most of them output a heatmap and offsets for the detected person(s) in the image. I really like the ...

CommunityBot

1

modified Apr 17 at 17:07

3 votes

0 answers

312 views

Why can't embedding or RNN/LSTM layers handle variable-length sequences?

PyTorch embedding or LSTM layers (I don't know about other DNN libraries) cannot handle variable-length sequences by default. I am seeing various hacks to handle variable length. But my question is, why this ...

sovon

521

asked Aug 9, 2019 at 12:44
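
One way to picture the workaround this question refers to: a batch is usually stored as a rectangular array, so shorter sequences are padded to the length of the longest one and the true lengths are kept alongside. The sketch below uses plain Python lists instead of tensors; `pad_batch`, the pad id of 0, and the sample token ids are all hypothetical.

```python
def pad_batch(sequences, pad_id=0):
    """Pad token-id sequences to a common length.

    Returns the rectangular batch plus the original lengths, which a
    model can later use to ignore the padded positions.
    """
    lengths = [len(seq) for seq in sequences]
    max_len = max(lengths)
    padded = [list(seq) + [pad_id] * (max_len - len(seq)) for seq in sequences]
    return padded, lengths

# Three sequences of different lengths become one 3 x 3 batch.
padded, lengths = pad_batch([[5, 2, 9], [7], [4, 4]])
```

Here `padded` is `[[5, 2, 9], [7, 0, 0], [4, 4, 0]]` and `lengths` is `[3, 1, 2]`.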
