Highly scored unanswered questions - Artificial Intelligence Stack Exchange

16 votes

1 answer

425 views

Will parameter sweeping on one split of data followed by cross validation discover the right hyperparameters?

Let's call our dataset splits train/test/evaluate. We're in a situation where we require months of data. So we prefer to use the evaluation dataset as infrequently as possible to avoid polluting our ...

CommunityBot

1

modified May 1 at 12:11

12 votes

0 answers

436 views

Extending FaceNet’s triplet loss to object recognition

FaceNet uses a novel loss metric (triplet loss) to train a model to output embeddings (128-D from the paper), such that any two faces of the same identity will have a small Euclidean distance, and ...

CommunityBot

1

modified Mar 16, 2022 at 17:08

10 votes

1 answer

974 views

Are Cellular Neural Networks one type of Neural Networks?

I am researching Cellular Neural Networks and have already read Chua's two articles (1988). In cellular neural networks, a cell is only in relation with its neighbors. So it is easy to use them for ...

CommunityBot

1

modified Jan 23 at 1:07

9 votes

2 answers

1k views

Are there transformer-based architectures that can produce fixed-length vector encodings given arbitrary-length text documents?

BERT encodes a piece of text such that each token (usually words) in the input text map to a vector in the encoding of the text. However, this makes the length of the encoding vary as a function of ...

CommunityBot

1

modified Mar 29 at 5:03

9 votes

2 answers

509 views

How can I solve the zero subset sum problem with hill climbing?

I want to solve the zero subset sum problem with the hill-climbing algorithm, but I am not sure I found a good state space for this. Here is the problem: consider we have a set of numbers and we want ...

CommunityBot

1

modified Jan 29 at 19:06

8 votes

0 answers

301 views

Is the Bellman equation that uses sampling weighted by the Q values (instead of max) a contraction?

It is proved that the Bellman update is a contraction (1). Here is the Bellman update that is used for Q-Learning: $$Q_{t+1}(s, a) = Q_{t}(s, a) + \alpha*(r(s, a, s') + \gamma \max_{a^*} (Q_{t}(s', ...

CommunityBot

1

modified Jul 25, 2020 at 8:28

8 votes

0 answers

160 views

Normalizing Normal Distributions in Thompson Sampling for online Reinforcement Learning

In my implementation of Thompson Sampling (TS) for online Reinforcement Learning, my distribution for selecting $a$ is $\mathcal{N}(Q(s, a), \frac{1}{C(s,a)+1})$, where $C(s,a)$ is the number of times ...

nbro

42.4k

modified Dec 20, 2021 at 14:51

8 votes

1 answer

212 views

What is the impact of using multiple BMUs for self-organizing maps?

Here's a sort of a conceptual question. I was implementing a SOM algorithm to better understand its variations and parameters. I got curious about one bit: the BMU (best matching unit == the neuron ...

CommunityBot

1

modified Jan 23 at 15:01

8 votes

0 answers

138 views

What are the current trends/open questions in logics for knowledge representation?

What are the future prospects in near future from a theoretical investigation of description logics, and modal logics in the context of artificial intelligence research?

CommunityBot

1

modified Apr 28, 2019 at 18:02

8 votes

0 answers

1k views

Is there a difference in the architecture of deep reinforcement learning when multiple actions are performed instead of a single action?

I've built a deep deterministic policy gradient reinforcement learning agent to be able to handle any games/tasks that have only one action. However, the agent seems to fail horribly when there are ...

nbro

42.4k

modified Mar 28, 2021 at 1:32

8 votes

1 answer

4k views

Does it make sense to use batch normalization in deep (stacked) or sparse auto-encoders?

Does it make sense to use batch normalization in deep (stacked) or sparse auto-encoders? I cannot find any resources for that. Is it safe to assume that, since it works for other DNNs, it will also ...

CommunityBot

1

modified Apr 7 at 20:13

8 votes

2 answers

974 views

How should we interpret this figure that relates the perceptron criterion and the hinge loss?

I am currently studying the textbook Neural Networks and Deep Learning by Charu C. Aggarwal. Chapter 1.2.1.2 Relationship with Support Vector Machines says the following: The perceptron criterion is ...

CommunityBot

1

modified Mar 24 at 21:06

7 votes

3 answers

232 views

Can training a model on a dataset composed by real images and drawings hurt the training process of a real-world application model?

I'm training a multi-label classifier that's supposed to be tested on underwater images. I'm wondering if feeding the model drawings of a certain class plus real images can affect the results badly. ...

Highen Fintech

1

answered Apr 22 at 13:39

7 votes

2 answers

232 views

How to classify human actions?

I'm quite new to machine learning (I followed the Coursera course of Andrew Ng and now starting deeplearning.ai courses). I want to classify human actions real-time like: Left-arm bended Arm above ...

CommunityBot

1

modified Feb 3 at 15:02

7 votes

0 answers

2k views

How is the rollout from the MCTS implemented in both of the AlphaGo Zero and the AlphaZero algorithms?

In the vanilla Monte Carlo tree search (MCTS) implementation, the rollout is usually implemented following a uniform random policy, that is, it takes random actions until the game is finished and only ...

nbro

42.4k

modified Dec 19, 2021 at 18:14

Stack Exchange Network

Unanswered Questions

Will parameter sweeping on one split of data followed by cross validation discover the right hyperparameters?

Extending FaceNet’s triplet loss to object recognition

Are Cellular Neural Networks one type of Neural Networks?

Are there transformer-based architectures that can produce fixed-length vector encodings given arbitrary-length text documents?

How can I solve the zero subset sum problem with hill climbing?

Is the Bellman equation that uses sampling weighted by the Q values (instead of max) a contraction?

Normalizing Normal Distributions in Thompson Sampling for online Reinforcement Learning

What is the impact of using multiple BMUs for self-organizing maps?

What are the current trends/open questions in logics for knowledge representation?

Is there a difference in the architecture of deep reinforcement learning when multiple actions are performed instead of a single action?

Does it make sense to use batch normalization in deep (stacked) or sparse auto-encoders?

How should we interpret this figure that relates the perceptron criterion and the hinge loss?

Can training a model on a dataset composed by real images and drawings hurt the training process of a real-world application model?

How to classify human actions?

How is the rollout from the MCTS implemented in both of the AlphaGo Zero and the AlphaZero algorithms?

Unanswered Questions

Unanswered Tags