Unanswered Questions
169 questions with no upvoted or accepted answers
5
votes
0
answers
369
views
What are the ways to calculate the error rate of a deep Convolutional Neural Network, when the network produces different results using the same data?
I am new to the object recognition community. Here I am asking about the broadly accepted ways to calculate the error rate of a deep CNN when the network produces different results using the same data....
4
votes
0
answers
84
views
Difficulty in agent's learning with increasing dimensions of continuous actions
I have been working on some RL project, where the policy is controlling the robot using its joint angles.Throughout the project I have noticed some phenomenon, which caught my attention. I have ...
4
votes
0
answers
809
views
How to perform classification with NEAT-Python?
I am trying to do classification using NEAT-python for the first time, and I am having difficulty getting the accuracy rate. I tried the same problem with an ANN and was able to get a good accuracy ...
4
votes
0
answers
367
views
Deep Q-Network (DQN) to learn the game 2048
I am trying to build a Deep Q-Network (DQN) agent that can learn to play the game 2048. I am orientating myself on other programs and articles that are based on the game snake and it worked well (...
4
votes
0
answers
131
views
Supervised K-means clustering doesn't appear to work
I have a data set containing actions taken by customers (e.g., view a product, add a product to cart, purchase product), the product bought (if any) and times of said actions. I am attempting to use K-...
4
votes
2
answers
2k
views
How to perform gradient checking in a neural network with batch normalization?
I have implemented a neural network (NN) using python and numpy only for learning purposes. I have already coded learning rate, momentum, and L1/L2 regularization and checked the implementation with ...
3
votes
0
answers
95
views
Is improving a Neural Network really just "trial and error"?
After asking on StackOverflow, I was redirected here, so I'm reposting this question.
I am a PhD student in Computational Physics and I've started to study a bit of Neural Networks, and decided to try ...
3
votes
0
answers
154
views
Are there Reinforcement Learning algorithms specialized for the case $\gamma=0$?
I have a Reinforcement Learning problem where the optimal policy does not depend on the next state (ie gamma equals 0). I think this means that I only need an efficient exploration algorithm coupled ...
3
votes
0
answers
755
views
How to select good inputs and fitness function to achive good results with NEAT for Icy Tower bot
I'm trying to make a bot to the famous "Icy Tower" game.
I rebuilt the game using pygame and I'm trying to build the bot using Python-NEAT.
Every generation a population of 70 characters ...
3
votes
0
answers
746
views
How does one continue the pre-training in BERT?
I need some help with continuing pre-training on Bert. I have a very specific vocabulary and lots of specific abbreviations at hand. I want to do an STS task. Let me specify my task: I have domain-...
3
votes
0
answers
44
views
Reinforcement Learning on quantum circuit
I am trying to teach an agent to make any random 1-qubit state reach uniform superposition. So basically, the full circuit will be ...
3
votes
0
answers
102
views
Which machine learning algorithms can be used to build a recommendation system?
I am working on building a recommendation engine. I need to build a model that recommends similar items. Currently, I am using the Nearest Neighbor algorithm present in ...
3
votes
0
answers
331
views
Image to image regression in tensorflow
I am working on an image to image regression task which requires me to develop a deep learning model that takes in a sequence of 5 images and return another image. The sequence of 5 images and the ...
3
votes
1
answer
819
views
How to train a LSTM model with multi dimensional data
I am trying to train my model using LTSM layer in Keras (python). I have some problems regarding the data representation and feeding it into the model.
My data is 184 XY coodinates encoded as a numpy ...
3
votes
0
answers
1k
views
Understanding log probabilities of actions in the PPO objective
I'm trying to implement the Proximal Policy Optimization (PPO) algorithm (code here), but I am confused about certain concepts.
What is the correct way to implement log probability of a policy (...