Newest Questions
12,769 questions
1
vote
1
answer
46
views
Do neural networks do wishful thinking?
I will give an example of wishful thinking. When you try to prove a theorem you think what would imply that theorem and maybe try to find a lemma that implies it. Maybe neurons try to connect previous ...
0
votes
0
answers
25
views
Is it feasible and scalable to combine self-replicating automata (after von Neumann) with federated learning and the social web?
Von Neumann’s idea of self-replicating automata describes machines that can reproduce themselves given a blueprint and a suitable environment. I’m exploring a concept that tries to apply this idea to ...
0
votes
0
answers
11
views
How do BPE Tokenizers with `add_prefix_space` handle natural language, such as quotations or poetry, where there won't be a prefix space?
BPE Tokenizers are the standard for modern LLMs. By default, most add_prefix_space, so that John went away is pretokenized to <...
-1
votes
0
answers
19
views
can you recommend an image filename(caption?) generating ai?
2 to 6 words output and filenames should be unique and relevant to the image content.
0
votes
1
answer
37
views
Who argued that we're entering a 4th era of science with machine learning?
I remember reading a reference to a recent paper that argued that science today is in its 4th stage (paradigm?), the era of modelling with machine learning. The 3rd was that of Newton, Kepler, et al.
...
0
votes
0
answers
10
views
Is there other ways than using negative log-likelihood or KL-divergence to compute a loss function?
I've read that the two common ways to express a loss function in ML problems was to start either from the likelihood, then use the negative log likelihood to find a good expression of the loss, or to ...
0
votes
0
answers
17
views
How can AI be leveraged to improve fraud detection in real-time financial transactions?
I'm exploring how AI can help financial institutions enhance their fraud detection systems, particularly in real-time transactions. What are the best approaches, tools, or models for building more ...
0
votes
1
answer
52
views
How do tools like V0.dev provide the project's code context to the AI while minimizing its input tokens?
Under the hood, tools like V0.dev use OpenAI API to generate code. How does V0 know which file context to send to the AI when user makes prompts like "make xyz change/change the button color/add ...
0
votes
0
answers
30
views
Why would the tokenizer for encoder-decoder model for machine translation use bos_token_id == eos_token_id? How does it know when a sequence ends?
I see on this PyTorch model Helsinki-NLP/opus-mt-fr-en (HuggingFace), which is an encoder-decoder model for machine translation:
...
0
votes
0
answers
15
views
Torch gradient estimates disagreeing with analytic and perturbation approximated gradients
I'm faced with a problem where as the title says I'm having trouble with the torch package's built in automatic differentiation algorithms (or my usage?). I think it was meant to be used on mini-...
-1
votes
0
answers
42
views
What are some notable connections between quantum mechanics and machine learning?
I have recently started to learn about machine learning, and I feel like there is a lot of connection to quantum mechanics in it. For example, this idea of the predictin through layers of neurons look ...
0
votes
1
answer
25
views
Different approaches to probability, connection to measure theory and machine learning
I am trying to understand the probability as used in machine learning. So far I understand there is multipile approaches to probability. The two I know of are bayseian and frequntist approach. So far ...
0
votes
2
answers
37
views
Can you train YOLOv11 using the Coral USB Accelerator?
I have a dataset that I want to train a YOLOv11 model on. The problem is, my laptop is too slow and using Google Colab is hard due to the inactivity limit and the training interrupting.
My idea was to ...
0
votes
0
answers
29
views
How to cluster data which has unique distribution for all classes?
I wanted to use MLP to classify my data. But it didn't show good result. When I saw its scatterplot I realized that my data has unique distribution for all classes which makes learning impossible, ...
2
votes
1
answer
43
views
Can the output of a language model be identical to it's training data if finetuned with reference documents also present on the training data?
E.g. Finetuning a language model using text from Wikipedia articles (without modifications) when the language model has Wikipedia data in its training dataset will cause the model to reproduce the ...