Newest Questions

1 vote
1 answer
46 views

Do neural networks do wishful thinking?

I will give an example of wishful thinking. When you try to prove a theorem you think what would imply that theorem and maybe try to find a lemma that implies it. Maybe neurons try to connect previous ...
gha00's user avatar
  • 21
0 votes
0 answers
25 views

Is it feasible and scalable to combine self-replicating automata (after von Neumann) with federated learning and the social web?

Von Neumann’s idea of self-replicating automata describes machines that can reproduce themselves given a blueprint and a suitable environment. I’m exploring a concept that tries to apply this idea to ...
blueberry's user avatar
0 votes
0 answers
11 views

How do BPE Tokenizers with `add_prefix_space` handle natural language, such as quotations or poetry, where there won't be a prefix space?

BPE Tokenizers are the standard for modern LLMs. By default, most add_prefix_space, so that John went away is pretokenized to <...
SRobertJames's user avatar
-1 votes
0 answers
19 views

can you recommend an image filename(caption?) generating ai?

2 to 6 words output and filenames should be unique and relevant to the image content.
Team2 Research's user avatar
0 votes
1 answer
37 views

Who argued that we're entering a 4th era of science with machine learning?

I remember reading a reference to a recent paper that argued that science today is in its 4th stage (paradigm?), the era of modelling with machine learning. The 3rd was that of Newton, Kepler, et al. ...
Geremia's user avatar
  • 525
0 votes
0 answers
10 views

Is there other ways than using negative log-likelihood or KL-divergence to compute a loss function?

I've read that the two common ways to express a loss function in ML problems was to start either from the likelihood, then use the negative log likelihood to find a good expression of the loss, or to ...
Tristan Beruard's user avatar
0 votes
0 answers
17 views

How can AI be leveraged to improve fraud detection in real-time financial transactions?

I'm exploring how AI can help financial institutions enhance their fraud detection systems, particularly in real-time transactions. What are the best approaches, tools, or models for building more ...
Highen Fintech's user avatar
0 votes
1 answer
52 views

How do tools like V0.dev provide the project's code context to the AI while minimizing its input tokens?

Under the hood, tools like V0.dev use OpenAI API to generate code. How does V0 know which file context to send to the AI when user makes prompts like "make xyz change/change the button color/add ...
raspace's user avatar
0 votes
0 answers
30 views

Why would the tokenizer for encoder-decoder model for machine translation use bos_token_id == eos_token_id? How does it know when a sequence ends?

I see on this PyTorch model Helsinki-NLP/opus-mt-fr-en (HuggingFace), which is an encoder-decoder model for machine translation: ...
Franck Dernoncourt's user avatar
0 votes
0 answers
15 views

Torch gradient estimates disagreeing with analytic and perturbation approximated gradients

I'm faced with a problem where as the title says I'm having trouble with the torch package's built in automatic differentiation algorithms (or my usage?). I think it was meant to be used on mini-...
Nomi Mino's user avatar
-1 votes
0 answers
42 views

What are some notable connections between quantum mechanics and machine learning?

I have recently started to learn about machine learning, and I feel like there is a lot of connection to quantum mechanics in it. For example, this idea of the predictin through layers of neurons look ...
Clemens Bartholdy's user avatar
0 votes
1 answer
25 views

Different approaches to probability, connection to measure theory and machine learning

I am trying to understand the probability as used in machine learning. So far I understand there is multipile approaches to probability. The two I know of are bayseian and frequntist approach. So far ...
Clemens Bartholdy's user avatar
0 votes
2 answers
37 views

Can you train YOLOv11 using the Coral USB Accelerator?

I have a dataset that I want to train a YOLOv11 model on. The problem is, my laptop is too slow and using Google Colab is hard due to the inactivity limit and the training interrupting. My idea was to ...
Aliaksei Badnarchuk's user avatar
0 votes
0 answers
29 views

How to cluster data which has unique distribution for all classes?

I wanted to use MLP to classify my data. But it didn't show good result. When I saw its scatterplot I realized that my data has unique distribution for all classes which makes learning impossible, ...
Mojtaba Razavi's user avatar
2 votes
1 answer
43 views

Can the output of a language model be identical to it's training data if finetuned with reference documents also present on the training data?

E.g. Finetuning a language model using text from Wikipedia articles (without modifications) when the language model has Wikipedia data in its training dataset will cause the model to reproduce the ...
user1678860's user avatar

15 30 50 per page