Newest Questions

1 vote

1 answer

46 views

Do neural networks do wishful thinking?

I will give an example of wishful thinking. When you try to prove a theorem you think what would imply that theorem and maybe try to find a lemma that implies it. Maybe neurons try to connect previous ...

gha00

21

asked Apr 25 at 7:36

0 votes

0 answers

25 views

Is it feasible and scalable to combine self-replicating automata (after von Neumann) with federated learning and the social web?

Von Neumann’s idea of self-replicating automata describes machines that can reproduce themselves given a blueprint and a suitable environment. I’m exploring a concept that tries to apply this idea to ...

blueberry

1

asked Apr 24 at 17:32

0 votes

0 answers

11 views

How do BPE Tokenizers with `add_prefix_space` handle natural language, such as quotations or poetry, where there won't be a prefix space?

BPE Tokenizers are the standard for modern LLMs. By default, most add_prefix_space, so that John went away is pretokenized to <...

SRobertJames

63

asked Apr 24 at 12:43

-1 votes

0 answers

19 views

can you recommend an image filename(caption?) generating ai?

2 to 6 words output and filenames should be unique and relevant to the image content.

Team2 Research

1

asked Apr 24 at 9:42

0 votes

1 answer

37 views

Who argued that we're entering a 4th era of science with machine learning?

I remember reading a reference to a recent paper that argued that science today is in its 4th stage (paradigm?), the era of modelling with machine learning. The 3rd was that of Newton, Kepler, et al. ...

Geremia

525

asked Apr 23 at 22:53

0 votes

0 answers

10 views

Is there other ways than using negative log-likelihood or KL-divergence to compute a loss function?

I've read that the two common ways to express a loss function in ML problems was to start either from the likelihood, then use the negative log likelihood to find a good expression of the loss, or to ...

Tristan Beruard

1

asked Apr 22 at 21:51

0 votes

0 answers

17 views

How can AI be leveraged to improve fraud detection in real-time financial transactions?

I'm exploring how AI can help financial institutions enhance their fraud detection systems, particularly in real-time transactions. What are the best approaches, tools, or models for building more ...

Highen Fintech

1

asked Apr 21 at 10:22

0 votes

1 answer

52 views

How do tools like V0.dev provide the project's code context to the AI while minimizing its input tokens?

Under the hood, tools like V0.dev use OpenAI API to generate code. How does V0 know which file context to send to the AI when user makes prompts like "make xyz change/change the button color/add ...

raspace

9

asked Apr 21 at 5:47

0 votes

0 answers

30 views

Why would the tokenizer for encoder-decoder model for machine translation use bos_token_id == eos_token_id? How does it know when a sequence ends?

I see on this PyTorch model Helsinki-NLP/opus-mt-fr-en (HuggingFace), which is an encoder-decoder model for machine translation: ...

Franck Dernoncourt

3,353

asked Apr 20 at 19:42

0 votes

0 answers

15 views

Torch gradient estimates disagreeing with analytic and perturbation approximated gradients

I'm faced with a problem where as the title says I'm having trouble with the torch package's built in automatic differentiation algorithms (or my usage?). I think it was meant to be used on mini-...

Nomi Mino

1

asked Apr 19 at 14:54

-1 votes

0 answers

42 views

What are some notable connections between quantum mechanics and machine learning?

I have recently started to learn about machine learning, and I feel like there is a lot of connection to quantum mechanics in it. For example, this idea of the predictin through layers of neurons look ...

Clemens Bartholdy

119

asked Apr 19 at 6:10

0 votes

1 answer

25 views

Different approaches to probability, connection to measure theory and machine learning

I am trying to understand the probability as used in machine learning. So far I understand there is multipile approaches to probability. The two I know of are bayseian and frequntist approach. So far ...

Clemens Bartholdy

119

asked Apr 18 at 13:46

0 votes

2 answers

37 views

Can you train YOLOv11 using the Coral USB Accelerator?

I have a dataset that I want to train a YOLOv11 model on. The problem is, my laptop is too slow and using Google Colab is hard due to the inactivity limit and the training interrupting. My idea was to ...

Aliaksei Badnarchuk

3

asked Apr 18 at 11:35

0 votes

0 answers

29 views

How to cluster data which has unique distribution for all classes?

I wanted to use MLP to classify my data. But it didn't show good result. When I saw its scatterplot I realized that my data has unique distribution for all classes which makes learning impossible, ...

Mojtaba Razavi

11

asked Apr 17 at 15:32

2 votes

1 answer

43 views

Can the output of a language model be identical to it's training data if finetuned with reference documents also present on the training data?

E.g. Finetuning a language model using text from Wikipedia articles (without modifications) when the language model has Wikipedia data in its training dataset will cause the model to reproduce the ...

user1678860

377

asked Apr 17 at 0:00

Stack Exchange Network

Do neural networks do wishful thinking?

Is it feasible and scalable to combine self-replicating automata (after von Neumann) with federated learning and the social web?

How do BPE Tokenizers with `add_prefix_space` handle natural language, such as quotations or poetry, where there won't be a prefix space?

can you recommend an image filename(caption?) generating ai?

Who argued that we're entering a 4th era of science with machine learning?

Is there other ways than using negative log-likelihood or KL-divergence to compute a loss function?

How can AI be leveraged to improve fraud detection in real-time financial transactions?

How do tools like V0.dev provide the project's code context to the AI while minimizing its input tokens?

Why would the tokenizer for encoder-decoder model for machine translation use bos_token_id == eos_token_id? How does it know when a sequence ends?

Torch gradient estimates disagreeing with analytic and perturbation approximated gradients

What are some notable connections between quantum mechanics and machine learning?

Different approaches to probability, connection to measure theory and machine learning

Can you train YOLOv11 using the Coral USB Accelerator?

How to cluster data which has unique distribution for all classes?

Can the output of a language model be identical to it's training data if finetuned with reference documents also present on the training data?

Hot Network Questions

Newest Questions

Related Tags