
> I am very much a beginner in the space of machine learning

While the (precious and useful) advice around here seems to cover mostly the bigger infrastructures, please note that

you can effectively do an important slice of machine learning work (study, personal research) with just a power-efficient laptop CPU (no GPU), in runs on the order of minutes, on battery. That comes well before any need for "Big Data".

And there are lightweight tools: I am currently enamoured with Genann («minimal, well-tested open-source library implementing feedforward artificial neural networks (ANN) in C», by Lewis Van Winkle), a single C file of about 400 lines compiling to a ~40 kB object, yet quite sufficient to solve a number of the problems you may meet.

https://codeplea.com/genann // https://github.com/codeplea/genann
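
As a taste of how small the API surface is, here is a minimal sketch modeled on the project's own XOR example (the hyperparameters are just illustrative: one hidden layer of two neurons, learning rate 3):

    #include <stdio.h>
    #include "genann.h"

    int main(void) {
        /* XOR truth table: four input pairs and their targets. */
        const double input[4][2] = {{0, 0}, {0, 1}, {1, 0}, {1, 1}};
        const double output[4] = {0, 1, 1, 0};

        /* Network with 2 inputs, 1 hidden layer of 2 neurons, 1 output. */
        genann *ann = genann_init(2, 1, 2, 1);

        /* Plain backpropagation over the four patterns. */
        for (int i = 0; i < 500; ++i)
            for (int j = 0; j < 4; ++j)
                genann_train(ann, input[j], output + j, 3.0);

        for (int j = 0; j < 4; ++j)
            printf("%g XOR %g = %g\n",
                   input[j][0], input[j][1], *genann_run(ann, input[j]));

        genann_free(ann);
        return 0;
    }

Compile it together with the single genann.c file (e.g. cc main.c genann.c -lm) and it trains in milliseconds on any laptop.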

After all, is it a good idea to use tools that automate the optimization process while you are still learning the trade? Only partially. You should build - in general, and even metaphorically - the legitimacy of your Python operations on solid C ground.

And note that you can also build ANNs in R (and in other math or stats environments), if that is what you need or are comfortable with...

Also note - as a reminder - that Prof. Patrick Winston's MIT lectures for the Artificial Intelligence course (classical AI, with a few lessons on ANNs) are freely available. They cover the groundwork before a climb into the newer techniques.




Note that this won't work with reasonably performant CNNs. Passing an image batch through a large-ish ResNet takes half a second on our GPUs and several minutes at full load on a CPU. That makes training infeasible, and most models small enough to run on a CPU are so far from the state of the art that you can't do any worthwhile computer vision research with them.
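
For a rough back-of-envelope (assuming, say, ResNet-50 at about 4 GFLOPs per 224x224 image, with larger ResNets costing a few times more): a batch of 256 images is on the order of 1 TFLOP of work per forward pass. A GPU sustaining several TFLOPS absorbs that in a fraction of a second, while a CPU sustaining tens of GFLOPS on convolutions is roughly two orders of magnitude slower, and a training step (forward plus backward, roughly 3x the work) lands in the minutes range, consistent with the timings above.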


Yes, but note that, on the other hand, simpler setups such as GPUs with single-digit gigabytes of memory, which you could buy and install in your workstation, can be similarly frustrating, because you may easily hit their limits (as in, "I got this semi-specialized equipment and I cannot get an output above 1024x768?!").

So, while one is learning, there is a case for being conservative and working directly with the tools at hand, which is also revealing about scalability requirements, in an optimistic sense: you do not need a full lab to do (reasonable) linear regression, nor to train networks for OCR, and certainly not to get acquainted with the various techniques of the discipline.
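
For instance, the linear-regression case really does fit in a few lines of portable C; a minimal sketch (the sample data here is made up; the closed-form least-squares formulas are the standard ones):

    #include <stdio.h>

    /* Ordinary least squares for y = a*x + b over n points,
       using the closed-form slope/intercept from sample sums. */
    static void ols(const double *x, const double *y, int n,
                    double *a, double *b) {
        double sx = 0, sy = 0, sxx = 0, sxy = 0;
        for (int i = 0; i < n; ++i) {
            sx += x[i]; sy += y[i];
            sxx += x[i] * x[i]; sxy += x[i] * y[i];
        }
        double mx = sx / n, my = sy / n;
        *a = (sxy - n * mx * my) / (sxx - n * mx * mx);
        *b = my - *a * mx;
    }

    int main(void) {
        const double x[] = {1, 2, 3, 4, 5};
        const double y[] = {2.1, 3.9, 6.2, 7.8, 10.1}; /* roughly y = 2x */
        double a, b;
        ols(x, y, 5, &a, &b);
        printf("fit: y = %.3f*x + %.3f\n", a, b);
        return 0;
    }

No GPU, no framework: the whole fit is a handful of multiply-accumulates.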

When the needs grow, sometimes not even high-end consumer equipment will solve your problem, so on the hardware side some practical notion of the actual constraints of scale will already help orientation. You do not need a GPU for most pathfinding (nor, as far as I am aware, for getting a decent grasp of the techniques), and when you want to produce new masterpieces from a Rembrandt "ROM construct"¹ (or much humbler projects), a GPU will not suffice.

(¹reprising the Dixie Flatline module in William Gibson's Neuromancer)


Why start with vision? Do some language models. I used to train those all the time on my laptop.

GPT 5MB for the win. It really works.


...I am curious, now that I know about Fabrice Bellard's LibNC (bellard.org/libnc), whether that «image batch through a large-ish ResNet» would be faster using this library, which can run on both CPU and CUDA...


Fast CPU transformers: https://bellard.org/libnc

Fast CPU convolutions: https://NN-512.com

Both are completely stand-alone (no external dependencies).


> Fast CPU transformers: https://bellard.org/libnc

And, notably, it is from Fabrice Bellard (QEMU, FFmpeg...)

I do not know how you found it: it is not even in his site's index!

--

I see that NN-512 is a personal project of yours: congratulations! Though it seems to be a Go application that generates specialized C code for convolutional NNs... not a general-purpose library, and not for beginners.


FWIW this is the first link on the index page: https://bellard.org/nncp/ which mentions libnc as its underlying ML library.


(Yes, well: NNCP is an attempt to perform lossless data compression through ANNs - quite interesting, and of definite theoretical interest, though not yet practical for its stated purpose, e.g. because of speed and hence, presumably, power efficiency. But the other thing on that page is the invention of water.

It is like: "Let me show you my new idea for a cupboard..." - ok, nice! - "...which I built using a new lightweight, portable, all-purpose, "Fabrice Bellard"-quality Swiss army knife for automation that operates on any material, and which you may use if you want" - YES!? REALLY?... Metaphors do not come close.

This LibNC is an Artificial Intelligence engine signed Fabrice Bellard, in a low-level implementation... It is a "revolution".)





