I think that's true of almost all tech books in paper form.
Personally, I think these work great because I can add cells to inspect data or try experiments easily as I'm reading, which helps me understand what's going on.
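For example, a throwaway cell like this (my own sketch with made-up tensor shapes, not code from the book) is usually all it takes to check what a batch actually looks like before reading on:

    import torch

    # Stand-in for a batch from one of the book's examples (shapes are hypothetical)
    xb = torch.randn(64, 3, 224, 224)   # images: batch x channels x height x width
    yb = torch.randint(0, 10, (64,))    # integer class labels

    # Quick sanity checks: shapes, value statistics, and a peek at the labels
    print(xb.shape, xb.mean().item(), xb.std().item())
    print(yb[:8])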
These sorts of books have existed since the dawn of notebooks. While I think them useful for demonstrations, I don't think there is much pedagogical value beyond what a more static medium offers, in all honesty. Well-worked examples a student can consult while attempting to solve problems on their own are still of paramount importance.
I recommend that everybody who hasn't checked those out do so.
Although I'm not really interested in ML, I do all the most popular available courses to keep up, and I really liked how fastai doesn't just teach you ready-made, well-known models, but also how to compose differentiable building blocks to design NNs yourself.
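As a rough illustration of what that composing looks like in plain PyTorch (a minimal sketch of my own, not code taken from the course or the book), you stack small differentiable modules and let autograd handle the gradients:

    import torch
    from torch import nn

    class TinyBlock(nn.Module):
        """A small differentiable building block: linear layer followed by a nonlinearity."""
        def __init__(self, n_in, n_out):
            super().__init__()
            self.lin = nn.Linear(n_in, n_out)
            self.act = nn.ReLU()

        def forward(self, x):
            return self.act(self.lin(x))

    # Compose blocks into a network; backprop flows through the whole composition.
    model = nn.Sequential(TinyBlock(784, 128), TinyBlock(128, 64), nn.Linear(64, 10))

    x = torch.randn(32, 784)              # a fake batch of flattened 28x28 inputs
    loss = model(x).pow(2).mean()         # dummy loss, just to drive backward()
    loss.backward()
    print(model[0].lin.weight.grad.shape) # every composed block received gradients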
This is not intended to minimize in the slightest the amazing work that Jeremy does; I am a huge fan.
But Fast.ai has TWO co-founders, and somehow Rachel doesn't seem to get any credit in these discussions (not the book specifically; I'm talking about the overall enterprise). Not quite sure why; a lot of the content on the website is written by her, and it's clear she adds a lot of value to the endeavor as a whole.
Thank you for mentioning Rachel! :) She is working as the Founding Director of the Center for Applied Data Ethics nowadays, which is a very full-time job. So she hasn't been involved much in fastai v2 or the book (other than chapter 3, of which she's a co-author).
She created and taught the NLP and Computational Linear Algebra courses, and has written most of the material on the fast.ai blog, and of course (as noted) co-founded fast.ai. Overall, I'd agree that she doesn't get as much credit as she deserves. That's perhaps partly due to her increasing focus on ethics issues, which aren't generally discussed much on HN (sadly).
I also would say that Sylvain Gugger doesn't get as much credit as he should -- he has been an equal partner with me in creating the book and fastai library.
(I discussed this response with Rachel prior to posting it.)
Another smart move from fast.ai: this book is going to be the state-of-the-art reference for 2020 and a classic anthology of algorithmic techniques for the medium term.
> There are also "agglutinative languages", like Polish, which can add many morphemes together to create very long "words" which include a lot of separate pieces of information. [1]
Polish does not work this way. Source: I am Polish. Perhaps jph00 meant Turkish. Issue filed.
Yes you're right, in our NLP course we used Turkish as our example.
But for the book I mentioned Polish due to this paper: https://arxiv.org/abs/1810.10222 . As you say, though, the word "agglutinative" isn't technically correct. I'm actually not sure what the right word is to describe languages that have lots of big compounds with no spaces (which is the key issue here, and why we need subword tokenization techniques).
Polish would be a subtype of synthetic languages called fusional/inflected, which means things need to be adjusted to fit together; agglutinative languages are those that mainly use agglutination, where morphemes are stuck together as-is:
Since it's a spectrum / a categorization based on features, all languages will show these features to various degrees. E.g. the famous "anti|dis|establish|ment|ari|an|ism" in English and "anty|samo|u|bez|przedmiot|owia|nie" as a similar example in Polish (both from https://pl.wikipedia.org/wiki/Aglutynacyjno%C5%9B%C4%87 ), or the more humble "houseboat" or "bitwise".
There are also polysynthetic languages, which is the name for the extreme end of this spectrum, but there are no familiar examples of these (Mayan languages, Ainu, Inuit, and Aleut are the only ones I recognize from those mentioned on Wikipedia).
The term you are looking for may be "highly inflected".
Side note: IMHO, you are exaggerating the ability of Polish to form long compounds. Dissecting the "Bezbarwne zielone idee wściekle śpią" example from https://arxiv.org/pdf/1810.10222.pdf#page=3 reveals no words longer than 4 morphemes:
bez-BARW-n-e ZIEL-on-e IDE-e WŚCIEK-l-e ŚP-ią, where I put word roots in uppercase and bound morphemes in lowercase.
The longest sequences of morphemes (for a loose definition of morpheme) I can think of are conditional-mood forms of verbs with double prefixes, like po-wy-CHODZI-ł-y-by-ście. However, the sequences of bound morphemes in those forms, which may look complex to you, form a finite-state language that admits just a few sequences.
It's not about the number of letters in the compounds, but about the number of morphemes.
Your "powychodziłybyście" example could be translated as "you (feminine, plural) would have been going out". With the word tokenization, you get (ignoring comma and brackets) 8 tokens in English and one token in Polish. Now you can have three persons, two genders, two numbers, an imperfective or perfective verb, etc. resulting in combinatorial growth of word tokens in Polish. If you have all word forms for "go out" and you want to add "go in", in English you would add a single token "in", and in Polish you add all the tokens with "-wy-" replaced by "-w-". As a result in Polish you end up with much bigger vocabulary. Additionally you need bigger training corpus as you cannot learn the tokens independently. For example, if you know the meaning of "he ate" and "she wrote", you should be able to guess the meaning of "he wrote", as you've seen all of the tokens. In Polish it's "Zjadł", "Napisała" and "Napisał" - all of the word tokens are different.
Using subword tokenization instead of word-level tokenization is kind of similar to using a normalized database instead of an unnormalized one. It's not about one form being more complex than the other, as they're equivalent. After all, would written English be much more complex if we removed all whitespace? :)
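To make the vocabulary point concrete, here is a tiny toy sketch of mine (the Polish forms and the hand-made subword split are only illustrative; in practice the pieces would be learned by something like BPE or SentencePiece):

    # Word-level tokens: adding "go in" on top of "go out" costs English one new
    # token ("in"), but costs Polish a whole new set of word forms.
    english_out = {"he", "she", "they", "went", "out"}
    english_in  = english_out | {"in"}                        # +1 token

    polish_out = {"wyszedł", "wyszła", "wyszli"}              # he/she/they went out
    polish_in  = polish_out | {"wszedł", "weszła", "weszli"}  # +3 tokens, nothing shared

    print(len(english_in) - len(english_out),                 # 1
          len(polish_in) - len(polish_out))                   # 3

    # Subword tokenization factors the Polish forms into reusable pieces
    # (split by hand here just to show the idea):
    pieces = {"wyszedł": ("wy", "szedł"), "wszedł": ("w", "szedł"),
              "wyszła":  ("wy", "szła"),  "weszła": ("we", "szła"),
              "wyszli":  ("wy", "szli"),  "weszli": ("we", "szli")}
    shared = {p for parts in pieces.values() for p in parts}
    print(sorted(shared))  # stems are shared again; only the short prefixes differ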
I agree with what you wrote. I did not object to the subword tokenization that let you(?) win the competition. I objected to the GP's assertion that one can add many morphemes together to create very long "words" in Polish, which makes casual readers think of stringing morphemes together like German compounds, while the number of morphemes in Polish words is bounded by 7, maybe 8.
Not really. "German grammar allows for the construction of long compounded noun phrases which are expressed as one word in written language. Compounding is not really the same as agglutination.": https://www.quora.com/Is-German-considered-a-true-agglutinat...
It always amazes me how bad some technical people are at basic promotion:
What is Fastai?
Why do I need it?
Something as basic as an elevator pitch that introduces your product on your GitHub page and in the book intro can mean 10x or 100x more sales.
If you force people to go searching for what it is themselves, you have already lost most of them.
This author writes as if you already knew everything about Fastai, but if you did, you would not need this book in the first place.
This happens a lot with technical writers: they have spent years thinking about a topic, so they cannot put themselves in the shoes of someone who hasn't.
It's sad to see this perspective. With the AI hype, there are so many people spending all their time and money on marketing AI materials while their actual product is all smoke and mirrors.
By contrast, Jeremy and team have proven that "build it and they will come" is not dead. They built high quality courses and quickly became authoritative with no marketing and with full transparency and openness in everything they do.
This book draft looks great. Everyone else is talking about "democratising AI" - this is actually doing it.
We're building and improving our internal machine learning platform, and we recently decided to support the fast.ai courses. You get everything you need (notebooks, object storage, data, parameter and metrics tracking, and deployment). Our colleague teaches at a university, and we're opening the internal platform to about 30 of her students this week to prepare their final master's projects.
They don't have access to compute power (GPUs) or the bandwidth to download datasets of hundreds of gigabytes, which they'll find right there, so this should help them: they don't need powerful machines or to worry about experiment tracking.
We also have a Publish option that turns a notebook into an application in one click, with a form for the training parameters generated automatically behind the scenes, so they can write scripts and instrument model training.
The fast.ai course will also help current or future members, and other students. It's important for us to make it even easier for people to enter the field.
You are being downvoted by fanboys, but you are exactly right. I am surrounded by researchers working in DL, and I have to say at least 40% of them have never heard of FastAI or Jeremy Howard. However, folks who are active on Twitter, listening to popular podcasts, popular media, HN, etc. would be very familiar with the name Jeremy Howard and what FastAI is, and need no introduction. In the research world, an astonishing number of good researchers have little to no online presence. They have little to no time for anything other than keeping track of research papers in their sub-field. It also surprises me when authors sweat for months to churn out hundreds of polished pages but couldn't spend 15 minutes to write a paragraph of introduction in the README.
I guess it depends a bit on which field exactly they work in. I'd be rather surprised if rigorous DL researchers in NLP hadn't heard of him, because I expect "Universal language model fine-tuning for text classification" (and tbh also "Fine-tuned language models for text classification", due to the universality of the idea) to show up in any half-decent literature review of the field.
Most DL researchers I know also have a pretty good knowledge of available libraries and make it a habit to check them pretty often.
Fast.ai really democratizes bleeding-edge research for the masses, though; that's why it's popular among outcasts and outsiders. In general, I would be more wary of people working within closed environments and organizations than of people making everything they do public and open to review.
fastai is popular among practitioners as well as many researchers, rather than just outsiders or outcasts. I've personally learned a lot from it, and it is an amazing contribution. However, there is still a large population that is unaware of it, and it would be great to have a quick intro paragraph in the readme so they know what all the fuss is about.
Few people here need to look it up; the basic promotion has already been done very effectively for the target audience. Besides, the intro chapter explains very clearly what the book is about and who it's for.
This is a draft. I expect that, when the first finished version is ready, the authors will promote it effectively (IMO they are very good at promoting their courses at fast.ai).
Wow, O'Reilly lawyers are determined to screw this up. The thing is GPLv3 licensed, which means I can't copy any of the book code into my closed-source product, into competitions, or even into MIT-licensed code. The readme says I cannot make copies of this material but it's ok to fork. Huh?
No, the readme says you can make copies for personal use.
If you want to use code in the book under a non GPL license, then you could just buy the book when it comes out. That doesn't seem like an unreasonable burden.
PS: none of this is anything to do with O'Reilly or their lawyers.
Wait... so if you buy the book then it ceases to be GPLed? This is quite confusing. For DL research, most code is MIT licensed, and legal folks at many industrial labs would be quite hesitant to permit use of code from this repo, which feels like a legal minefield with different restrictions spread over multiple places, including the LICENSE, the README, the fastai website, and perhaps the printed book. I would highly recommend converting to one simple MIT license and calling it a day (except for the markdown cells).
I don't get this perspective about the GPL. Look, they are giving you something for free, including the source code and the right to build upon it and publish modified versions. You can do basically whatever you want with it, as long as you pass on the freedoms that were granted to you. Is that unfair? Enjoying getting freedoms but not passing them on is not nice.
I get the GPL and fully appreciate its philosophy. The problem happens when you actually use it in practice. Because of its viral nature, anyone with different licensing must convert to GPL if they use your code. For many scenarios this is simply not possible: not just because of commercial secrets, but because of the potential for exposing security vulnerabilities when you don't have the resources to deal with them, or competitions where you must keep code secret for some time, or simply because you have dependencies on other code that are very expensive to get rid of. Due to this reason, many companies forbid the use of GPL-licensed software as well as releasing anything under it (because then you can't use your own code!). Many other companies simply don't want the headache of checking their mess of legacy codebases, with a myriad of dependencies, that would be hard to untangle into a GPL-compatible open-source release. The legal and economic overhead when you use or release GPLed code is non-trivial. For this reason, the vast majority of open-source code released by big tech companies on GitHub is MIT/BSD licensed, which ironically is "freer" than the GPL.
> The problem happens when you actually use it in practice.
I think it's fair for any code publisher to require that the freedoms they give with their code never get taken away, and that it or its modified versions can never get locked up or used in opposition to the wishes and interests of the users (i.e., the users retain ultimate control over the behavior, by being able to modify the code).
> Due to this reason, many companies forbid the use of GPL-licensed software as well as releasing anything under it (because then you can't use your own code!)
It seems you have a misunderstanding here, and I think it's a common one. You can use your own code in any way you want. You own the copyright, you decide the rules. And you don't need any agreements with yourself. Further, you can release your code to multiple people, each with any license you want. You can also sell proprietary licenses to companies that prefer it, while also releasing the same code under a GPL license to the public.
Definitely excited to check this out; thanks Jeremy and Sylvain!