More

sabareesh · 2026-01-30T17:27:05 1769794025

Tesla have their own Insurance product which is already very competitive compared to other providers. Not sure if lemonade can beat them . Tesla's insurance product has similar objective in place already where it rewards self driving over manual driving.

kjksf · 2026-01-30T18:07:10 1769796430

Tesla is cooperating with Lemonade on this by providing them necessary user driving data.

If Tesla didn't want Lemonade to provide this, they could block them.

Strategically, Tesla doesn't want to be an insurer. They started the insurance product years ago, before Lemonade also offered this, to make FSD more attractive to buyers.

But the expansion stalled, maybe because the state bureaucracy or maybe because Tesla shifted priority to other things.

In conclusion: Tesla is happy that Lemonade offers this. It makes Tesla cars more attractive to buyers without Tesla doing the work of starting an insurance company in every state.

mullingitover · 2026-01-30T18:13:39 1769796819

> But the expansion stalled, maybe because the state bureaucracy or maybe because Tesla shifted priority to other things.

If the math was mathing, it would be malpractice not to expand it. I'm betting that their scheme simply wasn't workable, given the extremely high costs of claims (Tesla repairs aren't cheap) relative to the low rates that they were collecting on premiums. The cheap premiums are probably a form of market dumping to get people to buy their FSD product, the sales of which boosts their share price.

Veserv · 2026-01-30T20:31:11 1769805071

It was not workable. They have a loss ratio of >100% [1], as in they paid out more in claims than received in premiums before even accounting for literally any other costs. Industry average is ~60-80% to stay profitable when including other costs.

They released the Tesla Insurance product because their cars were excessively expensive to insure, increasing ownership costs, which was impacting sales. By releasing the unprofitable Tesla Insurance product, they could subsidize ownership costs making the cars more attractive to buy right now which pumped revenues immediately in return for a "accidental" write-down in the future.

[1] https://peakd.com/tesla/@newageinv/teslas-push-into-insuranc...

redanddead · 2026-01-30T21:08:58 1769807338

Who was paying for this?

Onavo · 2026-01-30T21:41:25 1769809285

You as the consumer when you buy a tesla car that's twice the price of what you can get it for in Asia. Teslas are very cheap to produce.

Remember with their own insurance they also have access to the parts at cost.

Veserv · 2026-01-30T22:16:10 1769811370

That is not true. Since Tesla was losing money on their insurance to boost sales the customers were not paying for it since they were receiving a service for below cost.

The people paying were actually the retirement funds who fronted Tesla's cash reserves when they purchased Tesla stock and the US government paying for it in the form of more tax credits on sales that would not have otherwise materialized without this financial fraud. But do not worry, retirement funds and the US government may have lost, but it boosted Tesla sales and stock valuation so that Elon Musk could reach his KPIs to get his multiple tens of billions of dollars of payout.

redanddead · 2026-01-30T22:40:44 1769812844

Wow this went deep ahaha

redanddead · 2026-01-30T21:08:11 1769807291

The math should've mathed. Better data === lower losses right? They probably weren't able to get it to work quite right on the tech side and were eating fat losses during an already bad time in the market.

It'll come back.

Lemonade or Tesla if you find this, let's pilot, i'm a founder in sunnyvale, insurtech vertical at pnp

redanddead · 2026-01-30T21:04:29 1769807069

You'd be very surprised. Distribution works wonders. You could have a large carrier taking over Tesla's own vehicles in markets they care about. The difference then would be loss ratios on the data collection, like does LIDAR data really beat Progressive Snapshot?

The two are measuring data for different sources of losses for carriers.

sabareesh · 2026-01-06T20:26:09 1767731169

I am looking for some open source terminal for iphone .I have code server running which i can just use terminal from vs code on safari

sabareesh · 2026-01-05T14:54:26 1767624866

I have switched to terminal

xpe · 2026-01-05T15:29:35 1767626975

Do you mean a terminal-based editor, like emacs, vim, neovim, or helix? (I quite like the latter, after having used all the former to some degree.)

Or do you mean line-editors? They have gotten impressively good. See rustyline (based on linenoise) and reedline (not a typo; developed by the Nushell team) for example. Way better than one might expect!

[1]: https://github.com/kkawakam/rustyline

[2]: https://github.com/antirez/linenoise

[3]: https://github.com/nushell/reedline

sabareesh · 2026-01-05T17:27:26 1767634046

Sorry to disappoint. But purely codex and claude code

vegabook · 2026-01-05T15:19:53 1767626393

am I the only one who is just fine with neovim?

bluecalm · 2026-01-05T16:00:04 1767628804

I love it. I have 40 files open in it right now - most of them .c source files with LSP and Treesitter running and it uses 250MB of RAM. Everything works instantly, Telescope is imo the new standard for file navigation (and navigation in general: grep, symbols even neovim shortcuts). Git integration is fantastic as well.

I have spent some time configuring it and probably will spend more when I start including more languages but imo it's worth it. You can configure everything but you can also find very nice defaults by running Kickstarter (or some heavier neovim "distro").

Microsoft has done great work with LSPs - I can now get great navigation/autocompletion/formatting/inline errors/warning combined with neovim navigation, light weight and fantastic tools/extensions.

One thing I haven't integrated yet is a debugger (gdb from the terminal is good enough for me), maybe that's something people are missing in neovim?

adhamsalama · 2026-01-05T18:23:48 1767637428

I use Astronvim and it has a not-so-bad debugger support, when it works...

sabareesh · 2026-01-03T05:55:56 1767419756

TL;DR is that they didn't clean the repo (.git/ folder), model just reward hacked its way to look up future commits with fixes. Credit goes to everyone in this thread for solving this: https://xcancel.com/xeophon/status/2006969664346501589

(given that IQuestLab published their SWE-Bench Verified trajectory data, I want to be charitable and assume genuine oversight rather than "benchmaxxing", probably an easy to miss thing if you are new to benchmarking)

https://www.reddit.com/r/LocalLLaMA/comments/1q1ura1/iquestl...

ofirpress · 2026-01-03T05:59:37 1767419977

As John says in that thread, we've fixed this issue in SWE-bench: https://xcancel.com/jyangballin/status/2006987724637757670

If you run SWE-bench evals, just make sure to use the most up-to-date code from our repo and the updated docker images

LiamPowell · 2026-01-03T06:27:18 1767421638

> I want to be charitable and assume genuine oversight rather than "benchmaxxing", probably an easy to miss thing if you are new to benchmarking

I don't doubt that it's an oversight, it does however say something about the researchers when they didn't look at a single output where they would have immediately caught this.

domoritz · 2026-01-03T10:56:32 1767437792

So many data probes would be solved if everyone looked at a few outputs instead of only metrics.

alyxya · 2026-01-04T00:09:16 1767485356

Given the decrease in the benchmark score from the correction, I don't think you can assume they didn't check a single output. Clearly the model is still very capable and the model cheating its results didn't affect most of the benchmark.

stefan_ · 2026-01-03T09:26:42 1767432402

Never escaping the hype vendor allegations at SWEbench are they.

sabareesh · 2025-12-30T00:10:47 1767053447

Non starter for us, we cant ship propriety data to a third party servers.

austinbaggio · 2025-12-30T01:20:12 1767057612

I assume this is with work? And also assume you do send data, you just need some service agreement or something like with AWS or Microsoft for GH?

sabareesh · 2025-12-17T23:49:39 1766015379

Watch out these model are hallucinating lot more https://artificialanalysis.ai/evaluations/omniscience?omnisc...

joecarpenter · 2025-12-17T23:52:20 1766015540

Isn't it the opposite? From the link: Scores range from -100 to 100, where 0 means as many correct as incorrect answers, and negative scores mean more incorrect than correct.

Gemini 3 Flash scored +13 in the test, more correct answers than incorrect.

sabareesh · 2025-12-18T00:07:09 1766016429

Nope lower is better compared to recent open ai models this is bad. I am looking at AA-Omniscience Hallucination Rate

nemonemo · 2025-12-17T23:55:19 1766015719

One thing I don't understand is how come Gemini Pro seems much cheaper than Gemini Flash in the scatter graph.

andai · 2025-12-17T23:53:50 1766015630

This model has the best score on that benchmark.

Edit: Huh... It does score highest in "Omniscience", but also very high in Hallucination Rate (where higher score is worse)...

sabareesh · 2025-12-18T00:07:35 1766016455

this has one of the worse score in AA-Omniscience Hallucination Rate

sabareesh · 2025-12-09T22:14:14 1765318454

So is 10,000 IU of daily does ok ?

seba_dos1 · 2025-12-10T04:42:04 1765341724

If you start from low levels, then yes, as long as you keep your blood levels in check. It would take a while to overdose it this way, but it's not impossible.

IAmBroom · 2025-12-10T16:26:11 1765383971

Cite?

woleium · 2025-12-09T22:23:43 1765319023

yes, but not long term, from what i’ve read. do it for 90 days through winter maybe?

anamexis · 2025-12-10T01:10:44 1765329044

What have you read?

woleium · 2025-12-15T16:25:37 1765815937

i cant find it now. It was an article that suggested you may underestimate your natural production, especially in summer, so its safer to use high doses for a period of 3 months or so with a break of the same duration, after all we are likely “accustomed “ to fluctuating supply through the year.

red-iron-pine · 2025-12-10T14:03:42 1765375422

for a couple weeks? probably.

I would not take that much consistently unless you're in the arctic circle and its winter

sabareesh · 2025-11-10T00:03:54 1762733034

Technically you kind of get this in Nevada when using Tesla insurance and if you drive 100 % FSD. If you drive manually you are pretty much doxed for random Front collision Warning which is super sensitive

whoisthemachine · 2025-11-10T13:53:07 1762782787

That does sound like a punishment for not using it. My statement was more whether they actively sell you a discount for using it.

sabareesh · 2025-10-22T22:18:52 1761171532

It might be that our current tokenization is inefficient compared to how well image pipeline does. Language already does lot of compression but there might be even better way to represent it in latent space

ACCount37 · 2025-10-22T22:26:28 1761171988

People in the industry know that tokenizers suck and there's room to do better. But actually doing it better? At scale? Now that's hard.

typpilol · 2025-10-22T22:55:16 1761173716

It will require like 20x the compute

ACCount37 · 2025-10-23T00:42:54 1761180174

A lot of cool things are shot down by "it requires more compute, and by a lot, and we're already compute starved on any day of the week that ends in y, so, not worth it".

If we had a million times the compute? We might have brute forced our way to AGI by now.

Jensson · 2025-10-23T00:52:42 1761180762

But we don't have a million times the compute, we have the compute we have so its fair to argue that we want to prioritize other things.

Mehvix · 2025-10-23T00:38:44 1761179924

Why do you suppose this is a compute limited problem?

ACCount37 · 2025-10-23T01:04:11 1761181451

It's kind of a shortcut answer by now. Especially for anything that touches pretraining.

"Why aren't we doing X?", where X is a thing that sounds sensible, seems like it would help, and does indeed help, and there's even a paper here proving that it helps.

The answer is: check the paper, it says there on page 12 in a throwaway line that they used 3 times the compute for the new method than for the controls. And the gain was +4%.

A lot of promising things are resource hogs, and there are too many better things to burn the GPU-hours on.

typpilol · 2025-10-23T07:16:52 1761203812

Thanks.

Also, saying it needs 20x compute is exactly that. It's something we could do eventually but not now

kenjackson · 2025-10-23T00:48:02 1761180482

Why so much compute? Can you tie it to the problem?

typpilol · 2025-10-23T07:18:56 1761203936

Tokenizers are the reason LLMs are even possible to run at a decent speed on our best hardware.

Removing the tokenizer would 1/4 the context and 4x the compute and memory, assuming an avg token length of 4.

Also, you would probably need to 4x the parameters to have to learn understanding between individual characters as well as words and sentences etc.

There's been a few studies on small models, even then those only show a tiny percentage gain over tokenized models.

So essentially you would need 4x compute, 1/4 the context, and 4x the parameters to squeeze 2-4% more performance out of it.

And that fails when you use more then 1/4 context. So realistically you need to support the same context, so you r compute goes up another 4x to 16x.

That's why

ashirviskas · 2025-10-25T23:22:44 1761434564

This has a ton of seemingly random assumptions, why can't we compress multiple latent space representations into one? Even in simple tokenizers token "and" has no right being the same size as "scientist".

kenjackson · 2025-10-23T15:36:22 1761233782

Thanks. That helps a lot.

CuriouslyC · 2025-10-22T22:37:46 1761172666

Image models use "larger" tokens. You can get this effect with text tokens if you use a larger token dictionary and generate common n-gram tokens, but the current LLM architecture isn't friendly to large output distributions.

yorwba · 2025-10-23T03:41:33 1761190893

You don't have to use the same token dictionary for input and output. There's things like simultaneously predicting multiple tokens ahead as an auxiliary loss and for speculative decoding, where the output is larger than the input, and similarly you could have a model where the input tokens combine multiple output tokens. You would still need to do a forward pass per output token during autoregressive generation, but prefill would require fewer passes and the KV cache would be smaller too, so it could still produce a decent speedup.

But in the DeepSeek-OCR paper, compressing more text into the same number of visual input tokens leads to progressively worse output precision, so it's not a free lunch but a speed-quality tradeoff, and more fine-grained KV cache-compression methods might deliver better speedups without degrading the output as much.

mark_l_watson · 2025-10-23T03:29:30 1761190170

Interesting idea! Haven’t heard that before.

sabareesh · 2025-10-20T21:33:34 1760996014

Similar feeling. Seems it is good at certain things and if something doesnt work it want to do things simply and in turn becomes something that you didnt ask for and certain times opposite of what you wanted. On the other hand with codex certain time you feel the AGI but that is like 2 out of 10 sessions. This is primarily may be due to how complete the prompt and how well you define the problems.