More

another-cuppa · on Oct 18, 2018

Pipenv is hopelessly slow. It's a shame. Remember when git first came out and it changed the way we worked because it was so quick to commit now? (I fully expect that most git users here don't remember that, actually). There is no going back. I will not use slow tools. My tools need to be at the very least as fast as me.

wiremine · on Oct 18, 2018

> Pipenv is hopelessly slow.

Interesting, this has never been a problem for me. I've built some large tools and while it isn't fast, it's always completed in a few minutes.

iainmerrick · on Oct 18, 2018

A few minutes??! That sounds very slow.

wiremine · on Oct 18, 2018

To be clear: with few deps it's very fast for me, it's just lager projects with LOTS of non-trivial deps where it can slow up.

another-cuppa · on Oct 18, 2018

What OS?

wiremine · on Oct 18, 2018

Mid-2015 MacBook Pro running the newest OS

another-cuppa · on Oct 18, 2018

It is abundantly clear that the pipenv developers use MacOS so I wonder if it's an OS dependent thing.

y4mi · on Oct 18, 2018

my current project is at 16 dependencies atm and ... its really not as bad as you make it sound.

    pipenv lock  5.65s user 0.29s system 77% cpu 7.639 total

i think 7.6 seconds is fine for an operation that you'd rarely do

it would probably take ages at work though. just opening a WSL terminal takes several seconds there, which is predictably instantaneous (<100ms) on fedora linux at home

SJetKaran · on Oct 18, 2018

SSD vs HDD may be?

another-cuppa · on Oct 17, 2018

B-trees are really beautiful. I also like the Aho-Corasick algorithm as used by fgrep. I actually started to reinvent this algorithm myself before finding out it was already done. It's essentially a way to add links to a trie such that you can find all occurrences of multiple substrings within a larger string with one pass through the larger string.

another-cuppa · on Oct 17, 2018

K-means is not an algorithm, it's a heuristic for an Np-hard problem.

n4r9 · on Oct 17, 2018

It is absolutely an algorithm in the sense of "a set of rules to be followed". I think you mean that it doesn't guarantee an optimal solution. That just means it's a heuristic algorithm, same as simulated annealing is a heuristic algorithm for solving optimisation problems.

another-cuppa · on Oct 17, 2018

Nope. An algorithm has to be effective. You can find pathological cases for k-means such that it will never converge on anything useful. So if you set your termination case to be convergence it will never terminate and if you don't then it will never be effective.

zaphar · on Oct 17, 2018

I think you might be in the minority in this opinion. Many algorithms have pathological cases but are still considered algorithms

another-cuppa · on Oct 17, 2018

Minority? This is directly from Knuth.

bstamour · on Oct 17, 2018

Knuth defines effectiveness as: "... all of the operations to be performed in the algorithm must be sufficiently basic that they can in principle be done exactly and in a finite length of time by a man using paper and pencil."

K-means and other heuristic algorithms fit that description.

skykooler · on Oct 17, 2018

BogoSort is an algorithm. Not a very good algorithm, but an algorithm nevertheless.

another-cuppa · on Oct 17, 2018

No it absolutely is not.

The lack of fundamental computer science knowledge in this thread is alarming.

another-cuppa · on Oct 17, 2018

Negative 2 points on a post saying that a computational method that possibly never terminates is not an algorithm... Oh dear...

wnoise · on Oct 17, 2018

It never terminates with probability 0.

n4r9 · on Oct 17, 2018

As far as I can tell you're only arguing against poor implementations of K-means. If you demand that the score strictly improves at each iteration then the algorithm must terminate.

another-cuppa · on Oct 17, 2018

And how do you "demand that the score strictly improves"? It's an NP-hard problem.

n4r9 · on Oct 17, 2018

K-means implementations generally terminate once there's an iteration where the score doesn't improve. This happens when there is convergence to a local minimum or - less likely - the state hops between two nearby local minima with the same score. But it will terminate on something, and most of the time that something will be pretty good.

I saw your mention of Knuth elsewhere, I looked it up and he demanded that

> An algorithm must always terminate after a finite number of steps ... a very finite number, a reasonable number

This is a pretty niche characterization and almost certainly not what the original post was asking for. However, I concur that there is no guarantee on how quickly K-means terminates or on how good the output will be,. But... if you're going to be that strict about it you would even have to rule out the Simplex Algorithm, which everyone I've ever spoken to thinks of as an algorithm.

billfruit · on Oct 17, 2018

In that sense kmeans may be better referred to as a 'computational method' rather than an algorithm.

another-cuppa · on Oct 17, 2018

Indeed.

wnkrshm · on Oct 17, 2018

Isn't a method that gives an approximate or best-fit estimate to a problem still an algorithm, if it terminates?

another-cuppa · on Oct 17, 2018

No. You can't prove that k-means does anything useful.

alanbernstein · on Oct 17, 2018

Is the definition of "algorithm" that you're using here useful?

another-cuppa · on Oct 17, 2018

It's one of the most fundamental concepts in computer science and underpins decades of research. You can decide if it's useful.

mindcrime · on Oct 17, 2018

This isn't a classroom, and your pedantry isn't adding anything useful to the conversation. We all understand these pedantic quibbles you're arguing about... and what the community is more or less collectively saying is "in this context, we don't care about the distinction between an 'algorithm' in the textbook sense, and a 'heuristic' in the textbook sense".

another-cuppa · on Oct 17, 2018

Nah. Most of them don't understand the difference. If you did you wouldn't can it pedantry.

I personally don't find heuristics beautiful. That's why I commented.

n4r9 · on Oct 18, 2018

To be fair, you haven't explained at all clearly why you don't think k-means adheres to Knuth's notion of an algorithm.

Your objection seems to be

> You can find pathological cases for k-means such that it will never converge on anything useful

As has been pointed out more than once, a good implementation of k-means is guaranteed to terminate in a finite time. And whatever you mean by "useful" doesn't seem to appear in Knuth's definition of an algorithm.

another-cuppa · on Oct 15, 2018

Really? HN is probably the most feminised tech forum that's ever existed.

cimmanom · on Oct 15, 2018

Indeed it is. And yet I wouldn't describe it as remotely feminized, only slightly less hostile to women than, say, Slashdot.

IMO, that says more about tech forums and the tech community than it does about HN.

another-cuppa · on Oct 15, 2018

What actually would HN be like if it were perfect in your eyes? What would tech be like?

matt4077 · on Oct 15, 2018

decent.

another-cuppa · on Oct 16, 2018

Right. You have absolutely no idea.

another-cuppa · on Oct 15, 2018

It's only OK to parse something with regex if it was defined with regex. Far too often I see people wanting to match postcodes and things which are not defined by regex and your heuristic could break at any time.

zamadatix · on Oct 15, 2018

What about postcodes makes heuristics break for regex but not a general parser? I'd assume if the format changed it'd break both all the same.

another-cuppa · on Oct 15, 2018

Did you mean to start the video half way through?

another-cuppa · on Oct 14, 2018

Why does it keep mentioning .so? Nobody refers to it like that. Just say dynamically link.

another-cuppa · on Oct 11, 2018

> Why would it be public domain?

Copyright is supposed to protect creative works. There is such a thing as a threshold of originality. There are database rights, but that is separate from copyright.

another-cuppa · on Oct 11, 2018

Some plants need a lot of water. Basil does, for example. But succulents in particular don't want much.

another-cuppa · on Oct 11, 2018

Succulents are easiest. Things like peace lillies are also classic. Anything you could get from a supermarket is probably fine.