
pdubs · on May 9, 2014

Sony MDR-V6/MDR-7506 http://thewirecutter.com/reviews/the-best-150-over-ear-headp...

epaladin · on May 9, 2014

Agreed. The MDR-7506 is a "studio standard" for reference/monitor headphones in pro audio and video. The frequency response is not "enhanced" at the low end like the Beats and many other modern headphones. I don't think the curve is really flat; they always sound a little bright to me (though in that hear-more-detail sort of naturally pleasant way), but they're definitely flatter than the Beats. The 7506 is like $90. The V6 has the same drivers, but doesn't have a gold-plated plug (seriously). I got my V6s refurbed for $50. The closed earpads passively block external sound pretty well, and they're relatively comfortable. They don't come in green or pink, but they sound good, are still made with some metal parts for durability, and they're way cheaper than Beats. Source: work in a couple of different TV production studios, a friend who's in the audio/acoustics industry, and a lot of reviews at B&H.

ArkyBeagle · on May 10, 2014

Those are great for tracking and/or studio-critical (read: intense, "What was that??") listening, but they're pretty shrill for general listening.

I find them fatiguing.

I cannot get past the Koss KTXPRO-1. They're very inexpensive, but I can mix on them, they're comfortable and priced very well. There's no fatigue with them.

I really do throw a rough up first on the 'phones because it seems to get me to a decent first mix fast.

I need to buy a nice pair of Grado and/or Stax just to try to wean myself off these things. :) Don't mean to sound fanboiish, but I've used 'em now for going on fifteen years. Headphones are like that.

pdubs · on April 23, 2014

Too bad there's no first-party app for hardware-accelerated 1080p/DD5.1 streaming on Windows like there is for Netflix and Hulu...

pdubs · on March 22, 2014

Some of the previous work on this involving timing attacks against SSH [1] is particularly interesting because it's so obvious in retrospect, but no one saw it when SSH was being designed.

[1]http://www.cs.berkeley.edu/~daw/papers/ssh-use01.pdf

EGreg · on March 22, 2014

About the nested ssh attack - I don't get it. How come the ssh client on B waits until return is hit to send the password, but the client on A doesn't?

pdubs · on March 19, 2014

Never more relevant... http://xkcd.com/435/

splintercell · on March 19, 2014

My friends made an improvisation to the comic:

http://i.imgur.com/etuhc8z.jpg

atmosx · on March 19, 2014

That's genius, I have a huge respect for 'xkcd' really. Some presentations are beyond amazing. That said I think it's spot on, Mathematics is the language of the Universe.

MaysonL · on March 19, 2014

He forgot category theorists...

pdubs · on March 18, 2014

If you look at the difference in surface area between traditional men's/women's models of the same watch [1] there's a pretty massive difference in what you'd be able to do with the space. However, you do see more women wearing men's watches now, so I could see it catching on.

I think they will need to hit a pretty big number for battery life. A "40-hour power reserve" (what most mechanical automatics have) would be a big step towards being taken seriously as a watch instead of a gadget.

[1] http://www.omegawatches.com/collection/seamaster/aqua-terra-...

pdubs · on Feb 24, 2014

>(For a humorous take on nation-state threat models, read the hilarious usenix article This World of Ours by James Mickens: http://research.microsoft.com/en-us/people/mickens/thisworld...)

"Security research is the continual process of discovering your spaceship is a deathtrap" has to be one of the most apt descriptions of security research I've ever heard. What a great read!

pdubs · on Feb 20, 2014

Regarding #5: >That caught me by surprise. Both options “roughly the same” and “depends on the data” got about 25% — the guessing probably.

I don't think it was guessing so much as reasoning that fetching 100 rows (and filtering by value) instead of 10 rows doesn't have significant real-world impact unless the row data is particularly large. I'll admit I didn't think of the out-of-index lookup, but my main thought was 100/1000000 vs 10/1000000 isn't a big deal unless the DB is running on an abacus.

pradocchia · on Feb 20, 2014

I also answered "about the same", and no I didn't notice the bookmark lookup (so-called on MSSQL), but even if I had I'm not sure I would have changed my answer--well maybe I would have because I would have noticed the "trick", but putting my test-taking adaptations aside....

It is already a highly selective query. Adding 100 bookmark lookups will not cause a material change in performance, unless this query is being executed 1000s of times per second, in which case maybe your real problem lies elsewhere.

jcampbell1 · on Feb 20, 2014

> It is already a highly selective query.

Is it? The first query is always O(1). The worst case for the latter query is that it must aggregate over 999,910 rows.

Consider the case where all values of 'a' are 123, and all values 'b' are 42, except 90.

pradocchia · on Feb 20, 2014

> Is it?

At least on MSSQL, I would expect a query plan like so:

  1. Index seek WHERE a = 123, yielding ~100 rows. [1]
  2. Bookmark lookup with results from (1), yielding ~100 rows.
  3. Filter (2) WHERE b = 42 and project date_column, yielding ~10 rows.
  4. Aggregate (3) by date_column, yielding ~10 rows or less.

And the optimizer will choose this over a full table scan so long as the cost of 1+2+3 < full table scan. I don't know the threshold for that, but it is certainly more than 100 rows out of 1M+, and the planner will have an estimate of selectivity that will inform plan selection.

[1] Important caveat. I interpret this line,

> Current situation, selecting about hundred rows out of a million:

....to mean selection of 100 rows from the base table, rather than projection of 100 rows in the result, post-aggregation.

But if he really means a SELECT statement that returns 100 rows, then we have no idea how selective the WHERE clause is, and my answer changes to "The query will be much slower (impact >10%)".
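The four plan steps listed above can be sketched roughly as follows. This is only a toy model, not how MSSQL executes anything: the dict-based `table` and `index`, the helper names `build` and `run_plan`, and the generated data (100 rows with a = 123, 10 of them with b = 42) are all assumptions made for illustration.

```python
from collections import Counter

def build(rows):
    """Hypothetical stand-ins: a heap table plus a secondary index on (a, date_column)."""
    table = dict(enumerate(rows))            # row id -> (a, b, date_column)
    index = {}
    for rid, (a, _b, d) in table.items():
        index.setdefault(a, []).append((d, rid))
    return table, index

def run_plan(table, index, a, b):
    hits = index.get(a, [])                          # 1. index seek WHERE a = ?
    fetched = [table[rid] for _d, rid in hits]       # 2. bookmark lookup, one per index hit
    kept = [d for (_a, rb, d) in fetched if rb == b] # 3. filter WHERE b = ?, project date_column
    return Counter(kept), len(fetched)               # 4. aggregate by date_column

# Made-up data: 10,000 rows, 100 with a = 123, 10 of those with b = 42.
rows = [
    (123 if i % 100 == 0 else i % 100,
     42 if i % 1000 == 0 else 7,
     f"2014-02-{20 + (i // 1000) % 5}")
    for i in range(10_000)
]
table, index = build(rows)
counts, lookups = run_plan(table, index, 123, 42)
print(lookups, sum(counts.values()), len(counts))
```

The point of the sketch is that step 2 touches the base table once per index hit, regardless of how few rows survive the filter in step 3.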

jcampbell1 · on Feb 20, 2014

I took the latter interpretation. The "correct" solution uses the fact the first query can be solved by selecting 0 rows from the base table.

pdubs · on Feb 20, 2014

No query optimizer would look at this and say "1M rows? Let's group and aggregate before filtering." Not to mention, the question specifically states that a=? would return 100 rows and a=? and b=? would return 10.

Regarding O(1), the first query would be some form of O(n log n) or O(log n) depending on the table/index data structures.

thwarted · on Feb 21, 2014

> No query optimizer would look at this and say "1M rows? Let's group and aggregate before filtering."

I hope no optimizer would say that. It is well defined that filtering, as expressed in the where clause (if it uses indexes or not) happens before group and aggregate, and that grouping happens on the result of the filtering. If the optimizer could choose one way or the other, you'd have different results. If you want to group and aggregate first, you need to explicitly express that with a subquery.

jcampbell1 · on Feb 20, 2014

I don't see that a=? rule. By selecting "100" rows out of a million, I think that means 100 distinct dates.

In this case, I take n = 1,000,000

The first query is likely O(log(x)) where x is number of distinct values of a. I approximated that to be O(1) relative to n.

I could be wrong here, or we could have seen different questions, or we are just interpreting the question differently.

nollidge · on Feb 20, 2014

We're not talking about worst case. We're talking about 100 rows and 10 rows, which is what @pradocchia means by "selective".

Indexing shouldn't follow the theoretical worst case, it should follow what's actually in your table.

ars · on Feb 21, 2014

> The worst case for the latter query is that it must aggregate over 999,910 rows.

No, it already said it only took 100 rows, it can't get worse from that.

Now if he actually meant the final result set was 100 rows (meaning after the group by) that's different. But that's not what he actually said, so the question is misleading.

Guvante · on Feb 20, 2014

My thought was akin to what he said, the system needs to grab all 100 rows to do the second filter.

I didn't think about the fact that the original one was an index-only scan, so went with "Roughly the same", since outside of that property the performance is similar.

Too bad there is no way to get good data on why people thought that way without a much longer quiz.

trhway · on Feb 20, 2014

It has a significant impact because the first case is "take 100 rows from index" and the second is "take 100 rows from index _and_ for each row go to the row in the table - do the random IO and with 1M rows it is probably 1 IO/row - and check for the value of 'b'"

Such 100 random IOs will cost 0.5 sec on 1 iron platter HDD for example. So the query performance will degrade significantly until either the table is already preloaded into memory or you use SSD drives.

batbomb · on Feb 20, 2014

> Such 100 random IOs will cost 0.5 sec on 1 iron platter HDD for example.

That's an incredibly, incredibly iffy and mostly wrong statement which depends on arguably a corner case which doesn't often reflect reality (factors include which DBMS, row ordering, table size, cache size, block size, page size, RAM size, hard disk seek time, HDD throughput).

The only case where that's likely is performing a very cold query on a very large randomly distributed table once (and probably only once).

Even a table of 1 million rows with ~30B per row could easily be read into memory in about 300ms (read at ~100MB/s plus ~5ms seek time, or ~= 5+(1e6*rowsize/(100e3)) ms)

Query Optimizers do exactly this.
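The back-of-envelope numbers traded in this subthread can be reproduced with a few lines of arithmetic. The constants below are the figures quoted in the comments (about 5 ms per random I/O, about 100 MB/s sequential throughput, 1M rows at roughly 30 bytes each), taken as assumptions rather than measurements; the function names are made up for the sketch.

```python
# Assumed figures from the comments, not benchmarks.
SEEK_MS = 5.0               # one random I/O on a spinning disk
SEQ_BYTES_PER_MS = 100e3    # ~100 MB/s sequential read
ROWS = 1_000_000
ROW_BYTES = 30

def random_lookups_ms(n_lookups, seek_ms=SEEK_MS):
    """Cold-cache cost if every table lookup needs its own random I/O."""
    return n_lookups * seek_ms

def full_scan_ms(rows=ROWS, row_bytes=ROW_BYTES,
                 seek_ms=SEEK_MS, rate=SEQ_BYTES_PER_MS):
    """One seek, then a sequential read of the whole table."""
    return seek_ms + rows * row_bytes / rate

print(random_lookups_ms(100))  # 500.0
print(full_scan_ms())          # 305.0
```

Under these assumptions both estimates land in the same few-hundred-millisecond range, and both apply only to a cold cache; a warm cache changes the picture entirely.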

trhway · on Feb 20, 2014

>> Such 100 random IOs will cost 0.5 sec on 1 iron platter HDD for example.

>That's an incredibly, incredibly, iffy and mostly wrong statement which depends on arguably a corner case which doesn't often reflect reality (factors include which DBMS, row ordering, table size, cache size, block size, page size, RAM size, Hard Disk seek time, HDD throughput.

You're welcome to specify iron platter HDDs which would do 100 random IOs in a significantly different time than 0.5 sec.

>Even a table of 1 million rows with ~30B per row could easily be read into memory in about 300ms (100MB read time + ~5ms seek time, or ~= 5+(1e6*rowsize/ (100e3)) )

>Query Optimizers do exactly this.

Not always. If it has enough info, it may take a full table scan path in such a case. Still, it will take about the same time: 300ms vs. the 0.5 sec I mentioned.

Of course it is for cold queries. You forgot to mention, I guess, that the full table scan you propose may bypass the DB cache (the default depends on the DB; the modern tendency is to bypass by default), and thus a second query will take the same time, while table blocks brought in by random IO would frequently be stored in the DB cache, thus making a second query run somewhat faster, depending on how much data was brought in.

brianberns · on Feb 20, 2014

I had the same thought. Seems like the optimizer could still perform an index-only scan to get to 100 rows, then go to the table to filter them down to 10 rows. Yes, the second step is extra, but should still be fast. What am I missing?

eterm · on Feb 20, 2014

That was my reasoning when I answered, but I had missed the fact it was a GROUP BY, which means you can't just filter after the fact.

Edit: In other words it was 100 or 10 aggregated rows. An extra WHERE clause will change the values of each of the rows rather than just filter the rows from 100 to 10. (Which a HAVING clause would do.)
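The WHERE-versus-HAVING distinction can be seen on a tiny example. This is a sketch using SQLite through Python's `sqlite3` module; the three rows and the `HAVING count(*) > 1` condition are invented for illustration, and only the table and column names come from the thread.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tbl (a INTEGER, b INTEGER, date_column TEXT)")
conn.executemany("INSERT INTO tbl VALUES (?, ?, ?)", [
    (123, 42, "2014-02-20"),
    (123, 7,  "2014-02-20"),
    (123, 7,  "2014-02-21"),
])

base = "SELECT date_column, count(*) FROM tbl WHERE a = 123"
tail = " GROUP BY date_column"
order = " ORDER BY date_column"

# No extra predicate: every a = 123 row is counted.
no_filter = conn.execute(base + tail + order).fetchall()
# Extra WHERE predicate: rows are dropped before grouping, so counts change.
where_b = conn.execute(base + " AND b = 42" + tail + order).fetchall()
# HAVING: groups are dropped after aggregation, surviving counts are unchanged.
having = conn.execute(base + tail + " HAVING count(*) > 1" + order).fetchall()

print(no_filter)  # [('2014-02-20', 2), ('2014-02-21', 1)]
print(where_b)    # [('2014-02-20', 1)]
print(having)     # [('2014-02-20', 2)]
```

The WHERE version changes the count inside the surviving group (2 becomes 1), while the HAVING version only removes whole groups.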

buckbova · on Feb 20, 2014

It's much simpler than that. The first query only has to reference the index because the data is IN the index. The second query has to access the table. That's it.

It's called a covering index.
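A quick way to see this is to ask a database for its plan. The sketch below uses SQLite via Python's `sqlite3` module, which is not the engine anyone in the thread tested on, and it assumes the index is defined on `(a, date_column)`; the thread never shows the index definition, so treat `tbl_idx` as hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tbl (a INTEGER, b INTEGER, date_column TEXT)")
# Assumed index definition: the thread only says the data is "IN the index".
conn.execute("CREATE INDEX tbl_idx ON tbl (a, date_column)")

def plan(sql):
    """Join the human-readable detail column of EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return " | ".join(row[3] for row in rows)

q1 = "SELECT date_column, count(*) FROM tbl WHERE a = 123 GROUP BY date_column"
q2 = ("SELECT date_column, count(*) FROM tbl "
      "WHERE a = 123 AND b = 42 GROUP BY date_column")

plan1 = plan(q1)  # every referenced column lives in the index
plan2 = plan(q2)  # column b is not in the index, so the table must be visited
print(plan1)
print(plan2)
```

In SQLite the first plan mentions a covering index and the second does not; the exact wording of the plan text varies between SQLite versions.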

eterm · on Feb 20, 2014

But if it wasn't for the GROUP BY, filtering 100 results of a million down to 10 results wouldn't change performance much even if you read every column of every row of those 100.

The trick is the fact that the GROUP BY means that "It used to return 100 it now returns 10" is a red herring, it still has to read every row to make up those 10.

buckbova · on Feb 20, 2014

I don't understand what you mean "trick".

  SELECT date_column, count(*)
  FROM tbl
  WHERE a = @a
    AND b = @b
  GROUP BY date_column;

The "AND b = @b" causes the sql engine to access data in the table instead of solely relying on the index. GROUP BY has 0 to do with it. If you changed the query to

  SELECT a, date_column
  FROM tbl
  WHERE a = @a

and

  SELECT a, date_column
  FROM tbl
  WHERE a = @a
    AND b = @b

The answer would be the same.

eterm · on Feb 21, 2014

No, it wouldn't.

If we're told that:

  SELECT a, date_column FROM tbl WHERE a = @a

Returns 100 rows.

Then:

  SELECT a, date_column FROM tbl WHERE a = @a AND b = @b

Will only have to scan column b over 100 rows.

Even without an index that will always be negligible, not compared to using the index to grab 100 rows from 10 million, but just compared to running a query and returning results at all.

The reason that the original can be a lot slower is that the 100 and 10 rows of results are comprised of a lot more rows of actual information, because of the grouping.

You're right that:

  SELECT a, date_column FROM tbl WHERE a = @a AND b = @b

would be a lot slower, given the same data, but that isn't the scenario, the group by has implications about what "returns 100 rows, returns 10 rows" actually means in terms of data read.

buckbova · on Feb 21, 2014

Query 1 is an index seek only. It does not access the table data.

Query 2 will perform the same index seek but will need to do a key lookup on each row and filter.

It's not negligible. The 100 results are not comprised of a lot more information in this case, regardless of the grouping, because the 1st query does not access the table.

Edit:

I happen to have a table lying around with a little over a million rows and set up a similar set of queries.

The query optimizer suggested the index seek taking 6% of total operation time while the key lookup taking up the other 94%. The rest was negligible.

Alex3917 · on Feb 21, 2014

While you're correct, GROUP BY apparently also does kill the indexing:

http://dev.mysql.com/doc/refman/5.0/en/group-by-optimization...

henrikschroder · on Feb 21, 2014

What are you talking about? According to that page, the example from the quiz would result in a tight index scan:

> The GROUP BY does not begin with the first part of the key, but there is a condition that provides a constant for that part:

> SELECT c1, c2, c3 FROM t1 WHERE c1 = 'a' GROUP BY c2, c3;

munimkazia · on Feb 21, 2014

Yep, that's exactly what I felt too. The only question I got wrong in fact.

pdubs · on Feb 13, 2014

Unless I'm misreading this decision it merely upholds the FCC's right to classify ISPs as "information services" instead of "telecommunication services" because the Telecommunications Act is a bit fuzzy and leaves a lot up to the FCC. The FCC could decide to reclassify them as telecommunications services due to industry changes and I believe that this case would serve as precedent for them to legally do so.

pdubs · on Feb 6, 2014

Generally AMEX is known for better customer service, but YMMV. AMEX is often seen as a good choice for people looking to build credit because they're often willing to give out a 3x credit limit increase (without a hard credit pull) first 90 days after getting a card, and then every 6 months after that (up to a point and assuming you're using the account and in good standing).

pdubs · on Feb 5, 2014

The VAIO Z used to fill those requirements. Looks like the VAIO Pro replacement for the Z now has all ULV processors. :(