Hacker Newsnew | past | comments | ask | show | jobs | submit | jlukecarlson's commentslogin

I appreciate the details shared in this paper but it'd be great if they open sourced their implementation!



ml-mdm from Apple Machine Learning Research

I maintain ml-mdm, a text-to-image diffusion codebase, that contains implementations of my research group’s recent papers https://github.com/apple/ml-mdm

Any help with DOCS or CODE would be greatly appreciated. I’m happy to have BEGINNER-FRIENDLY contributions as well, there’s a “good first issues” label that we use on GitHub.

We have an active and growing group of external contributors and welcome newcomers.

Could be a fun opportunity to contribute to a machine learning research project and to learn about cutting edge techniques in this subfield.


I'm one of the contributors to this repo, so feel free to comment with any feedback!

The key research contribution from the related paper is that with a moderate amount of data (eg. 12M image pairs from CC12M) and a moderate amount of compute (single node of 8 A-100 GPUs for example) anyone can train a good text to image model using the unique multi scale nested u-net pipeline.

Hope this can help level the playing field for researchers everywhere.


To the comments about this being easy to defeat: when it comes to detecting whether a person submitted a document containing LLM generated text (whether a law document, school essay, work document etc) the real value in a technique like this is high precision, not necessarily high recall.

Yes many people can circumvent this simple watermark technique but for those who don't, it is essentially guaranteed that they used a LLM if their text has clearly atypical unicode marks (Whether U+2004, ligatures, or variant selectors). Thus an organization can feel confident in taking action against the individual who submitted the document.

Whereas right now there are a bunch of dubious "LLM detector" models that output a confidence score that may or may not correspond to whether the person used an LLM. This low precision technique leads to people getting incorrectly accused of using LLM content.

So in my opinion, a world of high precision (but potentially low recall) LLM watermarks using simple techniques is way better than this current high-noise low precision black box world of low quality "LLM detector models"


What I don't get is who will apply the watermark? Certainly not the company running the LLM (why would they degrade the output of a commercial product?). The people submitting text generated by the LLM have no interest to do that. The only people who have interest in this watermark are the recipient but they aren't involved in the production.


>why would they degrade the output of a commercial product?

For PR? It's not a degradation for legitimate uses of AI. It only degrades output being used in an attempt to mislead people. Someone using an LLM to e.g. translate would usually be fine admitting they used it. I'm working under the assumption this isn't intended for something like a code model where it would break things, but only for output being used as readable text.


A bunch of companies went to White House and promised they'd watermark their stuff.


I’d start by either stepping through the program (gdb, pdb, or a debugger of your choice) or adding more test cases - that’s often a quick way to understand what a piece of code is doing.

Either way I wouldn’t leave it as is, that’s just a recipe for ensuring it becomes even more opaque over time


It may just be a fake scenario but the actual solution there would be to implement the 1 hour fix without a single discussion or email.

Sometimes the best way to tailor words for both engineers and managers is to 1) solve the problem then 2) if the solution was actually difficult, explain how you solved it in simple terms.


Seems like a completely failed program to me. Any organization should be able to quantitatively prove their value on some level. As the article points out, for the DHS to say they do “not have information on [the air marshal program’s] effectiveness” while also averaging a cost of $200 million per arrest is an insane statistic that should immediately require changes in the program.


I haven't checked recently to see if this stat is still true, but at one point the number of people arrested by air marshals was actually lower than the number of air marshals who have been arrested.


The overwhelming majority of the time, "arrests" on airplanes are made by regular passengers and the flight attendants, who beat up the offender then hogtie him with duct tape. Air marshals are completely pointless.


Air marshals may be pointless, but not for this reason.

Air marshals are almost certainly trained not to intervene in minor disturbances. Otherwise a team of hijackers could locate the air marshal by having one person act as an unruly passenger.


If the movies are real they flash their gun and badge to the flight attendant and give them a wink.


And then get pounced on by Melissa McCarthy.


> Otherwise a team of hijackers could locate the air marshal by having one person act as an unruly passenger.

For the last twenty years, hijackers (in flights originating from US) are probably always presumed to be suicidal and much less likely to succeed in a hijacking. To the point where I'd assume that they're rarely even attempted.

Perhaps we could end up needing air marshals in the future, a future where many passengers are not old enough to recall or be affected by the terrorist attacks on September 11.


At what point is an air marshal supposed to interfere?


(Presumably) only when the fate of the entire aircraft, or a substantial portion of the passengers, is in jeopardy.


And the probability of them being on the "right" flight to come to the rescue out of thousands would be..?


Conversely, as a hijacker, you'd be taking a huge gamble to try and select the one flight where there were no air marshals.

A lot of times, I feel like this whole concept was based on an inherit deterrent and seems to have worked, in conjunction with increased airport screening:

"The number of hijackings has dwindled in recent years. About 50 have been reported since Sept, 11, 2001, and none in the U.S., according to the Aviation Safety Network."


Reinforced cabin doors and knowing that the terrorists want to kill everyone instead of taking hostages has changed the game. I'm not privy to what influences terrorists but it seems like they can no longer commandeer airplanes


If you're going to be hijacking to use the plane as a weapon you aren't going to be worried about the 1 in 10 chance of an air marshall trying to stop you, any more than the near certain attempt of the other 200 people on the plane trying to stop you. There just aren't legions of people out there willing to kill massive amounts of people in this fashion.

The number of bear attacks in Springfield went to zero in 1996 when Homer started Bear Patrol. It's a case of life imitating art.


Are you sure it isnt just the locks on the cabin doors that are reducing highjackings?


Before 9/11 you hijacked a plane, flew to Cuba, got some money, and all was well.

Before 9/11 you were a passenger/crewmember on a plane, you flew to Cuba, then got released, and all was well.

9/11 changed that. No longer were passengers and crew safe as long as they complied, no longer were planes allowed to fly around and even be refuelled safe in the knowledge that all it would take is some money to free them.

As such over the course of two hours the calculus changed. Hijacking the first 3 planes worked because nobody expected it. The fourth plane was also hijacked, but the goal (crashing into whatever building) wasn't met because the passengers and crew found out the rules had changed.

If you hvae 5 people willing to kill themselves for your cause, there are better ways to make a political statement now than the risk of hijacking a plane - not because of any potential air marshall, but because of the guarantee of the passengers.


Oh no, the lock on the cockpit has failed! Thankfully an air marshal was able to stop the hikacking by flashing his badge.


>"Air marshals are almost certainly trained not to intervene in minor disturbances."

It is either are or are not. Almost certainly sounds like you have no clue.


Like ever? Or while acting as an Air Marshal?


Yeah, that assertion is very open to interpretation and only raises questions than answer anything.


If you heard that more police officers had been arrested than had ever arrested anyone, would it really matter much if those police officers had been arrested as teenagers, or on the job? It's a program that employs people who have caused more incidents of crime than they have intervened to prevent. Not because they're particularly criminal, but because they are particularly useless.


> It's a program that employs people who have caused more incidents of crime than they have intervened to prevent.

Whether that's problematic depends on a lot of things, including whether arresting people is actually the main benefit of them being present.

I have a few dogs. They bark at people walking by on the sidewalk a lot. I've never had someone break into my house. Are the dogs useless as a mechanism to prevent home invasion or burglary, or does knowledge of their presence prevent people from even attempting such? Now consider that I haven't noted whether those crimes are common in my area or not.

Maybe those dogs have growled at guests in my house, making them feel uncomfortable. Should my stance be that they've growled at more guests than intruders they've attacked? Do you feel comfortable making a definitive statement on the value of those dogs and whether the costs outweigh the benefits with the given information, or do you think additional information would be important to discerning that?


At the point at which your dogs have broken into and robbed your neighbor's house more often than they have prevented your house from being broken into; that's the point at which you can compare your dogs to the Air Marshal service.


> more often than they have prevented your house from being broken into

The whole point is that you can't tell whether that's happened with the data presented, regardless of what you compare it to, and no, once is not too many times given the actual thing we're discussing.

If the existence of Air Marshals prevented one more 9/11 type event from happening, or even a few failed attempts (which would be capitalized on by politicians to push their own agendas), then I would without reservation say they are worth it. We just haven't been presented with that data. In some cases it may not exist. Acting like the answer to that is irrelevant is not the correct way to go about it though, in my opinion.


It would matter immensely imo


Context is important, lest you prime your reader to make unsupported conclusions. Imagine if instead of doing this to police officers, we insinuated things about arrestees instead.


> Context is important,

No, context can be important. If all context were important, you couldn't talk about one thing without talking about every other thing. If your argument about why context is important in this case rests on if a hypothetical fact that "people arrested by police have collectively prevented more crimes than they have committed" would be uninteresting or even unfair without context, I'd deeply disagree.


The latter


> Any organization should be able to quantitatively prove their value on some level.

There is danger there with corrupt and immoral organizations. They might just start generating false incidents and then run to save the day to pad their stats "look how effective we are!".

Luckily, that only happens with deeply corrupt organizations. Like for instance FBI [1], when they were sending their informants to mosques in US looking to recruit terrorists. Up until the the members of the mosque ended up reporting the FBI agents to the FBI.

--- Niazi and another mosque member had reported Monteilh to the FBI, claiming that Monteilh was espousing terrorist rhetoric and trying to draw them into a plot to blow up shopping malls ---

[1] https://www.ocweekly.com/news-the-fbi-the-islamic-center-of-...


> Like for instance FBI [1], when they were sending their informants to mosques in US looking to recruit terrorists. Up until the the members of the mosque ended up reporting the FBI agents to the FBI.

I am not saying what FBI does is good, but this reminds me now that the only way to deal with constant threat of phishing is actually to create false fishing attacks all the time, so that people would be aware of the thread. Could be generally a good approach if the methods employed were more public rather than clandestine.


> averaging a cost of $200 million per arrest

You could play that game with anything. How much does our military cost?

Or maybe, what was the total cost of one 9/11 airplane running into a target?

And there is the hidden properties of deterrence


I'm sure this has something to do with that entire re-organization feeling more like an excuse to create some contracts for companies connected to administration insiders than it did anything actually useful that couldn't have been accomplished without the creation of a huge new department (and several smaller ones).


Imagine being the politician or person who cancels the program, only to have a plane hijacked or whatever.

Spending a few billion of taxpayer money is fine if the cost is your political career or a hit to your political party.


> averaging a cost of $200 million per arrest is an insane statistic

That depends on the deterrence of having air marshals.

The argument sounds like one that gets brought up here regularly, when executives lay off 90% of the sysadmins because "nothing ever goes wrong, so why are we paying for them?", and teams that are constantly running round looking like heroes for fixing broken stuff all the time get more kudos than the teams that keep things quietly humming along without any issues.


This was my thought as well, but I still think we need evidence of this. If air marshals are primarily there to deal with terrorist threats (and not things like unruly/drunk/whatever passengers), then my feeling is that two things post-9/11 are responsible for the low risk of terrorist acts involving planes (or even just run-of-the-mill ransom-type hijackings):

1. Would-be hijackers can no longer get into cockpits. Pilots would much rather a hijacker kill every passenger and crew member on the plane than gain control of the plane.

2. Passengers don't take shit anymore. They know that, if terrorists successfully take control of a plane, the most likely outcome is that they're all dead. So they'll attack -- and hopefully subdue -- the hijackers.

I expect the effects of #2 have lessened somewhat, given that 9/11 was over 20 years ago, and the memory of it is less raw (not to mention many adults who fly now were young children or not even born in 2001).

If air marshals really do act as a deterrence, there must be some evidence to back that up.


(Nitpicking follows. I agree with your points.)

I don't think #2 actually goes away. It's a cultural shift. Prior to 9/11, the advice was always to just stay calm and let the authorities negotiate with the terrorist, because that had in the past generally led to passengers and plane surviving the incident. That level of passivity isn't coming back.


If air marshals could get rid of security checks, it would be worth the cost


Do other countries have anything like this program?


Israel.[1] They had one big success in 1970.

[1] https://www.securitysolutionsmedia.com/2018/04/20/air-marsha...


The generated "using it in a sentence" examples are often the funniest (and most absurd) part:

  chokebone
  choke·bone
  the muscle holding a prisoner down, used in controlling an animal or person such as a cat, dog, snake, snakebite, or spider
  "he managed to regain the chokebone and win the auction for his 12-year-old daughter"


I feel like it's been learning from urban dictionary https://l.thisworddoesnotexist.com/BpmB

coffeebomb cof·fee·bomb

    a bomb which, in addition to the usual stimulants such as caffeine, turns out to be deadly if inhaled by a person having sexual intercourse
    "a young man survived a blast of coffeebomb, killing two men and wounding five"


John Carreyrou, the WSJ reporter who first broke much of the Theranos story, just came out with a podcast called Bad Blood the Final Chapter that’ll follow the court case and other recent developments


Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: