> +1 to the people who'd like to donate 50$ for a version which does not send my input anywhere. Okay, previously I never donated the project. But use the iTerm2 for ~10yrs and would like to continue.
So, this person has been using iTerm2 for 10 years without paying, and would only consider donating if this feature is removed?
> it is a very unwelcome statement of disagreeable values. Adding OpenAI integration, as optional as it is here, makes it clear that you don't stand against OpenAI and the whole "AI" industry.
Imagine the crime of maintaining a free and open source terminal emulator in one's spare time... how hideous.
Hah! Just another turn of the wheel, this time with AI. Lots of entitled developers out there who sure do have a lot of time on their hands to complain.
1. A terminal shouldn't be able to ask some resource on the internet what to type and auto-execute it.
2. AI fear/fatigue/???
I think point 1 is reasonable to an extent, but it should be taken in context. iTerm2 is a free app, and as far as I can tell it isn't remotely required on any Mac, since macOS ships a basic default terminal that can be customized. From the video demos I've seen, nothing types directly into your terminal; it's up to the user to review, copy, and paste the generated code snippet. The underlying tech has been in iTerm for a while, from what I can see. Auto-fill also enables things like the 1Password integration, and anyone can open a ChatGPT client and copy/paste shell code from there in the same way the iTerm2 integration works.
I understand point 2. I have never cared for the AI hype; it holds near-zero interest for me and doesn't affect my work. Almost every editor has some capacity to ask the internet for data and paste it in, from AI or otherwise, and no one is sounding a major alarm bell about that. You could argue there is a big push for these integrations to train models, but even that requires an API key.
From reading that issue, it sounds like some people are worried about compliance with security policies (whether personal or corporate).
I'm very happy with iTerm2 and its features are useful to me, but I can't see myself using the AI chat feature when I have Copilot in VS Code. I could see a use case for people unfamiliar with certain commands, something like "please sort this output by the first then fourth columns". But if I'm writing a script or a small Python utility, VS Code is where I'll do it.
For compliance though, the AI integration could be a separate binary that you can access via the command line, although as pointed out in a reply, that's the same as a code path that isn't used. However, it is easier to block a separate binary, so maybe that's the thinking there?
Instead, maybe the people who have an issue with this feature would be happy with an optional setting "assign this keyboard shortcut to the AI binary". Or a feature flag that says "do not access the network under any circumstances".
I get the compliance perspective, but it feels stupid to bring it up now, especially since iTerm2 has already had integrated network features for a long time.
Agreed on the "do not access the network" feature flag though, every program should have that. Or really it should just be a toggle in the OS on a per-app basis.
I dunno about "do not access the network"; that sounds like the wrong granularity. I want an app like Evernote or Calendly to sync to its own cloud backend (or better, to a server I configure). I just don't want them sending my data anywhere else.
Annoyingly though, in that scenario, the desire not to have my data processed by third-party vendor APIs would need to apply both to the client (which I can control through technical measures, e.g. Little Snitch) and to the cloud backend it talks to (which I fundamentally cannot control). So such a config flag can't be a purely technical measure; it also has to be something communicated to the backend, a la "Do Not Track". And unlike HTTP, most of the other application-layer protocols we use today don't have anything like a standardized way to communicate "user-imposed constraints on how they want you to process their request, while still giving the same result".
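A minimal sketch of what such a constraint might look like at the request layer, in the spirit of "Do Not Track". The header name and constraint token below are invented for illustration; nothing like this is standardized today:

```python
# Sketch: attaching a hypothetical "no third-party ML processing"
# constraint to outgoing request headers. Like DNT, nothing technically
# forces the backend to honor it; it only communicates the user's policy.

def with_processing_constraints(headers: dict, constraints: list) -> dict:
    """Return a copy of `headers` carrying the user's processing constraints."""
    out = dict(headers)
    # A cooperating backend would read this and skip the named code paths.
    out["Processing-Constraints"] = ", ".join(constraints)
    return out

headers = with_processing_constraints(
    {"Accept": "application/json"},
    ["no-third-party-ml"],
)
print(headers["Processing-Constraints"])  # no-third-party-ml
```

The point is that the constraint travels with every request, so the backend can apply it per-datum rather than relying on an account-wide setting the client can't verify.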
> And unlike HTTP, most of the other application-layer protocols we use today don't have anything like a standardized way to communicate "user-imposed constraints on how they want you to process their request, while still giving the same result".
This is the wrong approach and what you really want is for LLMs to instead have access to a palette of pre-vetted bugtested commands and options implemented by other applications.
i.e. think of those Python embeds in OpenAI's tooling… but instead of building a Python script, you should be building an Ansible playbook or a macOS Shortcut that does the task, rather than having an LLM bang together shell code directly.
Things like file access or web requests are just more primitives in this model. Don't want it to call out to the web? Don't give it access to the web-request primitives. This has literally been a solved problem for 30 years: macOS intents and Windows COM interfaces allow applications to expose these capabilities in a way that can be programmatically interfaced by other code to build up scripts.
This is HyperCard-era stuff, Unix just won so thoroughly that people don’t consider these capabilities, and everyone assumes everything has to be a single giant unshaped command of CLI garbage piped together.
The Unix mindset is not the right one for LLMs working at the token level. The idiom you need to mimic is macOS intents or Ansible actions… or at least PowerShell actions. The amount of unconstrained shell-scripting involved needs to be minimized rigorously. Every time you do it, it's a risk, so why make it write anything more complex than glue code or composable YAML commands?
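The "palette of pre-vetted commands" idea can be sketched as a dispatcher that only accepts actions declared up front, with typed parameters, and has no raw-shell escape hatch. The action names and schemas here are invented for illustration:

```python
# The model may only select from declared actions; anything else is
# rejected before any code runs. Capabilities it shouldn't have (e.g.
# web requests) are simply absent from the palette.

ALLOWED_ACTIONS = {
    "sort_lines": {"params": {"column"}},
    "read_file":  {"params": {"path"}},
    # Note: no "http_get" entry, so the model has no web primitive at all.
}

def dispatch(action, params):
    spec = ALLOWED_ACTIONS.get(action)
    if spec is None:
        raise PermissionError(f"action {action!r} is not in the palette")
    unknown = set(params) - spec["params"]
    if unknown:
        raise ValueError(f"unexpected parameters: {sorted(unknown)}")
    # A real system would invoke the vetted, bug-tested implementation here.
    return ("ok", action, params)

print(dispatch("sort_lines", {"column": 4}))
```

Denying a capability is then a matter of leaving it out of the palette, rather than trying to sandbox whatever shell text a model happens to emit.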
You're misunderstanding what I was trying to describe.
I wasn't talking about the direct use-case at hand (iTerm), because iTerm communicates directly with ML APIs. But such a "request processing constraint" would not be for use in communicating with ML APIs.
If the client application is directly communicating with an ML API, then the technical measure — a group-policy flag that tells the app to just not do anything that would communicate with those APIs; or further, which alters the app's sandbox to forcibly disable access to any/all known ML API servers on the OS-network-stack level — is enough to prevent that.
Rather, what I was referring to by a "request processing constraint", is for the other case — the case that can't be addressed through purely technical measures. And that's the case where a client application is making a request or syncing its data to a regular, non-ML business-layer server; where that backend then might decide to make requests to third-party ML APIs using the client's submitted data.
Think: the specific REST or gRPC API that most proprietary "service client" apps are paired with; or an arbitrary SMTP server for your email client; or the hard-coded AMQP broker, for an IoT device that relies on async data push to a remote message-queue; or the arbitrary S3-compatible object-store endpoint for your `rclone` or `pgbackrest` invocation.
It is for these systems, that you'd need a way for the client to tell the backend system it is communicating with, that the client is operating under a regime where third-party ML is not to be used.
The spirit of such a "request processing constraint" being communicated to such a backend system, would be: "don't allow any of the data being carried by this request, once it arrives at your [regular non-ML business-layer] backend system, to then be sent to any third-party ML APIs for further processing. Not in the process of resolving the request, and not later on for any kind of asynchronous benefit, either. Skip all those code-paths; and tag this data as tainted when storing it, so all such code-paths will be skipped for this data in the future. Just do what processing you can do, locally to your own backend, and give me the results of that."
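A backend-side sketch of what honoring that constraint could look like: data arriving under the constraint is stored with a taint flag, and any code path that would forward it to a third-party model checks the flag and falls back to local processing. All names and the indexing logic are invented for illustration:

```python
# Data ingested under a "no-third-party-ml" constraint is tagged tainted;
# ML-forwarding code paths are skipped for it, now and later.

from dataclasses import dataclass

@dataclass
class StoredNote:
    text: str
    tainted: bool  # True => never forward to third-party ML APIs

def ingest(text, constraints):
    return StoredNote(text=text, tainted="no-third-party-ml" in constraints)

def index_note(note):
    if note.tainted:
        # Backend-local, non-ML fallback processing for constrained data.
        return f"local-index:{note.text.lower()}"
    # Placeholder for the code path that would call a third-party ML API.
    return f"ml-index:{note.text}"

note = ingest("Quarterly Plan", {"no-third-party-ml"})
print(index_note(note))  # local-index:quarterly plan
```

Because the taint is stored with the data, the constraint survives beyond the original request: a later asynchronous reindexing job would hit the same check.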
(Ideally, there would also be some level of legal regulation, requiring vendors to at least put their "best effort attempt" into creating and maintaining a fallback backend-local [if not client-side!] implementation, for people who opt out of third-party ML processing — rather than just saying "the best we can do is nothing; your data goes unprocessed, and so the app just does nothing at all for you, sorry.")
---
You're also misunderstanding the objections that the people in the comments section of this post have to data being passed to ML models.
Most of the people here who are having a visceral reaction to iTerm including ML capabilities, aren't thinking through the security concerns of this specific case of iTerm generating a shell script. I do understand that that's what you were trying to address here by talking about high-level capability-based APIs; but it wasn't what I was trying to suggest fixing, because "iTerm could generate a dangerous shell script" isn't what anyone was really complaining about here.
Rather, what people here were doing, was echoing a cached-thought response they had already long decided on, to the use or capture of their data by third-party ML-model service APIs generally.
The things that make people reject the idea of applications integrating with third-party ML APIs generally, are:
1. that the model would "do its job", but produce biased or censored results, creating unexpected "memory holes" in the resulting data (for recognition / transformer models), or tainting their processed data with arbitrary Intellectual Property captured from others (for generative models); or
2. that the ML company being sent your data has Terms of Service (which the vendor of the product or service you're using agreed to, but which you never did) allowing any data submitted in a request to be used for model-training purposes. (And where, legally, due to the Terms of Service you consented to with the vendor, whatever minimal rights they have over "your data" make it "their data" just enough for them to consent to the use of that data by third parties, without any additional consent on your part.)
Regarding problem 1 — imagine, as I said above, the Evernote use-case. You add a bunch of notes and web clips to an Evernote notebook through their client app. This app syncs with their cloud backend; and then this cloud backend sends each note and clip in turn through several phases of third-party ML-assisted processing during the indexing process — "AI OCR", "AI fulltext keyword extraction", etc. But maybe the models don't know some words they weren't trained on; or worse yet, maybe the creators of those models intentionally trained the AI to not emit those words, but didn't document this anywhere for the vendor to see. Now some of your documents are un-searchable — lost in the depths of your notebook. And you don't even realize it's happening, because the ML-assisted indexing isn't "flaky", it's rock-solid at 100% recognition for all your other documents... it just entirely ignores the documents with certain specific words in them.
Regarding problem 2 — think about ML built into a professional creative tool. A tool like Adobe Illustrator (where it's probably called client-side) — or even Adobe Bridge (where it might just be happening in their "creative cloud", analyzing your Bridge library to suggest stock image assets that might "go well with" your projects.) What if your own Illustrator illustrations, sitting in Bridge, are being fed by Adobe to some third-party image2text model... owned by a company that also does generative image models? In other words, what if your own Illustrator illustrations — done as a work-for-hire for a company that is paying you to create them a unique and cutting-edge brand appearance — are being used to train a generative image model to ape that novel style you created for this client; and that version of the model trained on your images gets published for use; and people end up discovering and using your new style through it, putting out works on the cheap that look just like the thing you were going to charge your client for, before you even present it to them?
A "third-party ML processing opt-out" request-constraint, would be something intended to circumvent these two problems, by disallowing forwarding of "your" data to third-party ML APIs for processing altogether.
This constraint would disallow vendors from relying on third-party models that come with opaque problems of bias or censorship that the vendors were never informed of and so can't work around. (Such vendors, to enable their backend to still function with this constraint in play, would likely either fall back to non-ML heuristics; or to a "worse" but "more transparent" first-party-trained ML model — one trained specifically on and for the dataset of their own vertical, and so not suffering from unknown, generalized biases, but rather domain-specific or training-set specific biases that can be found and corrected and/or hacked around in response to customer complaints.)
And by disallowing vendors from submitting data to third-party model companies at all, this constraint would of course prevent anyone (other than the vendor themselves) from training a model on the submitted data. Which would in turn mean that any applicable "data use" by the vendor, could be completely encapsulated by contract (i.e. by the Terms of Use of the software and accompanying service) that the client can see and consent to; rather than being covered in part by agreements the vendor makes with third parties for use of "the vendor's" data that currently just happens to contain — but isn't legally considered to be — "the user's" data.
Given that the "do not track" header is almost universally disregarded, I'm not sure what value we'd gain from implementing more instances of disregarded user preferences.
Technical measures are the wrong lever to this problem. I can always send your precious data to my backend and proxy it to whatever third party vendor from there, and there’s nothing you can do to prevent that.
Instead, a legal solution like the GDPR offers better means of protection. The way the fines are structured, vendors have a clear incentive to not exfiltrate your data in the first place.
> Instead, a legal solution like the GDPR offers better means of protection.
I mean, yes, that was my point — that there'd need to be some legal thing like GDPR. But that thing would very likely need some kind of explicit user-driven policy choice (a la how websites are now forced to ask for a user-driven cookie-handling policy).
To comply with such a law, it would be likely that every application-layer protocol that could in theory involve a backend that relies on the use of third-party ML vendors, would have to be modified to somehow carry that policy choice along with requests. It'd be a huge boondoggle.
> Agreed on the "do not access the network" feature flag though, every program should have that.
I would claim that is not the responsibility of the app. That should be the sandbox/OS responsibility, to make sure it's actually true, rather than an app providing a checkbox that potentially does nothing.
> For compliance though, the AI integration should probably be a separate binary that you can access via the command line. Maybe with an optional setting "assign this keyboard shortcut to the AI binary".
There's no difference at all between a code path that's never called and another binary that's never called.
You are simply wrong to argue about privacy concerns here: the feature is entirely off by default, and it doesn't send anything you don't enter in the box dedicated to it.
It makes absolutely zero sense to be concerned about this but not about the terminal's ability to make any other call over the network.
Almost. The never-called code path may nonetheless reside in memory under control of a process with certain privileges, which does make it a tad less secure than a binary resting on disk. It’s also far more likely to be invoked by mistake (a bug) and you can’t totally remove it or take away its execute flag to drop that risk to basically zero.
I think the issues people here are expressing are not necessarily their personal concerns, but concerns their employers may now have if they learn about the feature.
If they're worried about your terminal calling OpenAI servers if you ask it to call OpenAI servers, why are they not worried about all the many more damaging things you can do with your terminal when you ask it to?
Guess what? My terminal has always been able to call ChatGPT, even before this curl wrapper was released :O
Eh. I was the CISO at a HIPAA-covered healthcare company, and I have no problem with the way iTerm handles this. Nothing gets sent from your terminal, other than what you type into the separate AI prompt window. You have to manually enter your ChatGPT key. You have to manually choose to open the AI prompt.
I see this as not substantially different from a programmer having a browser tab open where they could type questions and get answers, just more convenient. If I didn't want my coworkers doing that at all, I'd push out a device policy adding a firewall block for OpenAI's API servers and then not worry about it at all.
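The device-policy idea above amounts to an egress decision against a blocklist of ML API hosts. A real deployment would enforce this in the firewall or MDM layer rather than in application code, and the hostnames here are examples only, but the decision logic can be sketched as:

```python
# Sketch: allow/deny an outbound connection by checking the destination
# host against a blocklist, matching the exact host and any subdomain.

BLOCKED_HOSTS = {"api.openai.com"}

def egress_allowed(host):
    return not any(
        host == blocked or host.endswith("." + blocked)
        for blocked in BLOCKED_HOSTS
    )

print(egress_allowed("api.openai.com"))  # False
print(egress_allowed("example.org"))     # True
```

With the block in place, the in-app feature fails closed (the API call simply can't connect), which is why the CISO above could stop worrying about the checkbox itself.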
> From reading that issue, it sounds like people are worried about compliance with security policies (whether personal or corporate)
This is incredibly stupid. If they don't trust iTerm to respect their privacy, why were they using it in the first place? For all they know, it could have been sharing all their data without telling them from the very beginning. Alas, the tool is open source; they could just audit it instead of yelling at clouds, but hey.
They trusted it before; now things have changed, and they have lost that trust. Previously there was no reason to believe that iTerm would send your terminal input or output to a 3rd party.
iTerm still doesn't send anything to a third party; you do, by willfully using a feature that explicitly states as much. If you're concerned that iTerm would do something behind your back, then you shouldn't be using it in the first place.
It's hard to believe that some of these comments aren't trolling. Do they also consider the fact that iTerm can call out to tools like `wget` and `curl` a privacy risk and a slippery slope that might share their data if used wrong?
Complete whataboutism. If I fat finger bad data, that's expressly my fault. This is a case where I now need to worry about tools I use that never sent my usage data somewhere now sending it somewhere.
But isn’t it more fun to make up unrealistic/impossible scenarios and then get mad at those instead since the reality is that there is nothing to be mad about? /s
I can't stand the people up in arms about an opt-in feature, one of many in iTerm2. iTerm2 is not and will not become some kind of AI-based terminal; that idea is absurd, and so are the people making that argument.
In fact, the implementation is so barebones that it’s not useful IMHO so the outrage is even more silly.
Choosing a network-capable download command is different from an option to send all commands to the cloud for processing. And we know defaults get changed at times, sometimes on purpose (hi Facebook!).
We also have decades of experience and culture around how to use network commands properly, especially for FLOSS tools.
Considering that newbies will be attracted to these cloud tools, the risk of information leakage sounds a lot higher in the second instance.
https://gitlab.com/gnachman/iterm2/-/issues/11470