More

willwade · 2025-12-11T16:24:32 1765470272

that to me looks like a error in whatever logic is behind the positional error code. You'd think they would have transformer models based on different layouts but maybe some weighting issues going on.. ie I would have thought its a model that is altering based on likelihood weights and maybe something up with that..

willwade · 2025-12-11T13:55:53 1765461353

I wonder if this would have been useful https://github.com/microsoft/presidio - its heavy but looks really good. There is a lite version..

shaoz · 2025-12-11T19:48:43 1765482523

I've used it, lots of false positives out of the box, you need to do a ton of tuning or put a transformer/BERT model with it, but then at that point it's basically the same thing as the OP's project.

threecheese · 2025-12-11T17:37:36 1765474656

Looks like it uses Googles Langextract, which uses only LLMs for NLP, while OP is using a small NER model that runs locally.

winchester6788 · 2025-12-11T15:11:18 1765465878

full of false positives though. but definitely good for some types of entities and regexes

willwade · 2025-12-11T13:54:51 1765461291

can i have this between my machine and git please.. Like its twice now I've commmited .env* and totally passed me by (usually because its to a private repo..) then later on we/someone clears down the files.. and forgets to rewrite git history before pushing live.. it should never have got there in the first place.. (I wish github did a scan before making a repo public..)

acheong08 · 2025-12-11T13:57:57 1765461477

GitHub does warn you when you have API keys in your repo. Alternatively, there are CLI tools such as TruffleHog you can put in pre-commit hooks to run before commits automatically

cwinq · 2025-12-12T07:54:52 1765526092

You can try GitGuardian, it is very powerful and free for individual developers and small teams. It has precommit hooks, detection in IDE and all.

mh- · 2025-12-11T14:24:42 1765463082

You can use git hooks. Pre-commit specifically.

https://git-scm.com/docs/githooks

ComputerGuru · 2025-12-12T00:00:37 1765497637

Already mentioned it in another reply, but .env and passing secrets as environment variables are a tragedy. Take a look at how SecureStore stores secrets encrypted at rest, and you’re even advised to commit them to git!

https://github.com/neosmart/securestore-rs

hombre_fatal · 2025-12-11T14:19:20 1765462760

At least you can put .env in the global gitignore. I haven’t committed DS_Store in 15 years because of it - its secrets will die with me.

willwade · 2025-12-11T22:04:23 1765490663

sorry.. global gitignore.. what have i been doing..

PunchyHamster · 2025-12-11T15:26:49 1765466809

aside from already mentioned hooks you can add global .gitignore for .env files

willwade · 2025-11-10T23:21:20 1762816880

Meta cheated with the mms models. That is they didn’t use a phonemeizsr step. This means they just won’t work or sound very strange. ASR data is usually not quite right for tts. But anyhow - not really answering your question but many of these languages already done in mms. Try them https://huggingface.co/spaces/willwade/sherpa-onnx-tts

willwade · 2025-11-10T09:42:14 1762767734

We built this for our use case (we create solutions to help people speak who have a disability). This is a prediction model you can run in node or the browser. Next word, next character, word completion.. PPM is old - but still rocks

willwade · 2025-10-12T10:16:29 1760264189

I’m interested to see how this compares with other heroku clones. The compassion stuff is interesting. I’m using apps on digitalocean. Can we get a comparaison of using app with droplet+blossom?

I’m with the other person too. Drop the emojis and your confidence goes up. We all know coding agents JUST LOVE filling up a document with emojis. It makes you wonder if it’s imagined the benefits too

willwade · 2025-10-07T21:31:15 1759872675

Yeah I totally want this. How much data are we talking about on average?

willwade · 2025-09-08T05:26:39 1757309199

See also https://github.com/mikeborozdin/vibe-composer-midi-mcp?tab=r...

willwade · 2025-08-28T18:48:50 1756406930

I’ve developed a couple of fine tuned t5 grammar correction models. Is the model or training open?

scottfr · 2025-08-28T18:58:43 1756407523

Interesting, I would love to hear how well those worked.

Grammit uses the Prompt API for the local LLM, which currently uses a version of the Gemma 3n model on Chrome.

Grammit uses prompting instead of fine-tuning or custom training. Simplified, it has a system prompt along the lines of: "Rewrite this text, correcting any grammar or spelling mistakes," combined with a prefilled conversation containing a number of examples showing an input sentence and the corrected output.

willwade · 2025-08-23T17:51:23 1755971483

Their opentext API is actually largely marketing - infact so much it worked - Im going to make some Monster cakes https://opentextapi.monsterenergy.com/opentext/images/ecde50... - https://opentextapi.monsterenergy.com/opentext/images/a1e8b8... Yum! Thanks! Count this the first time in history has sold me something