Pure Python Vim clone

amelius · on April 26, 2015

Why not decouple the following things:

- Core editor (internal representation, etc.)

- Key bindings (so you could easily create an emacs instead of a vi)

- Rendering

- Scripting language (for customized behavior)

Finally, make sure you thoroughly document these building blocks, so others can create really cool stuff with it. Also, think of possible use-cases when defining the modules. A smart architecture could allow for a collaborative editor, for example.

jonathan_s · on April 26, 2015

Hi, thanks for the suggestions! Actually, the internal representation, is completely separate from the layout. It's not in another repository, but it's decoupled.

The key bindings are also separate. Getting emacs bindings is not much more than changing this line [0]. Only adding the bindings for the window management and emacs command line is still to be done. (I know that emacs is actually much more than only its key bindings, but you know what I mean.)

The rendering is also independent. There are two backends: vt100 terminals and the windows console. (Honesly, my main focus is vt100, but any render back-end is possible. I think even graphical)

The same for the event loops by the way, it can run on a couple of event loops. For instance asyncio.

Documentation will follow. prompt-toolkit has already quite a lot of examples, and there's a lot of documentation in the code itself. But I agree that we should keep improving.

Cheers!

[0] https://github.com/jonathanslenders/pyvim/blob/master/pyvim/...

AnkhMorporkian · on April 27, 2015

I want to compliment you on your code. It's so rare to see clean python code. I am almost never able to be able to jump into a file in a module and understand exactly what's happening, but you fucking nailed it. Well done!

notfoss · on April 28, 2015

I believe he follows pep8 ;)

amjith · on April 26, 2015

The prompt_toolkit (core library) which is the basis for pyvim (same author), is well structured and modular. https://github.com/jonathanslenders/python-prompt-toolkit

There are plenty of examples in the repo that cover most of the features in the library. The awesome thing is all of the examples fit in a single page, a testament to the power and simplicity of the library.

I built pgcli (https://github.com/dbcli/pgcli) almost entirely by reading the examples.

Svenstaro · on April 26, 2015

Will you please make pgcli compatible with the most recent prompt_toolkit release?

jonathan_s · on April 26, 2015

@amjith: we should do that indeed. If you don't have the time, I'll help you doing it later on.

evmar · on April 26, 2015

My thought process upon seeing this:

"Python, huh? Seems like typos in uncommon branches of the code would cause it to randomly fail at runtime, losing your work!"

"Now evmar, don't be such a internet nay-sayer, plenty of people write reliable Python code. You just need tests and... yep, there's a tests directory right there in the repository."

"Let's take a look. ...there's only one test!?"

It looks pretty neat other than that, though.

david-given · on April 26, 2015

I am, right now, or at least two minutes ago before I started reading HN, refactoring big chunks of _my_ text editor, WordGrinder. It's a almost-pure-Lua word processor.

In the process I have been writing unit tests. Previously the program didn't have any, which was really dumb. It was dumb because, in the three days I've spent doing this, I have found (and fixed) so, _so_ many bugs.

And now I have a decent set of tests, I can change something, rerun them, and have a pretty good idea of whether it worked or not. I don't even have to run the program!

Unit tests. They Will Save You Time™.

teacup50 · on April 26, 2015

Types. They Will Save You Tests™ :-)

david-given · on April 26, 2015

I had actually gone and looked for Lua static type checkers. Some exist, but none that'll work with Lua 5.2.

I had a really hilarious bug today where under some circumstances all the text in the display would be replaced by numbers. Small integers, each placed where the word should be.

What had happened is that I'd added a layer of indirection; where previously, after line wrapping, the data structure for a rendered paragraph was an array of pointers to the word objects, now it was an array of indices into the paragraph's word array. And I'd forgotten about one particular exotic code path. Lua was seeing the array of indices, automatically casting the ints to strings, and then using those strings instead of the word data itself...

Static types would have made this bug impossible.

...way back when, there was a really nice and thoroughly obscure language called Strongtalk; it was a Smalltalk 80 clone with optional strict types. You could annotate your classes and methods with type information. If it was there, it would be checked; if it wasn't, you got the traditional behaviour. The JIT knew about the type information and could use it to produce really fast code. It combined the ultimate dynamic language with an expressive static type system (complete with parametric polymorphism).

It was open sourced in 2006 and sank without trace. Sigh.

Jach · on April 26, 2015

For your example bug I think you're wrong that static types would have made the bug impossible. What you need is strong type checking and a lack of auto-conversion. For example, in Java, this compiles:

    int[] bodies = {1, 2, 3};
    for (int body : bodies) {
        String formatted_body = "<p>" + body + "</p>";
        callMethodWithStrArg(formatted_body);
    }

And if in this hypothetical case it was previously a `String[] bodies` and a `String body`, I bet a lot of programmers would use an auto-refactoring tool because "static types and auto-refactoring go together for being confident in changes like apples and pie" and I bet the error wouldn't have been noticed even at review time. God help you if you're using a static language without generics that has implicit type conversions. In Python, though, this raises an error:

    bodies = [1, 2, 3]
    for body in bodies:
      formatted_body = '<p>' + body + '</p>'

The error is: "TypeError: cannot concatenate 'str' and 'int' objects".

Dynamically typed languages still have types.

david-given · on April 27, 2015

That's very true (and I really wish that Lua didn't do implicit type conversion of numbers to strings --- it's a major wart on an otherwise very nice language).

I had totally forgotten that Java does it too, despite having done `""+i` lots of times as a cheap and easy and evil way to convert numbers to strings.

...I am currently rewriting a big chunk of the primary data storage to use immutable data structures, because it makes implementing Undo easier. I am having to fight the urge to redo it all in Haskell.

alepper · on April 27, 2015

You might find (Typed) Racket interesting - it offers the features you mention, and is a delight to work with.

ahuth · on April 26, 2015

Types don't tell you if the behavior is correct or not.

dllthomas · on April 26, 2015

Yes they do.

Ignoring trivial cases and dependent types, types tell you one of two things: "this might be correct" or "this is incorrect". Again ignoring trivial cases where exhaustive checking is possible, that's the same thing that tests tell you - and it's a hugely useful thing to be told!

hueving · on April 26, 2015

No they don't. They only tell you that you are matching function signatures correctly. This is valuable but it's not a substitution for unit tests.

dllthomas · on April 26, 2015

"it's not a substitution for unit tests"

It is a substitute for those unit tests that are essentially checking type invariants. It is not a substitute for all unit tests, but I don't think anyone made (much less intended to make) that claim anywhere in this thread.

"They only tell you that you are matching function signatures correctly."

All of computation can be expressed as application of functions, so for sufficiently expressive function signatures that's not much of a limitation. Of course, if you want to guarantee that your compiles terminate, you need to apply some limits to the expressiveness of your function signatures... but there are powerful guarantees you can get out of even so simplistic a type system as C's, if you work with it rather than against it.

curryhoward · on April 26, 2015

Type signatures can completely encode specifications in systems with dependent types. Of course no mainstream programming language supports dependent types, but it's worth pointing out that in principle a type can encode any property checked by a unit test. Actually, dependent types are strictly more powerful, since you can prove undecidable properties which can't be checked by unit tests (e.g., that a function does the right thing for all possible inputs).

wz1000 · on April 26, 2015

Ahem, http://en.wikipedia.org/wiki/Dependent_type

randomfool · on April 26, 2015

Here's some research on the topic, limited to just types- http://evanfarrer.blogspot.com/2012/06/unit-testing-isnt-eno...

Types are one mechanism for static analysis. Better contracts (nullability, valid ranges, etc) goes much further.

coldtea · on April 27, 2015

Nullability and ranges can also be encoded as types in the appropriate type system.

keslag · on April 26, 2015

Types will create a false sense of security...

gaze · on April 26, 2015

You know, I hear people use this excuse to justify bad behavior a lot. I once heard a bike messenger tell me that brakes gave him a false sense of security so that's why he didn't have a brake on his fixed gear bicycle.

hyperbovine · on April 26, 2015

OTOH, I also see a lot of people justifying their point with a single random anecdote. (But I agree wholeheartedly re: fixies and brakes.)

coldtea · on April 27, 2015

To disprove an absolute statement ("all X are Y") all you need is a single random anecdote.

vhost- · on April 26, 2015

Can you elaborate?

yosefk · on April 26, 2015

print(arr[i])

Compiles, runs, fails if i is out of bounds. Which means that you need to test the code that you write, and that even if you have 100% line coverage (or branch coverage or MC/DC coverage or...), it doesn't mean i won't get the wrong value.

People who claim that "once my C++/Haskell/Agda/whatever program compiles, I know it probably has no bugs" thus tempt others to mention "a false sense of security." (Agda might be going further than anyone else with proving that i cannot be out of bounds, AFAIK... though a generally undecidable problem will remain generally undecidable.)

tomjakubowski · on April 26, 2015

I've never seen anyone claim that types (even a very strong type system like Haskell or Rust) mean you don't have to write tests. They just mean you don't have to write the very silly tests you would otherwise have to to feel secure in a dynamically typed language.

yosefk · on April 26, 2015

I dunno. If you cover every source line at least once, that catches the dumb type errors as it does dumb bounds errors etc. If you don't have a test covering every source line at least once, then you will have dumb type errors as well as dumb bounds errors in those uncovered lines. So I don't think dynamic languages force you to write silly tests when you look at it that way.

If however you say, "hey, I seriously don't want to cover all the lines including say error messages", then in Python,

if error: print obj.name

...might be a problem because obj doesn't have a name; in C++,

if(error) cout << obj.name;

...might only blow up because obj is a null reference (which is what you get when you dereference the null pointer, in practice, even though null references aren't supposed to exist); and in a language with non-nullable pointers, the equivalent of the above can only blow up if printing the name (which is surely valid if obj is non-nullable and type-checks as having a name) somehow blows up, which for a string in a memory-safe language is very very unlikely.

So if you leave uncovered lines in your code which we all do then yes, the stricter the type system, the better your chances are, statistically, all else being equal (for instance, the number of lines being the same... which might not be the case.)

Overall the silly tests people sometimes write in dynamic languages are needless IMO and result from over-applying "TDD" or "unit testing" or some other buzzword and/or paranoia of someone coming from a statically typed language background. I personally think I have pretty much the same amount of tests regardless of the type system.

Manishearth · on April 26, 2015

In Rust you can use (wrt exhaustive match) and traits to a really great extent to provide safety.

Quite often I write some Rust code and know that it will work if it compiles (though I still write tests)

simlevesque · on April 26, 2015

https://news.ycombinator.com/item?id=9442562

tomjakubowski · on April 28, 2015

Sure. My reading of that is they will save you from writing some tests. Comment didn't say "they will free you from tests" or something like that.

codygman · on April 26, 2015

> They just mean you don't have to write the very silly tests

I have seen claims and have a hunch that more than just very silly tests can be eliminated by types, but I'm struggling to come up with or remember any examples.

I'm hoping someone who has one will reply with one of these claims or any examples.

bsummer4 · on April 26, 2015

You can't write this in Haskell:

  print(arr[i])

You can write this, but you probably wouldn't.

  print (arr `V.unsafeIndex` i)

dllthomas · on April 26, 2015

Unfortunately, unsafeIndex is often spelled "!". There is a push (which I support!) away from partial functions, but there are still plenty of partial functions provided by standard libraries under names that sound reasonable.

pekk · on April 26, 2015

It's interesting that your first thought was to post a troll about how Python is a bad language.

evmar · on April 26, 2015

I regret that this post is the top of the page. I failed to anticipate that people will take any opportunity to have yet another boring static typing debate.

But to be clear, in my day job I work on an app with 350k lines of Python in it and the reason I know it mostly works is due to our test coverage. As someone else mentioned in this thread, a lack of tests should not give you confidence regardless of the language.

drivingmenuts · on April 27, 2015

From the project docs

"There is no roadmap. I mostly implement the stuff which I need or interests me, or which gives me the opportunity to learn."

Pretty much says it all. Seems it was primarily written to satisfy personal needs, not be the end-all, be-all of professionally written software. It is, incidentally, potentially interesting to others who might want to learn from it, or use it, knowing the background.

So, you might cut the guy a break. The requirements for personal projects aren't the same.

jonjacky · on April 27, 2015

> "Python, huh? Seems like typos in uncommon branches of the code would cause it to randomly fail at runtime, losing your work!"

Maybe not - Python itself can provide some crash protection. Run it this way: python -i run_pyvim.py. Then if pyvim crashes, Python will still be running with all your work still there. Maybe you can just type run() at the Python prompt to resume the session in almost the state you left it -- depending on how run() is coded, how much re-initialization it does.

bbcbasic · on April 26, 2015

Yes indeed any 'rewritten in ...' project should target Haskell.

shangxiao · on April 26, 2015

vs random memory bugs that will fail randomly at runtime ;)

marktangotango · on April 26, 2015

Fyi there's a similar project here:

https://github.com/stefanoborini/vai

kgadek · on April 26, 2015

  Q: Why Python?
  A: The only alternative would be Haskell, but I still have to learn that.

Wow, that would be interesting.

fadsasda432 · on April 26, 2015

Hi, I once tried that, but I ran into problems with lazyness. In particular my data structure was rather simple (famous gap data structure), but editing "large" files (>1000LOC) became rather unpleasant (too big input-feedback latency).

However I managed to build a _very_ basic proof-of-concept editor (no dependencies) in just a few hundreds lines of code which I could explain, but until now I was too shy to share it as it did not involve magic abstract Haskell-foo ... ;)

codygman · on April 26, 2015

> I was too shy to share it as it did not involve magic abstract Haskell-foo

Please don't be, simple understandable Haskell code is very nice :)

Plus if there is a better way of doing it you get to find that out too.

implicit · on April 26, 2015

That's a shame!

"Gets the job done in just a few hundred lines of easily-explained code" is a terrific standard to meet.

"Magic abstract Haskell-fu" cannot improve such a program very much.

wz1000 · on April 26, 2015

Yi[0] is an editor written in Haskell.

[0]: https://github.com/yi-editor/yi

viraptor · on April 27, 2015

It's got an amazing (I'm not sure if that's in a good or bad way) system for reloading config. Config is code. Reconfiguration means recompiling. If you reload the config, you recompile yi and reexec it - file handles and other information is left laying around so you end up in exactly the same state as before. It's slightly scary from the programming side...

wz1000 · on April 27, 2015

Xmonad does the same thing.

euid · on April 26, 2015

Emulates Vim quite nicely, too.

bsummer4 · on April 26, 2015

Not really, no. yi badly needs some more love.

codygman · on April 26, 2015

Care to elaborate on where it falls down emulating vim?

bsummer4 · on April 27, 2015

Yi is really nice, overall. The code is clean too.

I don't remember all of the issues, but there are a ton of small things that make the editor unusable to me. I used it for a couple of weeks, and I spent some time working on these issues, but never had PR-worthy code. Here's what I can remember off the top of my head:

- Startup time is very slow because of the way configuration works. In my local copy, I made a version without runtime configuration, and that solved this problem. This conflicts pretty badly with the whole architecture, so I didn't make a PR.

- :n :N don't work. Opening multiple files from the command line doesn't work.

- :cq doesn't work. I fixed this, but my fix was a hack, so I didn't make a PR.

- Operating on regions with '{' and '}' is off by one line in some directions.

- You can't replace regions with shell commands. For example, using '!}sort' to sort a paragraph.

codygman · on April 27, 2015

Cool! I guess my evaluation of Yi/vim emulation was more cosmetic than I thought :)

esbio · on April 26, 2015

Here is vai https://github.com/stefanoborini/vai , a similar project I started a year and a half ago.

marco2357 · on April 26, 2015

There's a Java version of Vim here: https://www.mtsystems.ch/#section2

It's an automatic translation of the C version.

Like tomp said 4 hours ago: Like vim, just slower.™

haches · on April 26, 2015

Could you make it available as a Github repo? Would allow to browse it without requiring a download.

tomp · on April 26, 2015

Like vim, just slower.™

andybak · on April 26, 2015

From the readme:

Why did I create Pyvim?

There are several reasons.

The main reason is maybe because it was a small step after I created the Python prompt-toolkit library. That is a library which is actually only a simply pure Python readline replacement, but with some nice additions like syntax highlighting and multiline editing. It was never intended to be a toolkit for full-screen terminal applications, but at some point I realised that everything we need for an editor was in there and I liked to challenge its design. So, I started an editor and the first proof of concept was literally just a few hundred lines of code, but it was already a working editor.

The creation of pyvim will make sure that we have a solid architecture for prompt-toolkit, but it also aims to demonstrate the flexibility of the library. When it makes sense, features of pyvim will move back to prompt-toolkit, which in turn also results in a better Python REPL. (see ptpython, an alternative REPL.)

Above all, it is really fun to create an editor.

krick · on April 26, 2015

So is it vi clone on vim clone, exactly? I just noticed some usual things are not working (ZQ, for example), but I don't remember, maybe it's supposed to be that way in vi.

rch · on April 26, 2015

This looks great - very hackable. I think I'd like to try adding support for HDF5 files via h5py.

mkonecny · on April 26, 2015

Was this more of a blackbox re-implementation, or is more of a 1:1 code port?

jonathan_s · on April 26, 2015

It's a blackbox re-implementation. In Python, things are done different from C and there are other libraries are available (Pygments for instance). I also don't want to claim that it is as powerful as Vim is. But it should be stable, easy to install, and especially usable for Python development.

Yadi · on April 26, 2015

This is awesome, I just tested it out :)! It would be cool to have the possibility creating the extensions in future.

chestervonwinch · on April 26, 2015

Tried it on OS X. Everything blinks - as in blink-tag blinks. Not sure what's going on with that...

jonathan_s · on April 26, 2015

Can you tell me what terminal application you are using? Does it happen as well in a new terminal?

chestervonwinch · on April 26, 2015

default terminal app on osx 10.6. happens in a new terminal.

avinassh · on April 27, 2015

Is there any reason you are still using 10.6?

chestervonwinch · on April 27, 2015

old laptop. no $$.

rhapsodyv · on April 26, 2015

Is there any vim clone that actually have full support to vimfiles?

undergrowth54 · on April 26, 2015

Does https://github.com/neovim/neovim not?

rhapsodyv · on April 27, 2015

But it isn't a clone.

Every clone I see try to reproduce vim usage, but none really try to run vimfiles. I think running vimfiles is a must have for a clone to get real users.

andor · on April 26, 2015

This is really nice. Does anybody know whether there's a comparable (real time, asynchronous, highlighting symbols not lines) syntax checker for neovim?

yoanizer · on April 27, 2015

Hello, I have a question, what made you want to start this project? What do you not like about main branch vim?

morekozhambu · on April 26, 2015

awesome !!

Does it work with unicode Indic characters?

jonathan_s · on April 26, 2015

Yes, it should normally work with all unicode characters. (If not, please file a bug.)

eivarv · on April 26, 2015

Very cool!