Method Call Syntax in a JS-like Language

November 23rd, 2016

As a side-project, I'm working on a small JavaScript-like programming language called Plush. Like JavaScript, this language is going to have objects and method calls. In this blog post, I'm going to discuss some seemingly trivial aspects of method call syntax and semantics which illustrate why programming language design is difficult, and solicit your feedback as to potential solutions.

Like JavaScript, plush has object literals which can be defined with the curly brace syntax. It will also have function objects, which are in fact closures. In JavaScript, you can assign a function to an object property, and it becomes callable as a method, like so:

o = {}; // empty object
o.m = function (x) {...};
o.m(2); // call the function in property m as a method

This all seems fine and dandy, but there are some issues with the way JavaScript implements this. The first is that method call syntax introduces some "hidden" semantics. When you call a JS function as a method, there is a hidden this argument (a reference to the object the method is called on) which gets passed to functions. This is invisible to the programmer. Hence:

// The statement below:
o.m(2);

// Is not equivalent to these statements:
f = o.m;
f(2);

// But it is equivalent to these two statements:
f = o.m;
f.call(o, 2);

The passing of the hidden this argument is a little bit annoying because it breaks the programmer's intuition in a small way. JS syntax makes method calls look like regular function calls, but there is a difference. Sure, it's not a big deal, but I ran into this issue while implementing IO functions in Plush. I wanted to implement some C++ "host" functions which would allow the language to interface with the outside, as such:

io = import "core.io"; // import the IO module

io.print("hello world!"); // here 'print' is a host function written in C++

Maybe you see where this is going. If I implement method calls as JavaScript does, then every function, including host functions, need to take a hidden this argument, even if they don't need it. I can work around this by making a special case where host functions called as methods behave differently, but then I potentially have to add an extra dynamic check at every function call. This gets annoying. I can work around this issue by wrapping my C++ host functions into plush functions which handle the unneeded this argument.

The second issue I ran into involves the way the this argument is handled. JavaScript has this problem where if you declare a closure inside a method, you can't directly access the parent function's this argument, as described in this blog post. This annoying quirk of the language was "solved" by the introduction of Function.prototype.bind. The JS workaround seemed like an ugly kludge to me, and so I thought "why not just do it like Python"? Why not make the this argument explicit. Make programmers declare and name the this value, in the same way that Python forces you to declare the self argument in methods.

Upon first examination, making the programmer explicitly declare the this argument seems like a fairy good idea. However, it does have the annoying consequence that an argument that was previously hidden is now a positional argument. Consider the following scenario:

// An IO module is imported, this produces an object containing IO functions
io = import "core.io";

// The print method is "extracted", so that we can call it with a shorthand name
print = io.print;

// This works fine. It passes a hidden `this` argument value to the print method
io.print(2);

// ERROR: we are not passing a `this` argument value
print(2);

In the case of modules, it's clear that the print function shouldn't even need a this argument value. I pondered this problem, and had the idea that possibly, method calls could have a different syntax from property accesses. The "arrow syntax" would make explicit the passing of the this argument:

// Call of the method m of object o
// The "arrow syntax" is specifically for method calls, passes a `this` argument
o->m(2);

// Equivalent function call, passing an explicit `this` argument
o.f(o, 2);

io = import "core.io";

// Regular function call, `print` doesn't take `this` argument, not a method
io.print(2);

// This works correctly, as one might expect
print = io.print;
print(2);

The solution I came up with is not perfect, it raises some potential problems. For one, with a special arrow syntax for method calls, it becomes possible to call object properties using both regular function calls, and arrow style method calls which pass a this argument. Experience tells me that if both styles are possible, people are going to use both, which could be messy. For example, what do you do if you have a set of methods which do not need a this argument? Do you declare one anyway? Would you end up with a mix of regular function calls and method calls on the same object?