Arc Forumnew | comments | leaders | submitlogin
Prototypes/methods vs. extending.
2 points by Pauan 5111 days ago | 129 comments
Let's suppose you wish to create a custom table type. How would you go about doing that in pgArc? I would imagine it would look something like this:

  (extend eval (x env) (isa x 'table) (x 'raw)) ; probably broken
    
  (extend apply (f . args) (isa f 'table)
    (orig f 'get args))
    
  (extend sref (x v n) (isa x 'table) (x 'set n v))
  
  (extend table ()
    (let tab (orig)
      (fn (x n v)
        (case x
          'raw tab
          'get (tab n)
          'set (= (tab n) v))))) ; possibly broken

          
  ; after all that work, we can finally create a custom table type...!
          
  (def my-table (x n v)
    (case x
      'raw ...
      'get ...
      'set ...))
Actually, that won't work, because you can't properly extend apply and eval in pgArc. But even if you could, talk about hacky! All that work just to make a custom table? Why not build that functionality into Arc itself, rather than requiring grafting it on...?

Another example: stdin returns an input stream, and it'd be nice if Arc users could create custom input streams as well, that could be read by readc, readb, peekc, etc.

This is similar to the above example with a table. You would need to extend every function individually. And then there's the problem that your custom input stream would need to return different things in different situations, like with peekc vs readc. Here's a possible solution:

  (with (old-p peekc
         old-r readc
         old-s (stdin))
         
    (def stdin ()
      (fn (x)
        (case x
          'peek (old-p old-s)
                (old-r old-s))))
                       
    (def peekc (x)
      ((coerce x 'input) 'peek))
      
    (def readc (x)
      ((coerce x 'input))))
      
  
  ; after all that work, we can finally create custom input streams...!

  (= my-stream (annotate 'input (fn (x)
                                  (case x
                                    'peek ...
                                          ...))))
Gosh, another situation where users will need to jump through hoops and extend stuff just to create custom data types... in fact, these two situations share a common feature: they hide internal data.

What do I mean by that? A table has internal data, obviously, and apply and eval know how to get that data. The same goes for an input stream: peekc and readc are built-in primitives because stdin has special data that only they can read.

But of course, Arc code cannot create those special types... all we have are functions. So rather than defining the primitives so they operate on internal data (that is not accessible or creatable within Arc), we can define them so they work on functions instead.

Now, Arc programs can create functions, and so this makes it trivial to create custom data types that work correctly with built-in primitives like readc or peekc. In other words, I'm proposing that all complex/compound data types be implemented as functions/objects, thus allowing Arc code to create them.

Things like tables are self-contained, so it makes sense for them to behave like objects in OOP languages (a get/set method, for example). But, there's two basic ways to do this: you can either pass a symbol as an argument to a function (which designates the "key"), or you can create an object that has functions as keys:

  (def my-table (x)
    (case x
      'raw ...
      'get ...
      'set ...))

  (= my-table
    (obj raw (fn () ...)
         get (fn () ...)
         set (fn () ...)))
They're roughly comparable as far as verbosity goes, but there's one particular advantage that the first approach has: it lets you do something when a "key" isn't found:

  (case x
    'raw ...
    'get ...
    'set ...
         "no keys matched!")
This is equivalent to "method_missing" in Ruby. However, the second approach has one advantage: each key corresponds to a function, so you can extract them individually:

  (= set (my-table 'set))
  (set)
Perhaps we should simply implement method_missing (or similar) on tables, and use that? An idea: tables could use nil as a key, which would behave like method_missing:

  (= foo (obj nil 'bar))

  (foo nil)  -> 'bar
  (foo 'foo) -> 'bar
  (foo 'bar) -> 'bar
Basically, when a table can't find the key, rather than returning nil, it would return (foo nil) which would default to nil. This isn't quite as extensible as using a function, but it should cover 95% of use-cases.

Also keep in mind that if the concept of methods became common enough, I don't think it'd be hard to create a macro for it:

  (proto my-table ()
    raw ...
    get ...
    set ...)
It may not seem like much of an improvement, but it also lets you specify a "parent" that it would inherit from:

  (proto my-stdin (stdin)
    peek ...)
This causes my-stdin to behave like stdin in every situation except when called with 'peek. How would you do that in Arc right now? Note: you can't just overwrite stdin, because you want stdin to keep it's old behavior.


2 points by Pauan 5110 days ago | link

I noticed that there seems to be some confusion and resistance about my idea. The resistance is probably due to confusion, so I'm going to attempt to conclusively prove that message passing is significantly superior to the current way of creating new data types. A bold claim, can I back it up? Read on.

I will write this post as a series of examples. These examples will show things that should be trivial, but in fact end up being far more complicated in wart/Anarki/pgArc.

For the rest of this post, when I refer to "wart" I am referring to wart/Anarki, because Anarki does things in basically the same way. For every example, pgArc will be at least as verbose as wart, and in some cases it is impossible to make the example work in pgArc.

This is not a post designed to say, "wart sucks", instead it's a post designed to say, "look! message passing is awesome!"

First, let us define a new table type, which we shall call a "clone". A clone behaves completely identically to a table in every way, with only one exception: if it can't find a key, it will check it's parent. On the left side, I will show the way to accomplish this in wart. On the right side, I will show the same way using message passing.

  (def clone(x)                                   ;  (def clone(p)
    (annotate 'heir (obj val (table) parent x)))  ;    (let t (table)
                                                  ;      (annotate 'table (fn (m k v) 
  (defcall heir h                                 ;                         (case m
    [or rep.h!val._                               ;                           'keys (join (keys p) (keys t))
        rep.h!parent._])                          ;                           'get (or (t k v) (p k v))
                                                  ;                           'set (sref t k v))))))
  (defset heir(h key value)                       ;
    (= rep.h!val.key value))                      ;
                                                  ;
  (def iso(x y) :case (isa x 'heir)               ;
    ...)                                          ;
                                                  ; 
  (def fill-table(tb data) :case (isa x 'heir)    ;
    ...)                                          ;
                                                  ;
  (def keys(x) :case (isa x 'heir)                ;
    (join (keys rep.x!val)                        ;
          (keys rep.x!parent)))                   ;
                                                  ; 
  (def vals(x) :case (isa x 'heir)                ;
    (join (vals rep.x!val)                        ;
          (vals rep.x!parent)))                   ;
          
I gave up on iso and fill-table because they would have ended up being much too verbose and complicated. I expect if I had tried to support them, wart would have become twice as verbose as it already is. All of that just to make a measely table type? Wart doesn't even work according to the spec... because it needs to create a new 'heir type. Why is this a problem? Hold that thought, let me show you another example.

In this case, we're going to use AVL trees (http://arclanguage.org/item?id=14181) to represent a table. This is nice because AVL trees have good worse-case performance:

  (def avl-table(x)                                  ;  (def avl-table()
    (annotate 'avl-table ...))                       ;    (annotate 'table (fn (m k v)
                                                     ;                       (case m
  (defcall avl-table h                               ;                         'keys ...
    ...)                                             ;                         'get ...
                                                     ;                         'set ...))))
  (defset avl-table(h key value)                     ;
    ...)                                             ;
                                                     ;  
  (def iso(x y) :case (isa x 'avl-table)             ;                           
    ...)                                             ;
                                                     ; 
  (def fill-table(tb data) :case (isa x 'avl-table)  ;
    ...)                                             ;
                                                     ;
  (def keys(x) :case (isa x 'avl-table)              ;
    ...)                                             ;
                                                     ; 
  (def vals(x) :case (isa x 'avl-table)              ;
    ...)                                             ;
    
I'm not going to actually create avl-tables, so you'll have to imagine the implementation. Once again, in wart, we ended up needing to create a new type 'avl-table, and then re-extend all the table built-ins, and call defcall and defset on the new type... Not only is this significantly more verbose, but it's also inefficient and inextensible.

To answer why it's inextensible, I will ask a question: what does it mean when something has a type of 'table? You cannot say, "it's implemented as a hash" because Arc specifically leaves the implementation undefined. A table in Arc could be a hash table, or an alist, or AVL trees, or anything, really. So what does "table" mean? It is an interface. When Arc code sees a variable my-table that has a type of 'table, it expects all the following to work on it:

  (my-table 'foo)    -> get the key 'foo
  (= mytable.'foo 5) -> set the key 'foo
  (keys my-table)    -> get the keys
  (vals my-table)    -> get the values
  
The above is the interface for tables. If your data type can do all of the above, then you can legitimately call it a table. If it can't, then it's not a table. In other words, types represent an interface. They specify behavior. When something is of type 'table, you expect it to behave like a table. You don't care whether the table is created with AVL trees or conses, you only care that it supports the table interface.

So the problem with wart (and Anarki, and pgArc) is that it requires you to create a new type every time you want to create data. But we don't care about the implementation, we only care about the interface. Consider the function `each`. It has different behavior depending on whether the variable is a cons or a string or a table. Let's represent that as follows:

  (def each (x)
    (case (type x)
      'cons ...
      'table ...
      'string ...))
      
The thing is... in wart, 'heir and 'avl-table support the table interface. But they don't have a type of 'table, so now `each` needs to be re-written like so:

  (def each (x)
    (let t (type x)
      (if (in t 'table 'avl-tree 'heir) ...
          (case t
            'cons ...
            'string ...)))
          
This is ridiculous! `each` doesn't care about how the tables are implemented, it only cares that they behave like tables. Can it be made better? Yes, by creating a global function `table?`:

  (def table? (x)
    (in (type x) 'table 'avl-tree 'heir))

  (def each (x)
    (if (table? x) ...
        (case (type x)
          'cons ...
          'string ...)))
But that doesn't solve the problem of needing to extend all the relevant built-in types... it applies a bandaid to the problem, but doesn't solve it. Meanwhile, message passing doesn't even have that problem, since your table variants can have a type of 'table. No need for a `table?` function.

Let's look at one final example. Let's create an alist type... specifically, something that combines the behavior of 'cons and 'table. Here's what I mean by that:

  (= my-alist (alist x 1 y 2))
  
  (my-alist 'x)     -> 1
  (= my-alist.'x 5) -> 5
  (car my-alist)    -> (x 5)
  (type my-alist)   -> cons
  
How would you go about doing this in wart...? Impossible or at the very least super incredibly difficult. How would you do this with message passing?

  (def alist args
    (let l (pair args)
      (annotate 'cons (fn (m k v)
                        (case m
                          'get (assoc l k)
                          'set ...
                               (attr l k v))))))
Perhaps you are scared because I used OOP words like "methods" and "prototypes". For that, I apologize. Instead, I will focus on a simpler, yet more powerful idea: messages. Alan Kay (the creator of Smalltalk) said that objects aren't important: messages are [1]. I think I understand now what he meant by that.

Hopefully we all here realize that modern OOP is a mistake. They claim that it leads to more extensible code, yet I have not seen evidence for that. What I have seen is that it tends to result in a lot of unnecessary verbosity and fiddling with type hierarchies. There is a simple, fundamental, core reason for this: they try to use objects for everything.

Functions are like verbs. Most programs do something, so writing in a functional style is very good for writing most programs. In the OOP world, however, they try to write everything using nouns... which of course is ridiculous.

I claim, that functional programming (though it has great merits), is not a silver bullet that is perfect in every situation. When representing action, functional style is great. But when representing data, it falls apart. I claim that this results in the same verbosity and rigidness that OOP creates.

Don't believe me? Look at the examples above. Now look again. Now say with a straight face, "functional style can create data types just as easily as message passing"

Message passing is a natural, easy, short, and extensible way to represent data: things like conses, tables, input streams. Functional style is a natural, easy, short, and modular way to represent programs: doing something to data. By combining these two styles, Arc can become exceedingly short, and hackable.

* [1]: http://lists.squeakfoundation.org/pipermail/squeak-dev/1998-... http://userpage.fu-berlin.de/~ram/pub/pub_jf47ht81Ht/doc_kay...

"OOP to me means only messaging, local retention and protection and hiding of state-process, and extreme late-binding of all things. It can be done in Smalltalk and in LISP. There are possibly other systems in which this is possible, but I'm not aware of them."

Note: I've probably written less than 10 lines of Smalltalk, but that doesn't change the fact that I can clearly see how message passing reduces verbosity and increases extensibility.

-----

2 points by akkartik 5107 days ago | link

"in wart, 'heir and 'avl-table support the table interface. But they don't have a type of 'table, so now `each` needs to be re-written.."

In wart, each doesn't check the type of the sequence.

https://github.com/akkartik/wart/blob/81a6dc27f7ebe58bd89fc0...

https://github.com/akkartik/wart/blob/81a6dc27f7ebe58bd89fc0...

This works because coerce is extensible. All I had to do was define how to coerce tables into lists: https://github.com/akkartik/wart/blob/81a6dc27f7ebe58bd89fc0... and arc's transparent destructuring takes care of the rest.

---

Nobody here is scared of anything, least of all 'OOP words'. You may find it more helpful to phrase us as 'unconvinced that methods are needed in a lisp'. A previous iteration of def :case was called defgeneric, named after the mechanism in CLOS, so there's plenty of history of people trying out OOP ideas in lisp, or in this forum.

---

Somehow I didn't notice this post until you linked to it at http://arclanguage.org/item?id=14307, even though I responded to its descendants! Perhaps that's one reason posts don't get responses (http://arclanguage.org/item?id=14264)

-----

1 point by Pauan 5107 days ago | link

Yes, except rocketnia doesn't like coerce either. I do like coerce, and I think it's a good idea for `each` to coerce to 'cons (or a similar iterable thing). I think coerce + extend + message passing would make for fantastically short and extensible code.

---

"Nobody here is scared of anything, least of all 'OOP words'. You may find it more helpful to phrase us as 'unconvinced that methods are needed in a lisp'."

I have already produced plenty of evidence that extend is over twice as verbose as message passing, more difficult to use, and doesn't play well with modules. When it comes to describing data types, I have not seen a single advantage that extend has over message passing. What more do you want? Should message passing do tap dancing too?

  (def tap-dancer ()
    (annotate 'dancer
      (fn (m)
        (case m
          'tap-dance (prn "tap dancing now")
                     (err:string "did not understand the message " m)))))

  (= dancer (tap-dancer))
  (dancer 'tap-dance)   -> "tap dancing now"
  (dancer 'break-dance) -> error: did not understand the message break-dance

-----

2 points by rocketnia 5107 days ago | link

"Yes, except rocketnia doesn't like coerce either."

For the record, that's true, but I do follow a coercion pattern sometimes. See the end of http://arclanguage.org/item?id=14220: "To digress a bit, coercion is a positively useful design pattern when...."

As it happens, 'each is one of those times I'd use a coercion pattern. I'd probably define these utilities, with these dependencies:

  each  -->  some  -->  seqify  -->  entries  -->  keys  -->  unwrap-my-table
To introduce the type 'my-table and give it support for 'each, I only need to extend 'keys so that it tries 'unwrap-my-table.[1] If I do that, 'seqify will already be able to coerce my type to a lazy list.

[1] If the system doesn't support failcall, I may also have to extend 'supports-keys so that it tries 'isa-my-table. The diagram would be about twice as big if I included these predicates.

-----

1 point by akkartik 5107 days ago | link

I am not convinced your tiny example proves anything. We aren't playing code golf here to make things shorter by any means. Expressive constructs make for shorter programs, and I still believe methods are no more expressive than anything else. I think you're moving parens around.

One way to add evidence is to actually have an implementation of an arc-like language with methods (like I have done with wart). And I don't mean half-baked pseudocode that I have to spend time massaging. (http://arclanguage.org/item?id=14288) Who knows if I would massage it like you would have? You're leaving too much unspecified to persuade me.

"What more do you want? Should message passing do tap dancing too?"

I'm not going to respond if you talk like that.

Keep running with your ideas instead of waiting for us to catch up. If they're good you'll eventually bring us around. Expecting validation from others before moving onward is a recipe for misery. So we're fools. Ignore us.

-----

1 point by Pauan 5107 days ago | link

...what? "moving parens around"?

http://arclanguage.org/item?id=14261

Look at the first example there. That is a perfectly reasonable and trivial thing to do (creating a table that can inherit from another table). Message passing is less than half the size of wart... And that was without implementing iso and fill-table, which would make wart even more verbose...

That is not "moving parens around". This is a clear example where message passing is less than half as verbose as extend. Message passing should always be at least as good as extend, and in some situations will be significantly better. If you would rather use extend and write programs that are over twice as long, then go ahead... but I thought one of the main points of Arc was brevity.

---

"I'm not going to respond if you talk like that."

It was a joke.

---

"One way to add evidence is to actually have an implementation of an arc-like language with methods (like I have done with wart)."

As said, I plan to implement this in py-arc.

---

"Keep running with your ideas instead of waiting for us to catch up. If they're good you'll eventually bring us around. Expecting validation from others before moving onward is a recipe for misery. So we're fools. Ignore us."

Yes, you're right. I simply don't understand why, when I present clear evidence that extend is either equal to, or worse than message passing in every way pertaining to data types... it isn't accepted. What is it that is preventing acception? Is the evidence not clear? Are my words confusing?

-----

1 point by akkartik 5107 days ago | link

...what? "moving parens around"?

If you would rather use extend and write programs that are over twice as long, then go ahead

How would your approach clone from strings rather than from tables? Those are the sorts of questions I am talking about. Are methods useful if they can only clone tables?

The main point of arc is brevity across a range of programs.

"As said, I plan to implement this in py-arc."

I enjoy our discussions, but I don't understand how I am expected to be persuaded in the absence of running code. I'm happy to agree to disagree until you have something I can take the trouble to install, run, and make your complaints in http://arclanguage.org/item?id=14313 about :)

-----

1 point by Pauan 5107 days ago | link

Strings are annotated conses, so you can use message passing for them too. Message passing works for all data types, even int and num. What does "cloning a string" even mean, by the way?

Let me look at an example that is a bit more string-oriented: we want to make a string that can do substring searches with function notation. In other words, we want this to work:

  (= my-string (pos-string "foobar"))
  (my-string "foo") -> 0
  (my-string "bar") -> 3
  (my-string 0)     -> #\f
  (my-string 3)     -> #\b
Here's how to do that with message passing:

  (def pos-string (x)
    (annotate 'string
      (fn (m k . rest)
        (case m
          'get (case (type 'k)
                 'string (posmatch k x)
                         (apply x m k rest))
               (apply attr x m k rest)))))
How would you do this with extend? Can it be done without extending every single built-in that uses strings?

-----

1 point by rocketnia 5107 days ago | link

Here's a hopefully clearer and less buggy version of your example:

  (def pos-string (orig-string)
    (annotate 'string
      (fn (message key . rest)
        (if (and (is message 'get) (isa key 'string))
          
          ; This message represents a function call with a string as the
          ; first argument. Let's use 'posmatch for that.
          (posmatch key orig-string)
          
          ; We don't handle this message. Let's use 'attr to propagate
          ; raw message to the original.
          (apply attr orig-string message key rest)))))
---

"Can it be done without extending every single built-in that uses strings?"

There aren't that many axiomatic things strings do in Arc. In my mind, a custom string type would only have to support two operations: being called as a function; and being coerced to a lazy list, sent through a function, and coerced back. If a string-like thing supports those two behaviors, the core utilities should be able to handle it.

Arc's strings also support 'sref, but I pretend they don't. :-p

The "being coerced to a lazy list, sent through a function, and coerced back" utility may seem overcomplicated. It's the solution I prefer to enable this behavior:

  arc> (map inc "abcde")
  "bcdef"
This is implemented in pg-Arc using a 'coerce back to the input type, but if a pos-string has a type of 'string, then coercing the output to its original type will probably just make a core string, rather than a pos-string.

-----

1 point by Pauan 5107 days ago | link

"less buggy"? You're using k when it should be key. :P It is shorter, though, so thanks for that.

---

"There aren't that many axiomatic things strings do in Arc"

Yeah, I know. Tables seem to be the big pain point. Possibly conses too, if somebody wanted to extend those... tables seem like they'd be the most common thing to extend, though, and also the trickiest.

---

"[...] being coerced to a lazy list, sent through a function, and coerced back [...]"

Oh! Oh! I know! Let's use message passing! :P We can give strings an 'iter message! Just like Python! Whooo! I'm reminded of the Jamie Zawinski quote, "Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems."

Actually, I think it'd just be easier to define a `coerce` rule for 'iter, no need for message passing in that situation. Now, as for the actual creation of iterators, that could be done with message passing. :P

Then again... you're right that if pos-string uses the same type 'string as built-in strings, then `coerce` won't be able to know that it should use pos-string... unless you added in a message to the iterator, of course. But then that won't play as well with coerce...

I guess that is one example where the current way of coerce + extend + new-type-every-time is actually better.

-----

1 point by rocketnia 5107 days ago | link

"You're using k when it should be key. :P"

Fixed! Now it oughta be less buggy. ^^

---

"Tables seem to be the big pain point. Possibly conses too[...]"

I could be wrong, but I think it's enough for a table to extend 'keys, function call behavior, and 'sref. <Insert usual disclaimer that there'd need to be predicates like 'supports-keys in the absence of failcall.>

For Penknife's purposes, the only reason I care about cons cells is that they're eager implementations of nonempty lazy lists. They should support function call behavior, 'sref, and the one and only lazy list operation:

  (iffirst first rest seq
    <...thing to do with first and rest if the seq isn't empty...>
    <...thing to do if the seq is empty...>)
If cons cells are mutable and allowed to be improper, then they need 'scdr too, and it might be more elegant to add 'scar, 'car, and 'cdr.

-----

1 point by Pauan 5106 days ago | link

"I could be wrong, but I think it's enough for a table to extend 'keys, function call behavior, and 'sref."

And the built-in `table?` as well, so other code recognizes it as a table.

-----

1 point by akkartik 5107 days ago | link

I don't understand your example. I think my brain is fried by this conversation. Let me know when you have an example that I can actually run.

-----

1 point by Pauan 5107 days ago | link

What I'm saying is that (my-string "foo") would be equivalent to (posmatch "foo" my-string). I'll try to get message passing working in py-arc.

-----

1 point by Pauan 5107 days ago | link

"The main point of arc is brevity across a range of programs."

Message passing is general... it can apply to all the built-in types, and can also apply to all user-created types. It does increase brevity across a range of programs. You could even write all Arc code in that style, but I don't recommend that. I'm restricting my discussion of message passing to data types because message passing is clearly superior with regard to data types. I make no claims about the superiority of message passing with regard to things other than data types.

I've been using specific examples because as aw mentioned (http://arclanguage.org/item?id=14264) it's important to explain the problem that it solves, and also demonstrate how it solves the problem... but the concept is general.

In any case, since you're demanding runnable code, it's up to me to put my money where my mouth is.

-----

1 point by Pauan 5110 days ago | link

I will also note that all it would take to convince me that I'm wrong is to demonstrate an alternate system that is just as short and extensible as message passing. I have not seen such a system.

rocketnia seems to dislike types, and I agree: a proliferation of types is troublesome. But we only have two routes: we can accept that types exist in Arc, or we can make Arc typeless, like GEDANKEN [1].

Simply saying "don't use isa" doesn't solve the problem... because you still need to extend everything. I have clearly demonstrated that with the current type system, creating new types, or variants of existing types, is far too difficult. But with message passing, it becomes trivial. Do you have a better suggestion?

* [1]: http://portal.acm.org/citation.cfm?id=362364

I don't have an ACM account, so I can't read that book, unfortunately, but it seemed relevant to the discussion.

-----

5 points by aw 5109 days ago | link

A few random observations, in case they are helpful to you:

- I often don't get anyone agreeing with me in the forums (http://arclanguage.org/item?id=14263 notwithstanding :)... I notice that people often will spend their time instead to share their own ideas, concerns, and criticisms... or say nothing if they don't have anything additional to contribute. (Some of my best hacks have gotten little response in the forum when I first post about them). Thus I suspect you shouldn't necessarily take lack of positive agreement as evidence of disagreement.

- One pattern I've noticed on the forum is that someone will drop by and say "Arc would be a better solution for my problem if it had (immutable lists | software transactional memory | message passing | documentation | unit tests | libraries | ...)". It's more rare for this to be followed up by an actual runnable implementation of the feature.

- When someone does provide an implementation, they sometimes appear to be disappointed when their solution isn't adopted by other people. I suspect maybe there may be some confusion about the nature of problems and solutions: even if say X is an excellent and succinct solution to problem A, someone like myself for example may not take the trouble of adapting it if I happen not to have problem A. And... even if I do have problem A, X may or may not be a useful contribution to a solution for my problems A+B. When I'm trying to solve A+B, I may remember X and then try it out and see if it is useful for A+B, but it can be hard to know in advance whether I'll personally find X useful until I'm seriously looking at the details of A+B.

- I enjoy sharing solutions I've implemented to problems I've had, and I'm pleased when I learn that someone else has found them useful as well. But it can take a year or more after I first publish a hack for me to learn that someone is using it... which is fine for me since I'm aware that there can be a long feedback cycle, but I notice other people become discouraged when nothing seems to be happening in a shorter time frame.

To maximize the likelihood that other people will be able to make use of your solution, you may find this template helpful:

- explain the problem the solution solves. (People often omit this step: they immediately start off with describing the solution. But while it may be easy to see that a solution is elegant, it can be hard to see what it's a solution for).

- describe the solution. (Most people do this part).

- demonstrate how the implementation solves the problem. (This part is also often omitted, but a few concrete examples can do wonders to help people make the connection between the solution and a problem they have).

The forum itself tends to be a poor place to do this because of its modest formatting capabilities and the inability to edit posts later means that descriptions become disjointed, spread over multiple comments and posts.

-----

3 points by thaddeus 5109 days ago | link

I agree with most of what you're saying, but I will add a few things:

There are, generally speaking, two types of people who visit this forum. The first, type-A let's call them, are hardcore hackers, like yourself(ves), who are interested in the language design and the underbelly mechanics of language assembly. Then there are people, like myself, whom are only interested in using the language to build applications.

When I read: > "Arc would be a better solution for my problem if it had...", "It's more rare for this to be followed up by an actual runnable implementation of the feature." I will suggest the two types of people represent the divide within the pattern you're recognizing and I will going to even further to suggest this is not a bad thing. I would like to encourage more type-B user engagement - even, if only, so that I am not all alone :)

While I don't have the time or inclination to understand the full content/depth within these type-A posts, I do try to learn what I can even if only at a surface level. I really do enjoy reading these posts and do appreciate them so I hope they don't go to a back channel as akkartik has mentioned a possibility for.

Furthermore, I would like to highlight some of the points aw's message that I feel are really important (and with my interpretation):

* Inline comments are not the best way to present your ideas. I've struggled assembling all the bits and pieces and knowledge requirements from prior posts/threads. The order alone messes everything up and I simply do not have the time to do the paper chase.

* Writing an external blog post will likely force you to present a fuller picture, with its surrounding context, to an audience that can't always be on top of things.

All that said, please don't give up on the ideas, we need more of them - not less.

-----

2 points by aw 5109 days ago | link

I would like to encourage more type-B user engagement

I added an "Arc Wish List" page to the wiki: https://sites.google.com/site/arclanguagewiki/wish-list It's currently empty :-) Perhaps some type-B users would like to fill in some things they'd like implemented? ^_^

Oh, and by the way, there's an infinite amount of room available in the wiki for other things such as proposals, ideas, arguments, implementation approaches, and so on. So if you're looking for a place where you can write things out and edit them later, the wiki is great place to do that. (You can of course also write your own blog post if you want, and we can link to it from the wiki as well).

-----

1 point by thaddeus 5109 days ago | link

I'm working on it :)

I took a break to write a blogging platform from scratch that includes my own custom markdown.

I even posted my first arc topic here: http://blackstag.com/blog.posting?id=2. I then quickly discovered the auto spam bots are moving from the arc forum to my blog site every night. So I took another break, lol, just to put some spam management code in. I'm glad had read enough of pg's code to implement some good techniques similar to news.arc.

Now I'm ready to do more blogging - I hope. I've accumulated a list of 9 or 10 posts and I plan to get some arc stuff in there as well.

I swear, I don't know how people find the time!

-----

1 point by Pauan 5109 days ago | link

"...there's an infinite amount of room available in the wiki..."

You sure you want to encourage DoS attacks? :)

-----

2 points by akkartik 5109 days ago | link

I notice other people become discouraged when nothing seems to be happening in a shorter time frame.

I went through the five stages of grief after submitting http://arclanguage.org/item?id=12657. It took me literally weeks to get right, and it sat at I think 1 point for nearly a week (with just evanrmurphy's positive comment[1]).

But eventually I realized much of what you said :) I have more realistic expectations now. Points are nothing, and responses come in time.

[1] While evan was active you could tell when he'd seen something because it would go to 2 points :) (without suggesting that he upvoted everything)

Evan would often articulate agreement or admiration even if he had nothing to add, and it was useful. But I'm not sure we should all do it.

Perhaps all this is a sign that we need a back channel.

-----

1 point by akkartik 5109 days ago | link

Yeah it seems people here don't express agreement unless they can add something useful.

Some of my best hacks have gotten little response in the forum when I first post about them

Because some of your best hacks left me speechless :)

Things I see here often take a while to digest, days or weeks. It's taken me over a year to fully appreciate extend, a road that took me through various dead-ends.

-----

1 point by akkartik 5109 days ago | link

It was just when I got up this morning (thinking about http://arclanguage.org/item?id=14253) that I realized what you were suggesting wasn't just a syntactic thing of putting objects before methods, but doing away with types altogether. You're saying the language shouldn't allow us to say isa x 'table, only responds_to x tableness (http://arclanguage.org/item?id=14258).

So yeah sometimes it takes people a while to digest the implications of a solution (http://arclanguage.org/item?id=14264), but sometimes they just haven't understood what you're saying :)

-----

1 point by Pauan 5109 days ago | link

Yeah, you must be thinking of rocketnia.

What I'm suggesting is to get rid of the current view of types. The `type` and `annotate` and `isa` functions would still be used, just for a different purpose: defining interfaces.

So in that sense, yes, you're right. But it's not to the extreme of rocketnia; I'm merely suggesting to change how they're used.

-----

1 point by rocketnia 5109 days ago | link

Hmm, I wonder if you're getting Pauan and me mixed up. >.> I'm against 'isa and 'type almost altogether, but I think Pauan considers 'type to be part of the high-level interface of a value:

  (type my-alist)   -> cons

-----

1 point by Pauan 5109 days ago | link

I consider it to be a part of an interface. If we're using duck typing (which I recommend), what happens when two functions define different behavior, but with the same names? All sorts of subtle bugs ensue. But by leveraging the type mechanism, we can group these differing behaviors into interfaces, avoiding the problem completely.

-----

1 point by akkartik 5107 days ago | link

"For the rest of this post, when I refer to "wart" I am referring to wart/Anarki, because Anarki does things in basically the same way."

I'm flattered :) but I think people are about as reluctant to try out my wart as they are to try out your methods.

I was tempted to more aggressively push wart in this forum, but I'm glad I was ruled by my doubts, because wart turns out to have more warts than arc :) Any new idea takes a while to be digested, to try out in various situations.

http://akkartik.name/blog/1451852

-----

1 point by Pauan 5107 days ago | link

I'll note that I like some of the stuff you put into wart, so my reluctance is three-fold:

1) downloading SBCL and wart, and fiddling with it to get it working

2) learning how wart differs from Arc, especially with regards to core functions

3) learning all the nifty tricks that wart has (like :case)

-----

1 point by akkartik 5107 days ago | link

These are great, valid reasons. I wasn't complaining, was I? How do they not apply to your suggestions? I'm still not sure what you would have us do. Say, "yes you're right"? Go off and rebuild arc like you suggest? Why would that then not suffer the problem of 'learning how it differs from old arc'?

-----

1 point by Pauan 5107 days ago | link

Think of a situation where Arc currently throws an error, like when you try to use `apply` on a macro. Current code cannot use apply on a macro, because of the error.

Now let's suppose we fix that, so apply works on macros... nobody needs to "learn" anything, other than "hey, apply works on macros now!" because the changes are completely backwards compatible.

Arc doesn't expose the internal details of it's data types, what I'm proposing is a way to expose them. Existing Arc programs can't be relying on the internal details because Arc programs can't access the internal details.

Thus, message passing is backwards compatible. The only time people would need to learn message passing is if they actually plan on creating a new data type.

So how does implementing message passing differ from wart? It doesn't change the way existing Arc programs are run. Can I take any Arc program, plop it in, and have it work the same in wart? Message passing is backwards compatible with existing Arc programs.

In other words... you could take an existing Arc program, plop it into the new version of Arc (that has message passing), and it should work just fine. Only the low-level details have changed, but the high-level ones remain the same.

-----

1 point by akkartik 5107 days ago | link

"nobody needs to "learn" anything"

They still need to 'learn how it differs from arc' (your words). Unless you convince PG to include methods :)

"Message passing is backwards compatible with existing Arc programs."

Awesome, so nothing's stopping you adding it to anarki.

-----

1 point by Pauan 5107 days ago | link

Yeah, I know, but I was referring more to, "what do I need to learn to get stuff to work in wart". I admit that my terminology was less precise than it probably should have been.

-----

1 point by akkartik 5107 days ago | link

wart's not backwards compatible, but I find its comprehensive unit tests to be useful examples.

-----

2 points by Pauan 5110 days ago | link

Okay, I know I wrote a lot, and I kinda rambled a bit, so here is the simplified step-by-step explanation of what I'm talking about. You can ignore the other posts if you like, but please do read this one.

First, what is our goal? I want to allow users to create new data types that behave like existing Arc datatypes... I want them to fit in seamlessly. What am I talking about?

Suppose you wish to create something that behaves almost identically to a table, but does something special when you call = on it:

  (= my-table.'foo 5) ; do something special
The naive approach of creating a new data type and extending sref is not extensible or scalable. I'll explain why soon.

---

But first, what is the problem? In order to explain this, I must first explain the concept of private/public data. Public data is exposed everywhere. An example would be a global variable or a global function.

Private data is data that is hidden from the outside world completely. In Arc, we can use closures to create private data:

  (let priv 5
    (def get-priv () (+ 2 priv)))

  priv       -> error
  (get-priv) -> 7
As you can see, by using a let expression, the variable priv is private: only the function `get-priv` can access it, but nobody else. `get-priv` has full control over who gets to access the variable priv, and how they access it.

Okay, so what? We use closures all the time, this isn't exactly new... hold that thought for a moment. What is an input port, in Arc? They are obviously implemented in Racket, and they contain private data, just like the variable priv.

This is important. Arc code cannot access the private data of the stdin port. Only the Racket functions readc, peekc, readb, etc. can access stdin's private data. This means two things:

1) The above-mentioned functions (readc, peekc, etc.) must be implemented in Racket, because only they can access stdin's private data.

2) This makes it much harder and more verbose to create "fake" input ports: a data type that is written in Arc, but designed to behave like an input port.

Let's consider an example. We wish to create two input ports, foo and bar. Here is the obvious way:

  (let stream '(#\f #\o #\o)
    (def foo ())
    
    (extend peekc (x) (is x foo)
      (car stream))
      
    (extend readc (x) (is x foo)
      (pop stream)))
      
      
  (let stream '(#\b #\a #\r)
    (def bar ())
    
    (extend peekc (x) (is x bar)
      (car stream))
      
    (extend readc (x) (is x bar)
      (pop stream)))
Now let's test it:

  (peekc foo) -> #\f
  (readc foo) -> #\f
  (readc foo) -> #\o
  (readc foo) -> #\o
  (readc foo) -> nil

  (peekc bar) -> #\b
  (readc bar) -> #\b
  (readc bar) -> #\a
  (readc bar) -> #\r
  (readc bar) -> nil
And it works perfectly. But you'll note a couple things... first off, the verbosity. Even worse, there's duplicate code! A macro won't help much, because your stream might want different behavior, so you still need to define it individually. And this was only an example with extending readc and peekc... let alone the other input functions! Not only is the above verbose, but it's inefficient! Every time you want to create a new stream, you need to re-extend all the relevant built-in functions...

Now, that was just streams. Tables are a whole different ballgame. In fact, in pgArc it's impossible to make a fake table (something that looks and behaves like a table, but isn't). This is because you can't properly extend eval and apply in Arc. But even if you could, it would be just as clunky as the stream example (or worse).

Why is this? It is for the exact same reason: tables contain private data that only Racket knows about. Arc cannot access that private data, nor can it create it. But before we solve that problem, let's take a little detour...

---

There are fundamentally two ways of calling a function. Let's consider a simple example:

  (+ 1 5)  -> 6
  (1 '+ 5) -> 6
Woah, what's going on here? Suppose we had a hypothetical Lisp language where numbers were functions. Rather than calling (+ 1 5) you would call the number, and tell it what behavior you want (in this case, addition). Thus:

  (+ 1 5) -> (1 '+ 5)
  (- 1 5) -> (1 '- 5)
  (* 1 5) -> (1 '* 5)
  ...
What gives? That's really weird! Why are we doing this? Hold that thought for a moment, please. Let's look at some more examples:

  (car my-list)       -> (my-list 'car)
  (len my-list)       -> (my-list 'len)
  (my-table 'foo)     -> (my-table 'get 'foo)
  (= my-table.'foo 5) -> (my-table 'set 'foo 5)
No, really, why are we doing this? Let's go back to our two streams from before. They both contain private data, specifically a list of characters. peekc and readc need access to that private data, so we extend them. The alternative would be to store the data as global variables, which is ridiculous.

But... wait... functions can accept arguments. Consider the two streams again. foo is a self-contained stream, that has access to private data. We want a way to access that private data, which is why we use the functions readc and peekc. The problem is, that the behavior of the private data varies depending on how foo is called.

If foo is called by readc, we want it to consume a character... but if foo is called by peekc, we don't want it to consume a character. But foo doesn't know what function called it! It doesn't know whether it was called by readc or peekc, so how does it know what to do? Simple, we give it a message, telling it what to do:

  (foo 'peek)
  (foo 'read)
The first argument to foo is a symbol, telling foo what to do. Now foo knows whether it should consume a character or not. But... why are we going into this whole "message passing" thing? To reduce verbosity and drastically increase extensibility. Let's assume that we adopted this message-passing system. You could now write the two streams like this:

  (let stream '(#\f #\o #\o)
    (= foo (annotate 'input (fn (x)
                              (case x
                                'peek (car stream)
                                'read (pop stream))))))
                                
  (let stream '(#\b #\a #\r)
    (= bar (annotate 'input (fn (x)
                              (case x
                                'peek (car stream)
                                'read (pop stream))))))
You'll note something. I didn't have to extend anything. I didn't have to extend readc, or peekc, or readb, or anything else. Everything just works. Imagine if every time you wanted to make a new stream, you had to extend several built-ins... ew. This way, you don't need to extend anything.

Okay, that's great for streams, but what about tables? They work too! My idea is that when `apply` sees something that is annotated with type 'table, it will do a behind-the-scenes conversion:

  ; before            -> after
  (my-table 'foo)     -> (my-table 'get 'foo)
  (= my-table.'foo 5) -> (my-table 'set 'foo 5)
  (keys my-table)     -> (my-table 'keys)
This is an internal conversion done by `apply`, so tables look and behave the same way they do right now. But what it allows us to do is create new table types, that integrate seamlessly into Arc:

  (annotate 'table (fn (x)
                     (case x
                       'keys ...
                       'get ...
                       'set ...)))
Basically, all the compound data types (conses, tables, input, output) would be represented as functions that accept a message parameter. This lets Arc code create custom data types that integrate seamlessly into the language, without needing to extend anything.

Note: I'm only proposing this for the data-types. Arc code in general can be written in this style, of course, but I doubt it will be. But given the evidence I've seen, I think this particular style is well-suited to creating data types in Arc.

-----

1 point by Pauan 5110 days ago | link

Okay, so... one thing I didn't emphasize in this post is that this lets you write core Arc functions in Arc itself... Like cons, table, instring. Stuff like that. So here's my first partial crack at it:

  (def attr (x . args)
    (apply (annotate 'fn x) args))

  (def cons (x y)
    (annotate 'cons
      (fn (m)
        (case m
          'len (+ (len y) 1)
          'car x
          'cdr y))))

  (def car (x)
    (attr x 'car))

  (def cdr (x)
    (attr x 'cdr))

  (def table ()
    (let alist '((nil nil))
      (annotate 'table
        (fn (m k v)
          (case m
            'keys (map [car _] alist)
            'get (alist k v)
            'set (sref alist k v))))))

  (def keys (x)
    (attr (coerce x 'table) 'keys))

  (def len (x)
    (case type.x
      'cons  (attr x 'len)
      'table (len (keys x))))

  (def vals (x)
    (map [x _] (keys x)))
Note: this is assuming py-arc, and also assuming the message-passing idea.

It's untested, and py-arc doesn't actually support my idea (yet), so I might not be testing it anytime soon. As such, there's probably bugs in it. But it should give you a taste of the power of this idea: the ability to define core datatypes in Arc itself, using only functions (no piggybacking!), but do so in a way that it's easy for user code to extend.

As a side note, what's with the attr function? Well, you see, when Arc sees (my-table 'foo) it expands it to (my-table 'get 'foo), so calling (my-table 'keys) doesn't work. But by annotating the table with type 'fn, you can get at the actual "hidden" attributes, like 'keys, 'get, and 'set. This means that (my-table 'keys) will never conflict with (my-tables 'get 'keys).

Also, I'll note that this is probably significantly slower than having these things defined in Racket, but that's okay. It's nice to know that Arc has the possibility to define them, even if they're defined in Racket (for the sake of performance).

-----

2 points by aw 5110 days ago | link

Yes, Arc should be hackable, so it's a good idea for people to be able to provide their own implementations in Arc for tables, streams, and so on.

-----

2 points by rocketnia 5108 days ago | link

Here we are talking about the issue of Arc code being able to introduce a new kind of value or a new operation to Arc while giving it seamless integration with existing operations (for values) or existing values (for operations), and it turns out our issue has a common name, the "expression problem." ^_^

Functional programming tends to emphasize making new operations on existing values, while object-oriented programming tends to emphasize making new kinds of values with support for existing operations. It's less common to have a language support both, and it naturally gets less and less common the more one demands other properties of the solution. Languages which already prioritize certain problems have more difficulty tackling the expression problem.

Well, when Philip Wadler coined the term "expression problem," he had certain other problems in mind that made it difficult: He was going for a way to statically check that all the necessary extensions existed, and he wanted to do it in a language with separate compilation of modules. These are hardly problems Arc's designed to solve, being such a dynamic language, but according to some people they're essential to what makes the expression problem a problem. http://lambda-the-ultimate.org/node/4136#comment-63009

As long as we don't make those demands in Arc, it's a non-problem indeed, and one easy solution is 'extend:

  ; Define a new type A with support for existing ops B and C.
  (= A-type (uniq))
  (def isa-A (x) (isa x A-type))
  (extend B (self . args) isa-A.self ...)
  (extend C (self . args) isa-A.self ...)
  
  ; Define a new op A with support for existing types B and C.
  (def A (self . args) (err "Can't A these args."))
  (extend A (self . args) isa-B.self ...)
  (extend A (self . args) isa-C.self ...)
In fact, 'extend reduces some more boilerplate in practice, since we aren't limited to 'isa predicates.

This 'extend kind of approach is widely known as "predicate dispatch," unsurprisingly. Unfortunately, Because the predicates can have Turing-complete dependence on the run time arguments, predicate dispatch about as difficult to statically reason about as it gets. This in turn makes it hard to make a module system that automatically determines a precedence order among extensions made in different libraries. http://lambda-the-ultimate.org/node/3700#comment-52728

Those are downsides that hardly have alernatives in a language that's mostly dynamic anyway, like our Arc-like languages. In fact, once I get back around to Blade, my extensible static language idea, I expect to still like predicate dispatch: Blade will probably have no (built-in) static type system, and I already design libraries so they extend only cases they introduce. If and when precedence is an issue, I can probably handle it very nicely using my self-organizing rule precedence system from Lathe.

Ohhh, predicate dispatch. Predicate dispatch, la la la~

ahem

The prototype system doesn't seem (to me) to actually solve the expression problem, instead erring on the side of being too object-oriented: It focuses on making new kinds of values with support for existing operations, but doesn't make it easy to make a new operation (message) that acts on existing kinds of values. (I don't think this is anything I haven't said before, but maybe it's cast in more common terms now. ^_^ )

Still, there's a good chance the prototype system solves problems aside from the expression problem. I just can't imagine what they are yet, and I don't know if I'd call them problems of extensibility.

-----

1 point by Pauan 5107 days ago | link

"Here we are talking about the issue of Arc code being able to introduce a new kind of value or a new operation to Arc [...]"

No. I am talking only about the issue of creating new values. Message passing is not designed to solve the problem of creating new operations. That job is already fulfilled by `extend`. Message passing solves the issue of creating new data types. Nothing more, nothing less.

---

I am well aware that extend can solve it... Message passing solves it while being significantly shorter and easier to use. Also, extending does not work well with modules...

If one module tries to create or modify an existing data type, and it uses `extend`, only that module will see the extensions. Thus you are forced to either jump through even more hoops, or extend __built-ins* which is bad because then safe modules can't properly create new data types...

Extend is at least twice as verbose [1] as message passing, it is more difficult to use, and it causes problems when you try to introduce a module system. Message passing has none of those issues...

---

I'm not talking about prototypes [2], I'm talking about message passing, which is different. It is possible to create prototypes using message passing, but message passing is on a lower level, in the same way that it's possible to create iterators (high level) using continuations (low level).

Message passing simply means that you (the caller) tell the function what you want it to do. This message would probably be a symbol like 'get or 'keys or whatever, but it doesn't have to be a symbol [3].

The only important thing to understand about message passing is that it inverts control: rather than a function returning a value, and then you operating on that value, you send an operator (the message) to the function, and then the function uses that operator to do something.

This is good for describing data types, which are self-contained. It is not so good for describing programs. That's why I'm proposing a mixture of message passing (at the lowest level) and functional (at higher levels).

---

"It focuses on making new kinds of values with support for existing operations, but doesn't make it easy to make a new operation (message) that acts on existing kinds of values."

You can use extend to create new operators, just as we do right now. But extend does not work well for creating new data types that use the existing operators. A combination of extend + message passing is ideal.

Extend is a beautiful thing, but when all you have is a hammer, everything looks like a nail. That doesn't mean a hammer is always the appropriate tool for the job. Use message passing when it makes sense, and use extend when it makes sense.

---

Also, it seems you still don't think that message passing is more extensible... what is extensibility? It can't be "the capability to do something" because any Turing-complete language can do that. It's how easy it is to do things, and how easy it is to make changes to your program.

If you try to make a new data type with extend, you end up jumping all over the place, extending several built-ins, and now good luck making it play nicely with modules and existing code.

With message passing, you just change one place and everything just works perfectly. Thus, they can both do the same thing, but message passing allows for changes and additions to data types in a way that requires you (the programmer) to do less work. Thus, I would call that more extensible.

---

* [1] http://arclanguage.org/item?id=14261

* [2] The original post in this thread does talk about prototypes, but that's because I was still figuring stuff out, and didn't realize that message passing was better. It is possible to create prototypes using message passing, so I'm going to ditch the prototype idea and focus on message passing. If we ever need/want prototypes, we can easily add them using message passing.

* [3] This is analogous to the way tables work right now... saying (my-table 'foo) retrieves the value of the 'foo key, and (my-table 'bar) retrieves the value of the 'bar key. What I'm proposing is a way to generalize that to all data types using a short and extensible system that is accessible to Arc.

-----

2 points by rocketnia 5107 days ago | link

(I know I'm probably falling way behind here. If I end up repeating a response you've already heard by now, sorry about that.)

---

"No. I am talking only about the issue of creating new values."

Okay then, we're getting somewhere. ^_^ Nevertheless, back when you said "The naive approach of creating a new data type and extending sref is not extensible or scalable" (emphasis yours), I don't think you actually suggested anything more extensible. Maybe we just have a difference in meaning here, or maybe the "scalable" part was crucial to what you meant, or maybe you've changed your mind now. Is it any of those things?

---

"Message passing solves the issue of creating new data types. Nothing more, nothing less."

Then your approach doesn't solve a problem I find important. In fact, in the examples you've given, it would make it a harder problem to solve, thanks to encapsulation. So not only would I not find the approach useful myself, but I would find libraries that did follow that approach less helpful.

Thus I continue to advocate 'extend, which solves the issue of creating new data types and solves even more problems I have (or expect to have).

Note that there are other problems message-passing might solve, notably parallelism. I'm not saying you can magically get parallelism in a few steps from where you're at, lol, but it's probably a natural way to begin approaching the expression problem in a system that already has a parallelism focus.

---

"If one module tries to create or modify an existing data type, and it uses `extend`, only that module will see the extensions."

Not necessarily. If modules are implemented the way I like (which is biased since I like 'extend), an imported variable could have the same mutable binding as the binding it has in its home module, in which case an 'extend extension would be visible in every part of the program that uses that module.

You could argue that it's not always a good idea for extensions to be visible everywhere, and I agree. It might not be a problem in my own code, since I almost always write libraries so that they only extend cases they introduce (so that no two libraries' cases overlap), but it is a problem in general, and I've been wrestling with recently. I believe there is actually a solution to this problem that lets me keep using predicate dispatch, but your style of message-passing doesn't inspire me.

---

"Extend is at least twice as verbose [(example)] as message passing"

Your example isn't fair to 'extend. You assume 'iso, 'fill-table, and 'vals would be derived functions in PyArc, but you don't give Wart the same advantage. You use the same (m k v) argument list for the 'keys, 'get, and 'set functions, when really different methods will tend to have different argument lists (at least in my mind).

Meanwhile, you overlook that the 'extend approach is extensible enough to support a message-passing system like yours inside it (which is what should really matter in the core, IMO). Plenty of times I design types that support only a fixed set of behaviors and fully encapsulate those behaviors; for me, they tend to be shaped as lists of functions (virtual method tables?), but from time to time I use a dispatch-on-a-symbol approach like yours. The way I make those types work is straightforward: I use 'extend and delegate to the functionality held within the value. There's a bit of boilerplate involved in this right now--not as much as I think you see, but some. (Once we've established a type, we can use it over and over.)

Since you also say "extend does not work well for creating new data types that use the existing operators" (emphasis yours), here's an illustration, extrapolating from my earlier example:

  ; Define a new type A with support for existing ops B and C and with
  ; nothing else special about it.
  
  (= A-type (uniq))
  
  (def isa-A (x)
    (isa x A-type))
  
  (def make-A (B-behavior C-behavior)
    (annotate A-type (list B-behavior C-behavior)))
  
  (extend B (self . args) isa-A.self
    (apply rep.self.0 self args))
  
  (extend C (self . args) isa-A.self
    (apply rep.self.1 self args))
  
  ; Later on:
  ;
  ; (make-A (fn (this <arguments to B>) ...)
  ;         (fn (this <arguments to C>) ...))
There are lots of little tweaks that could be made in this design: The type could be represented not by a list but by a table or a "message-passing" function, the 'this parameters could be left out, and so on. It might indeed be beneficial to decide on a standard format for these, if only to make them easier to define.

However, I currently don't have enough motivation for encapsulation to consider it a positive aspect of my designs; I try to see types like these as placeholders for more open-faced types, and I make them only in the most simplistic, irreducible cases where the boilerplate hardly matters. It'll be a while before I have a meaningful opinion on which way to go from here (and I welcome you to surpass me in the meantime).

---

"I'm not talking about prototypes, I'm talking about message-passing, which is different."

Oops, sorry about that. I was just using "prototypes" as a name for your general approach, and I forgot you drew a distinction. You've illustrated message-passing without prototypes now (the 'tap-dancing example), and I think that helps.

---

"Also, it seems you still don't think that message-passing is more extensible... what is extensibility?"

Oh, this topic again. :-p To me, "extensibilty" is the expression problem (or an Arc-centric variant of it I'm peddling), and at the beginning of your post you claimed that your message-passing only tries to tackle one half of that problem: extending the system with new kinds of value, not extending it with new operations.

-----

1 point by Pauan 5107 days ago | link

Alright, that's better. I feel like we're finally on (more or less) the same page, so we can actually discuss the idea, rather than arguing about what the idea even is.

---

"Maybe we just have a difference in meaning here, or maybe the "scalable" part was crucial to what you meant, or maybe you've changed your mind now. Is it any of those things?"

I think we just have a different view of extensibility.

---

"In fact, in the examples you've given, it would make it a harder problem to solve, thanks to encapsulation."

Um... extending is just as encapsulated as message passing is... I'm not sure what you were trying to say.

---

"Thus I continue to advocate 'extend, which solves the issue of creating new data types and solves even more problems I have (or expect to have)."

Please give a single example where extending is better than message passing, with regards to creating or modifying data types.

I'm not arguing that we get rid of extend. Extend is great, it gets to stay. I'm arguing that implementing the low-level data types using message passing would be less verbose and easier than using extend.

Even if we adopted message passing, extend would still have a large role to play. In fact, I envision extend being used to add/modify/remove messages:

  (def bad-function (m)
    (case m
      'get ...          ; bad-function has a bug in it!
      'set ...))
      
  (extend bad-function (m) (is m 'get) ; fix bad-function, hurray!
    ...)
This could play well with prototypes as well (assuming we used them)... for instance, imagine there was a table-proto* function that served as the base for all tables, and you wanted to add a new message:

  (extend table-proto* (m) (is m 'my-message)
    ...)
Voila, now all tables support your new message.

---

"I believe there is actually a solution to this problem that lets me keep using predicate dispatch, but your style of message-passing doesn't inspire me."

I'd be interested in hearing that.

---

"Your example isn't fair to 'extend. You assume 'iso, 'fill-table, and 'vals would be derived functions in PyArc, [...]"

I'll note that with message passing, it can use a type of 'table, and thus work seamlessly with code that uses isa, whereas in wart you need to create a new type, and then extend code (like iso and filltable) to recognize your new type.

But... let's assume that there was a built-in called `table?` that tested for "tableness", and that the other built-ins used that rather than isa:

  (def clone(x)                                   ;  (def clone(p)
    (annotate 'heir (obj val (table) parent x)))  ;    (let t (table)
                                                  ;      (annotate 'table (fn (m k v)
  (defcall heir h                                 ;                         (case m
    [or rep.h!val._                               ;                           'keys (join (keys p) (keys t))
        rep.h!parent._])                          ;                           'get (or (t k v) (p k v))
                                                  ;                           'set (sref t k v))))))
  (defset heir(h key value)                       ;
    (= rep.h!val.key value))                      ;
                                                  ;
  (def table?(x) :case (isa x 'heir) t)           ;
                                                  ;
  (def keys(x) :case (isa x 'heir)                ;
    (join (keys rep.x!val)                        ;
          (keys rep.x!parent)))                   ;
Not as huge as before, I admit, but still a 2x difference. Does it matter? Perhaps. Perhaps not. But with message passing, you get correct behavior with modules out of the box... you already mentioned that extend will cause problems with modules unless you either A) design the module system so it's no longer doing it's job right, in which case why have the module system at all? or B) jump through hoops.

I'll also note that I dislike how this requires you to create a new type every time you want to extend something... thus even if we moved to a `table?` approach, we'd still be keeping isa and type around, solely to support extensions... So if they're in the language, why not co-opt them for a more useful purpose (like message passing)? Or figure out a better way to create data types that doesn't even use isa?

---

"You use the same (m k v) argument list for the 'keys, 'get, and 'set functions, when really different methods will tend to have different argument lists (at least in my mind)."

You're right: that's done for simplicity, but that is easily solved. For instance, consider a macro like this:

  (object table
    keys () ...
    get (k v) ...
    set (k v) ...)

  (object cons
    call args ...
    car () ...
    cdr () ...)
The system doesn't care how the functions are implemented, as long as they follow the interface. You could use nested functions, currying, inheritance, whatever floats your boat.

---

"Meanwhile, you overlook that the 'extend approach is extensible enough to support a message-passing system like yours inside it (which is what should really matter in the core, IMO)."

Uh huh. And how are you going to get that to play nicely with the built in table type? Or the built in cons type? It's much easier to design things so the built-in types already use message passing.

---

"[...] here's an illustration, extrapolating from my earlier example: [...]"

Now do that with tables. Or lists. Or numbers. Harder, ain't it?

---

"However, I currently don't have enough motivation for encapsulation to consider it a positive aspect of my designs;"

Once again... how is message passing "more encapsulated"?

---

"and at the beginning of your post you claimed that your message-passing only tries to tackle one half of that problem: extending the system with new kinds of value, not extending it with new operations."

Yes, because extend is already pretty darn good at covering the other half. So by combining the two, you can get the best of both worlds.

---

My point is not that "extend can't do that" because extend is definitely powerful enough to do it. OOP is also powerful enough to describe any program. That doesn't mean every program should be written in OOP style.

My point is that message passing takes the creation and modification of types, and makes it easier/shorter. That's all. If you prefer to use the more-verbose extend, then go ahead.

-----

1 point by rocketnia 5106 days ago | link

"Um... extending is just as encapsulated as message passing is..."

If we define a whole bundle of behaviors in one closure, or in a nearby set of closures, then the value can naturally encapsulate variables in the lexical scope. Essentially, the value itself is a closure but with a different interface.

If instead we extend the behaviors at the top level, outside the construction context, then we have to design the value's representation in such a way that those extensions can inspect it. This encourages a lesser degree of encapsulation, which in turn gives future programmers more freedom in using the code. On the other hand, sometimes there are just no other operations worth adding, and this just makes the code more scattered and harder to reason about.

Oh, I overlooked your post at http://arclanguage.org/item?id=14307. The encapsulation aspect is exactly the part that allows for certain optimizations.

---

"Please give a single example where extending is better than message passing, with regards to creating or modifying data types."

Well, I gave an example before the part you replied to, and I gave a more in-depth version in the same post as you replied to. I dunno, I think they're "better," but keep in mind that we're solving different problems: in my case, the expression problem, and in your case, convenient construction of custom data types. Based on the motivations you've expressed, I believe my problem has a strictly broader scope than yours, but by the same token, I do believe a narrower scope can allow for a more elegant special-case solution.

---

"I'm arguing that implementing the low-level data types using message passing would be less verbose and easier than using extend."

That actually sounds like a pretty nifty way to go, not because of verbosity and ease of implementation ('cause I have a pretty high tolerance of complexity in the core) but because it may eliminate some primitive types.

There'd be trouble eliminating all of them though: If you pass 'car a message, then when that message is parsed from the argument list, there may be an infinite loop.

---

"Voila, now all tables support your new message."

I think I specifically said it would be hard to add behaviors low on the prototype hierarchy. :) Specifically, it would be hard to add behaviors to things whose behaviors are determined by their constructors, rather than their prototypes; I'll get to this topic at ( * ).

---

"I'd be interested in hearing [how you plan to reconcile predicate dispatch and modules]."

I think I'd rather start a new thread for that. I'll try to get around to it soon.

---

"I'll note that with message passing, it can use a type of 'table, and thus work seamlessly with code that uses isa, whereas in wart you need to create a new type, and then extend code (like iso and filltable) to recognize your new type."

I haven't heard you say what happens when something needs to act seamlessly both as a 'table and as an 'input (my example at http://arclanguage.org/item?id=14258). You did have an example of alists being tables and cons cells, but I don't see how it would work, since you said (type my-alist) was 'cons. (http://arclanguage.org/item?id=14261)

---

"But with message passing, you get correct behavior with modules out of the box..."

( * ) Actually, I don't see the connection between message-passing and modules at all. That's probably your point, but if I did end up wanting to add a new operation on heirs (a desire you don't claim to be prepared for), the most straightforward way would be to redefine 'clone completely, in a way that intentionally defies module separation so that all the program's heirs use the new definition.

---

"[...]even if we moved to a `table?` approach, we'd still be keeping isa and type around, solely to support extensions[...]"

In http://arclanguage.org/item?id=14265 I mentioned that we could get rid of 'type and 'rep completely by using a (deftype wrap-foo unwrap-foo isa-foo) form, but that I'd prefer to let people have access to 'type and 'rep too, if only for emergencies.

---

"For instance, consider a macro like this"

How about failcall and a macro like this:

  ; This makes a new type, extends 'keys, 'ref, and 'sref with rules
  ; that check that their first argument is that type and delegate to
  ; its inner behaviors, and defines a function 'fn-make-foo and a macro
  ; 'make-foo for constructing values of the new type.
  ;
  (def-extension-type fn-make-foo make-foo
    keys ref sref)
  
  (make-foo
    ()                         ...
    (key (o default))          ...
    (new-val key (o default))  ...)
With failcall, there'd be no need to extend things like 'supports-keys too, and 'isa would be mostly redundant.

Note the fact that you have to say 'def-extension-type this way, but that each construction of the type is slightly shorter since it doesn't have to reiterate which "methods" it implements.

Anyway, I expect one could actually duplicate this exact syntax in a message-passing system that has failcall. So you probably still have a potential brevity advantage, thanks to the fact that your problem has a narrower scope than mine, but it's pretty close.

---

"Uh huh . And how are you going to get that to play nicely with the built in table type?"

You're right, if message-passing is merely a library, there's a good chance it won't be seamless with existing things in the language.

---

"Now do that with tables. Or lists. Or numbers. Harder, ain't it?"

I address tables and lists here: http://arclanguage.org/item?id=14337

Numbers, on the other hand, are more interesting. ^_^ It's super-tempting to make them efficient with the help of algorithms that manipulate the binary representation of the number. IMO, that means numbers should ideally be represented as binary strings.

In practice though, who makes custom types of numbers? Methinks there are two kinds of custom number type: small bundles of abstract properties, which 'extend is good for, and large bundles of optimized algorithms, which neither of these approaches is especially good for. :-p

-----

1 point by Pauan 5106 days ago | link

"If instead we extend the behaviors at the top level, outside the construction context, then we have to design the value's representation in such a way that those extensions can inspect it."

Yeah, that's what message passing does... gives a way to get at the internal details of the function. The alternative is to use a closure + extend. Both systems use closures to hide internal data, they just have different interfaces for "getting at" the internal data.

---

"Based on the motivations you've expressed, I believe my problem has a strictly broader scope than yours, but by the same token, I do believe a narrower scope can allow for a more elegant special-case solution."

Yeah, you're right. A good solution to the expression problem would be nice, but in the meantime, how about sprinkling a little sugar on the current difficult areas?

---

"There'd be trouble eliminating all of them though: If you pass 'car a message, then when that message is parsed from the argument list, there may be an infinite loop."

How so? If cons is defined as follows:

  (def cons (x y)
    (object cons
      car () x
      cdr () y))
And then `car` and `cdr` are defined as follows:

  (def car (x)
    (attr (coerce x 'cons) 'car))

  (def cdr (x)
    (attr (coerce x 'cons) 'cdr))
Then where's the problem?

---

"I think I specifically said it would be hard to add behaviors low on the prototype hierarchy. :) Specifically, it would be hard to add behaviors to things whose behaviors are determined by their constructors, rather than their prototypes;"

But isn't a prototype a constructor? The whole point of prototypes are that they remove the distinction between class/instance, so in a sense, every prototype is both an instance and a class at the same time.

---

"I haven't heard you say what happens when something needs to act seamlessly both as a 'table and as an 'input"

I was actually mulling that over last night... I see two (compatible) ways to do that: duck typing and allowing values to have multiple types.

For instance, let's say you wanted to make a table that also happens to be an iterator... tables have a 'keys, 'call, and 'set message. Iterators have a 'next message. So you would just combine them together:

  (object table
    keys ()    ...
    call (k d) ...
    set  (k v) ...
    next ()    ...)
If code just uses (attr foo 'next) then the above would work, but it fails if the code expects something of type 'iterator. So then comes in the other idea... allowing values to have multiple types at once. If we think of types as being interfaces, this is equivalent to saying, "I support this interface, oh, and that interface too."

  (object (iterator table)
    keys ()    ...
    call (k d) ...
    set  (k v) ...
    next ()    ...)
Neat. Now code that expects a table works, and code that expects an iterator works too. This is nice because it allows for data to use the same message name, without causing conflicts. Imagine if you created a new data type that uses the 'next message, but it has different behavior from an iterator. If you pass that data to a function that expects an iterator, it would probably fail silently.

But by using types-as-interfaces, this lets you give context to the messages. So an iterator that has a 'next message is different from something else that has a 'next message. If you really don't care about the type, you can just use `attr`.

---

"Actually, I don't see the connection between message-passing and modules at all. That's probably your point, but if I did end up wanting to add a new operation on heirs (a desire you don't claim to be prepared for), the most straightforward way would be to redefine 'clone completely, in a way that intentionally defies module separation so that all the program's heirs use the new definition."

The point is that you want one module to be able to create a new data type, and then let other modules use it. Because isa uses a symbol, and symbols are interned across modules, (isa foo 'table) would work just fine. This is not true with things like `table?` unless you jump through some hoops or have every module extend __built-ins*

As for adding a new operation on 'heir... it's possible with message passing, but I don't think it would be particularly superior to extend. In fact, it may just be easier to define that new behavior using extend, since message passing is more concerned with data and interfaces than behavior.

---

"Anyway, I expect one could actually duplicate this exact syntax in a message-passing system that has failcall. So you probably still have a potential brevity advantage, thanks to the fact that your problem has a narrower scope than mine, but it's pretty close."

Or to put it another way, rulebooks could be implemented as a library on top of message passing. Also, another concern... efficiency. When you extend a function, it has some overhead, yes? Message passing shouldn't have that overhead, so if you're creating many rules, that could make a difference.

-----

1 point by rocketnia 5105 days ago | link

"Yeah, that's what message passing does... gives a way to get at the internal details of the function."

By the time someone adds all the messages needed to extract the private data out of the type (which they can't do without modifying the library), it'll

---

"A good solution to the expression problem would be nice, but in the meantime, how about sprinkling a little sugar on the current difficult areas?"

Not to belittle, but it doesn't strike me as difficult. I'd rather sprinkle sugar on 'extend if I need to, using something like my 'def-extension-type example, but I haven't needed to yet.

---

"How so? If cons is defined as follows[...]"

If 'car is implemented as a function and you call it and 'car tries to get its argument using 'car, you're in trouble. There's a similar loop if you're sending the message 'car rather than calling a function 'car, but removing the built-in implementation of 'car for built-in cons cells is the problem here, not message-passing.

---

"But isn't a prototype a constructor?"

In your example at http://arclanguage.org/item?id=14331, the (annotate 'table (fn ...)) expression is an example of constructing an instance, and you specify instance-specific behavior in that expression, and they get to refer to some encapsulated data. It's hard, if not impossible, for someone using 'clone as a library to add behaviors to that instance which have access to the same data, whether we used prototypes or not.

---

"So then comes in the other idea... allowing values to have multiple types at once."

Would 'type return a list of types? I see no problem with the way this is going, but it feels more complicated now.

---

"This is nice because it allows for data to use the same message name, without causing conflicts."

Actually, I bet you'd get a conflict when a value needed to implement two interfaces that had completely different meanings for 'next. Whether you consider that important is up to you; it's a common gotcha in OO languages with inheritance. I consider it to be a namespacing issue, and that encourages me to use the global namespace somehow. I'd put 'next in the global namespace as an extensible function, but it could easily be a global variable holding a gensym message name instead.

---

"The point is that you want one module to be able to create a new data type, and then let other modules use it."

Yep, I understand that's what you want, and I understand that's all you expect your technique to tackle.

To phrase the other half of the expression problem that in those terms: I want one module to be able to create a new data type, and then let other modules and application code interpret that value according to interfaces of their own devising.

"Because isa uses a symbol, and symbols are interned across modules, (isa foo 'table) would work just fine. This is not true with things like `table?` unless you jump through some hoops[...]"

If two modules use different notions of table, then a single value like 'table can't uniquely identify them both. There's a name clash.

If they use the same notion, then they should both depend on whatever module introduces that concept anyway (probably the core), and that module can provide them access (and extension access) to functions like 'table? and 'keys.

---

"Or to put it another way, rulebooks could be implemented as a library on top of message passing."

Well, the kind of message-passing system you're talking about makes encapsulation very easy. The kind of 'extend system I'm talking about only thrives on unencapsulated types. They have an adversarious relationship right now.

That said, if you make your message receivers less encapsulated, rulebooks might indeed work as a library. ^_^

Failcall not so much, though. I managed to get failcall mostly working as an Arc library (in Lathe's arc/failcall.arc), but any preexisting code that uses 'apply is a potential abstraction leak. Failcall pretty much needs to be in the core to be seamless.

---

"When you extend a function, it has some overhead, yes?"

Actually, it can, but I expect almost all programs to have a small constant number of rules in each rulebook, and I expect (and encourage) constant-time failure conditions so one rule doesn't encumber the others. So I expect most rulebooks to have constant-time overhead, and that's the same complexity I expect of message-passing.

-----

1 point by Pauan 5105 days ago | link

"If 'car is implemented as a function and you call it and 'car tries to get its argument using 'car, you're in trouble. There's a similar loop if you're sending the message 'car rather than calling a function 'car, but removing the built-in implementation of 'car for built-in cons cells is the problem here, not message-passing."

It's not using the function car to get the message... it's using the symbol 'car. An infinite loop should only happen in the following situation (or similar):

  (= foo (object cons
           car () (car foo)))
In other words... a recursive loop. But that's the fault of the designer of the code, no different than any other recursive loop. This should work fine:

  (= foo (cons 1 2))

  (= bar (object cons
           car () (car foo)))

  (car bar) -> 1
But... let's assume you were using `car` as an example of a situation where something could potentially go wrong... in which case, I'm still not sure what the problem is. You get the internal details by using `attr`, which is a very simple function that shouldn't cause infinite loops:

  (def attr (x . args)
    (apply (coerce x 'fn) args))

  (attr foo 'car)
  (attr foo 'cdr)
---

"In your example at http://arclanguage.org/item?id=14331, the (annotate 'table (fn ...)) expression is an example of constructing an instance"

Hm... solvable, but a lot trickier. That's a good point.

---

"Would 'type return a list of types? I see no problem with the way this is going, but it feels more complicated now."

That would be the simple way to do it, yeah, which is what I was planning on doing with py-arc. There are other ways, but I doubt they'd be any simpler. Then isa would be defined like so:

  (def isa (x t)
    (in (type x) t))
---

"Actually, I bet you'd get a conflict when a value needed to implement two interfaces that had completely different meanings for 'next."

That's where coerce comes into play. Let's say we had two interfaces, foo and bar. They both have the 'something message, but the two interfaces give it different meanings. You could then use coercion to choose which meaning you want:

  (attr (coerce x 'foo) 'something) ; use foo's meaning
  (attr (coerce x 'bar) 'something) ; use bar's meaning
As for defining the actual coerce rules... that's clunky, so I'll need to think about that.

---

"If two modules use different notions of table, then a single value like 'table can't uniquely identify them both. There's a name clash."

Right, that's why you would use a different type... each type would map to a single interface. Two interfaces == two types.

---

"Well, the kind of message-passing system you're talking about makes encapsulation very easy. The kind of 'extend system I'm talking about only thrives on unencapsulated types. If you make your message receivers less encapsulated, rulebooks might indeed work as a library. ^_^"

I'm really not understanding you when you say "encapsulation", which seems to be very different from the way I think of "encapsulation". Please explain how extend is less encapsulated, and how message passing is too encapsulated.

---

"Actually, it can, but I expect almost all programs to have a small constant number of rules in each rulebook, and I expect (and encourage) constant-time failure conditions so one rule doesn't encumber the others. So I expect most rulebooks to have constant-time overhead, and that's the same complexity I expect of message-passing."

If you wish to create a new type that behaves like tables... you need to extend at least keys, sref, and ref, correct? Now if you wish to have 5 different table types, then that means that those functions need to be extended 5 times. Message passing should always be O(1) but extend is O(n) where n is the number of times the function has been extended.

Probably not a big deal in ordinary programs, but it might matter in a large program that uses libraries and has many rules. Obviously you shouldn't choose message passing solely because it's more efficient, but it is a (minor) benefit.

-----

2 points by rocketnia 5105 days ago | link

"It's not using the function car to get the message... it's using the symbol 'car."

Did you stop reading halfway through what you quoted? "There's a similar loop if you're sending the message 'car rather than calling a function 'car," and here's the loop:

1. The cons pair (car . nil) receives the message 'car with no arguments.

2. The message-passing function implementing the cons pair (car . nil) receives the argument list '(car).

3. The message-passing function implementing foo needs to determine the first element of its argument list to see what message it should handle. It sends that argument list the message 'car with no arguments.

4. Go to 1.

---

That's where coerce comes into play.

I don't see it. I don't think something really counts as two types unless you can use that value as either type right away. The way you're using 'coerce, every method pass will need to have an accompanying 'coerce just to be sure, and a few method-passing functions will need to check the current type as well as the method name. At that point you might as well just combine the type name and the method name into one symbol and go back to square one.

---

"Right, that's why you would use a different type..."

Two modules can be developed by two people who don't realize they're using the "same" type. You might as well tackle the issue of a module system by saying "that's why you would use a different global variable."

---

"I'm really not understanding you when you say "encapsulation", which seems to be very different from the way I think of "encapsulation". Please explain how extend is less encapsulated, and how message passing is too encapsulated."

This again.... Okay, here's an explanation by example. ^_^

  (def counter ()
    (let count 0
      (fn ()
        ++.count)))
If we call (counter), we get a value, and we can tell very little by inspecting this value.

  arc> (= foo (coun­ter))
  #<procedure: counter>
  arc> type.foo
  fn
  arc> rep.foo
  #<procedure: counter>
  arc> (is foo rep.f­oo)
  t
Because we constructed it, we know it will act as a counter, but we can't pass it to other parts of the program and expect them to figure that out, since they only have access to 'type and 'rep (unless they're super-sneaky with 'write or Racket pointer operations).

This value is encapsulated. Specifically, we can think of its implementation as encapsulating the mutable variable 'count by storing it in a lexical closure hidden within the 'fn value. This hypothetical implementation doesn't matter to what we can do with the value, but it does indicate how to reimplement it with less encapsulation:

  (def counter ()
    (annotate 'counter list.0))
  
  (defcall counter (self)
    (++ rep.self.0))
Now we can tell a lot more about the value dynamically:

  arc> (= foo (coun­ter))
  #(tagged counter (0 . nil))
  arc> type.foo
  counter
  arc> rep.foo
  (0)
Future code can observe and change its internal state in ways we didn't imagine. For instance, it might consider things of type 'counter to be iterators, and it might define a utility 'peek that acts on certain iterators, including counters.

  (extend peek (str) (isa str 'counter)
    (inc rep.str.0))
This is future code, such as an application that uses our library. Its developers know what they need better than we do, and they (should) know the representations of our types might change in future versions of the library, so this is a deliberate choice. They could also do the "right thing" and patch the library or contact the library author, but this way they don't have to stop their work, and they can even provide the author with working demos of why such implementation details should be exposed.

So, back to message-passing. Message-passing constructions like (annotate 'table (fn ...)) capture their lexical surroundings just like regular closures do, making it natural to use them for implementations similar to the first version of 'counter.

---

"Now if you wish to have 5 different table types, then that means that those functions need to be extended 5 times."

If you implement those five types using (annotate 'table (fn ...)), then you have to implement three methods per type, for the same overall result. I believe this is what akkartik meant by "moving parens around."

If there's actually duplicated code in those fifteen cases, we both have strategies for that: You can remove some of it using prototype inheritance, and I can remove some of it by way of rulebook "inheritance" using intermediate rulebooks (as explained in http://arclanguage.org/item?id=14330). Of course, we can both use plain old functions and macros too.

-----

1 point by Pauan 5105 days ago | link

"Did you stop reading halfway through what you quoted?"

Nope! Just not understanding why it's a problem, since I don't see how that would cause an infinite loop.

---

"1. The cons pair (car . nil) receives the message 'car with no arguments.

2. The message-passing function implementing the cons pair (car . nil) receives the argument list '(car).

3. The message-passing function implementing foo needs to determine the first element of its argument list to see what message it should handle. It sends that argument list the message 'car with no arguments."

You mean like this?

  ; argument list '(car)

  (fn (m)
    (case m
      'car 'car
      'cdr nil))


  ; cons pair (car . nil)

  (fn args
    (case (attr args 'car)
      'car car
      'cdr nil))
---

"At that point you might as well just combine the type name and the method name into one symbol and go back to square one."

Yeah, I admit it's not that great at handling the situation where a value wants to implement two different interfaces, but the interfaces have the same message name.

---

"This again.... Okay, here's an explanation by example. ^_^"

Alright, I'm starting to get it. Rather than representing types as a function that maps between keys/behaviors, you're representing it as a simple flat list. But that's quite possible to do with message passing as well:

  (def counter ()
    (let methods (obj call  (fn () (++ methods.'value))
                      value 0)
                      
      (annotate 'counter methods)))


  (= foo (coun­ter))
  (type foo) -> counter
  (rep foo)  -> #hash((call ...) (value 0))
I admit that's a tiny bit more verbose than your simple example, but it's nice because rather than exposing the type as an ordered sequence... we expose it as a table. Which means that rather than saying, "the 0th element is the value" we instead say, "the key 'value is the value", which makes it more robust if say... we decide to change it so the value is the 2nd element.

It also serves as self-documenting code. It also allows the built-in functions like `keys` to dispatch on the 'keys method... which doesn't work so well with a list. It also means we can easily write a macro to do the wrapping for us:

  (def counter ()
    (object counter
      call () (++ self.'value)
      value 0))
And if types are represented as tables... then you can use rep + sref to actually modify the internal representation... without needing to use extend at all. Unfortunately, if we went that route, then my simple examples like...

  (fn (m)
    (case m
      'keys ...
      'set  ...))
...wouldn't really work. But that's okay. Implementing types as tables seems like it'd be more flexible and overall better. The only downside is that now they're carrying around an internal table... but how is that worse than carrying around an internal list?

So... would you consider the above (using a table to implement message passing) to be "more encapsulated" than your idea?

-----

1 point by rocketnia 5105 days ago | link

"You mean like this?"

I don't know what you're getting at, but the cons cell (car . nil) and the argument list '(car) are supposed to be 'iso in my example; otherwise, why would I say it's a loop? I'm just referring to them in different ways to emphasize their different roles in the story.

I'm sure you're confused about something else in my example, but I don't know what it is. Here's a quick take at summarizing the issue anyway: Inspecting a list by sending it messages is incompatible with decoding a message by inspecting an argument list.

---

"The only downside is that now they're carrying around an internal table... but how is that worse than carrying around an internal list?"

Here's an old post of mine where I go over the pros and cons: http://arclanguage.org/item?id=12076. (Since then, I've learned to stop worrying about 'rep and 'annotate and love predicate dispatch. I put inheritance-aware multimethods in Lathe after that too, but I found they were less useful than I expected for what I was doing.)

With a predicate dispatch approach, it hardly matters whether the internal representation is a table or a list or even a function which dispatches to a complete getter/setter protocol, since I usually only interact with it in a single clump of extensions. Nevertheless, I do find this feature of list-based types pretty helpful:

  (let (foo bar baz) rep.x
    ...)
---

"So... would you consider the above (using a table to implement message passing) to be "more encapsulated" than your idea?"

Actually, I consider that example to be less encapsulated. Future code can come in and replace the 'call method and add new fields. That would be just fine with me.

...Do note that tables would have to be built into the core that way though. ;) It's to avoid loops again: Getting a table key by sending a message is incompatible with sending a message by getting a table key.

-----

1 point by Pauan 5105 days ago | link

"Here's a quick take at summarizing the issue anyway: Inspecting a list by sending it messages is incompatible with decoding a message by inspecting an argument list."

Okay... let me see if I understand you. What you're saying is, that if a function is given an argument list (like with rest args), then it can't extract the data out of that list, by sending messages? Why not? A cons is represented as the following form:

  (obj type 'cons
       rep ...
       car ...
       cdr ...)
You retrieve those by using attr [1]:

  (attr foo 'car)
  (attr foo 'cdr)
  ...
Now, the argument list is represented as a series of conses, and each individual cons is a table. Incidentally, this is similar to how I'm implementing conses in Python, except I'm using classes rather than tables. So suppose you have the following:

  (= args '(car))
Which is the same thing as this:

  (= args (obj type 'cons
               car 'car
               cdr nil))
               
  (= args.'rep args)
You can extract the car with (attr args 'car), and the cdr with (attr args 'cdr). All conses would be represented this way, including built-in types. So in the following function...

  (fn args ...)
...the variable args would also be represented that way, so calling (attr args 'car) inside the function should work fine. You did mention a recursive loop earlier:

"If 'car is implemented as a function and you call it and 'car tries to get its argument using 'car, you're in trouble."

Which would be represented as follows...

  (def car args
    (car args)
    ...)
...which is obviously a naive infinite loop. But that is solved by using attr:

  (def car args
    (attr args 'car)
    ...)
So I fail to see what that (infinite loop) has to do with message passing. What have I misunderstood?

---

"Nevertheless, I do find this feature of list-based types pretty helpful:"

Hm... that does seem pretty useful/neat/nifty! Not sure if it offsets the advantages of a table, though.

---

"Here's an old post of mine where I go over the pros and cons: http://arclanguage.org/item?id=12076. "

Neat! Great summary. For the record, I'm leaning toward the second form, with type being an attribute on the table. Then `type` can just be a convenience function (or defined for backwards compatibility).

"no way for different types' x fields to be accessed using the same code without doing something like standardizing the field order"

Yeah, that's kinda the killer (for me) for list-based types. Using lists seems so... short and light-weight, though.

Also, I found this amusing: "Hmm, that gives me an idea. Maybe what I miss most of all is the ability to tag a new datatype so that an existing utility can understand it"

Which is precisely what message passing does. But as you say, you seem to have moved onto other... different directions.

---

"...Do note that tables would have to be built into the core that way though. ;) It's to avoid loops again: Getting a table key by sending a message is incompatible with sending a message by getting a table key."

No worse than having fns in the core. :P Especially if I figure out a way to define fn using message passing. Then... tables would be primitives... and fn would be layered on top...?

---

* [1]: Why `attr`? Why not just use (foo 'car)? Because when a table has a 'call key, that will be used when calling it in functional position. In this case, it didn't have a 'call key, but in actual Arc code, conses would have a 'call key, which means that calling (foo 'car) wouldn't actually retrieve the 'car key... it would instead be equivalent to ((foo 'call) 'car). Thus, `attr` is necessary to distinguish between calling a table, and retrieving a key of the table.

-----

1 point by rocketnia 5105 days ago | link

"What you're saying is, that if a function is given an argument list (like with rest args), then it can't extract the data out of that list, by sending messages? Why not?"

Because that just sends arguments to another function, and that function tries to extract them, so it sends arguments to another function, and so on. At no point does any function succeed in determining what its first argument is.

"Why `attr`?"

FWIW, I believe I understand what 'attr is for. ^_^ It passes direct messages to a value without passing a "call" message. It just doesn't solve the paradox.

"But that is solved by using attr"

But how does the 'attr function extract its first argument?

---

"Not sure if it offsets the advantages of a table, though."

Me neither. ^_^ To even the field a bit:

Clojure has its own kinda wordy form of table destructuring. An Arc macro can do better than that when it's only trying to do non-recursive destructuring.

  (w/keys a b c rep.x  ; gets the 'a, 'b, and 'c keys
    ...)
I'm not sure if something this simple would help all the time, since I use alternate names for things pretty often (e.g. when dealing with two things of the same type), but it covers the more common cases.

Personally, I'd probably go for 'def-extension-type and blissfully ignore the internal representation altogether (except when I'm reading things at the REPL, hm...).

---

"['the ability to tag a new datatype so that an existing utility can understand it'] is precisely what message passing does."

It's what 'extend does too. The tagging is just done in a meta way. The way I see things now, the meaning of a value is only the op-understands-value relationships it supports (duck typing?), so extending an op is a valid way to change (tag) the meaning of a value.

---

"Especially if I figure out a way to define fn using message passing. Then... tables would be primitives... and fn would be layered on top...?"

I think you gotta encode behavior in some type or other. I see two ways to avoid functions: You could have tables hold s-expressions and interpret them, or you could have tables encode the behavior themselves. The latter would make tables objects, and they'd be indistinguishable (on a high level) from message-passing closures.

-----

1 point by Pauan 5104 days ago | link

"Because that just sends arguments to another function, and that function tries to extract them,"

Hm... well, I still don't get what you're saying, but I'll try implementing it and see if it's a problem or not.

---

"But how does the 'attr function extract its first argument?"

In a worst case scenario, it can be a built-in and use Python magic to avoid the paradox. Having one built-in (attr) is still waaaay better than having a ton of other built-ins (car, cons, cdr, maptable, etc.)

---

"Clojure has its own kinda wordy form of table destructuring. An Arc macro can do better than that when it's only trying to do non-recursive destructuring."

JavaScript also has a form of object destructuring [1]:

  {a, b} = {a: "foo", b: "bar"}

  // or was it like this...?

  [a, b] = {a: "foo", b: "bar"}
I think something that simple would help 99% of the time, so that's okay with me.

---

"Personally, I'd probably go for 'def-extension-type and blissfully ignore the internal representation altogether (except when I'm reading things at the REPL, hm...)."

That's all well and good for high-level coding, but somebody's going to have to determine what the low-level data type will be. I'm going with a table, in py-arc, thanks to our discussions.

---

"It's what 'extend does too. The tagging is just done in a meta way. The way I see things now, the meaning of a value is only the op-understands-value relationships it supports (duck typing?), so extending an op is a valid way to change (tag) the meaning of a value."

Ah, but the key difference is: where is the operator stored? Right now we're only using global functions as operators, but we could also store them as functions in a table. This is better in some situations... and extend is better in other situations.

So yeah, I'm leaning toward duck typing as well, and actually getting rid of the `type` function completely. My idea is that the "type" of something is based solely on it's prototype: `table` would be both a prototype and a constructor of tables (at the same time). You can then create new prototypes that inherit from `table`, and voila, they're tables.

This is sorta-kinda like how JavaScript does it, except JavaScript actually screwed up prototypes, so we can do a lot better in Arc. For starters, in JavaScript you can access the prototype, but you can't change it. That's definitely gonna change: in Arc, you should be able to dynamically change the prototype.

Something's a table, but want it to be a cons? Just change it's 'prototype attribute and bam, it now inherits from cons. Anyways, then the `isa` function would test for inheritance, rather than type. So rather than saying (isa foo 'table) you'd say (isa foo table), which would mean, "does foo inherit from table?" [2].

As far as I can see, the only downside is that isa would now be O(n) rather than O(1), but the prototype chain should be small enough that it won't make a big difference in ordinary programs.

---

"I think you gotta encode behavior in some type or other. I see two ways to avoid functions: You could have tables hold s-expressions and interpret them, or you could have tables encode the behavior themselves. The latter would make tables objects, and they'd be indistinguishable (on a high level) from message-passing closures."

I'm kinda leaning toward the second approach. Even in a worst case scenario, having two built-in primitives that can't be created in Arc (tables and fns) is still better than the current situation.

---

* [1]: https://developer.mozilla.org/en/New_in_JavaScript_1.7#Destr...

* [2]: I'll note that `isa` would probably be used rarely... just as I rarely use `instanceof` in JavaScript. It's only there on the off chance that somebody really does care what "type" something is.

-----

3 points by Pauan 5104 days ago | link

"JavaScript also has a form of object destructuring [1]:"

Okay, I just realized something... what if we changed destructuring so it worked on tables too? Then this would work:

  (let '(a b c) (obj a 1 b 2 c 3)
    ...)
This would also allow us to get a clunky form of keyword arguments:

  (def foo ('(a b c))
    (prn a b c))

  (foo (obj c 1 b 2)) -> nil 2 1
Which could be made nicer with a little sugar...

  (foo :c 1 :b 2) -> nil 2 1
  (foo {c 1 b 2}) -> nil 2 1
One advantage of this approach is that you can pass either lists or tables to the function:

  (foo '(1 2 3)) -> 1 2 3

-----

1 point by Pauan 5105 days ago | link

By the way, I just had an idea. We could even write the `annotate`, `type`, and `rep` functions in Arc:

  (def annotate (type rep)
    (fn (m . args)
      (case m
        'type type
        'rep  rep
              (apply attr rep m args))))
    
  (def type (x)
    (attr x 'type))
    
  (def rep (x)
    (attr x 'rep))
So calling `annotate` on a value would create a wrapper around it. But this also allows us to do something else... modify the type of an existing value, rather than wrap it.

-----

1 point by Pauan 5105 days ago | link

"If you implement those five types using (annotate 'table (fn ...)), then you have to implement three methods per type, for the same overall result. I believe this is what akkartik meant by "moving parens around.""

...but if creating a single table with extend is twice as verbose as message passing, then creating 5 tables with extend will still be twice as verbose as message passing. That seems to be more than just "moving parens around", though perhaps you could argue that the 2x difference doesn't matter.

Or perhaps you could argue that the 2x verbosity could be curbed with macros... or a construct like def-extension-type... in which case I reply that we can use the same techniques for message passing. But with O(1) efficiency rather than the O(n) efficiency of extend, and with the ability to easily define core functions in Arc itself.

-----

1 point by rocketnia 5105 days ago | link

Yes, that was exactly what I was saying with 'def-extension-type, and I even mentioned that you could use 'def-extension-type verbatim too (not that the name would make sense).

I've just written my reply to "the O(n) efficiency of 'extend" over here: arclanguage.org/item?id=14374

-----

1 point by rocketnia 5105 days ago | link

I missed this the first time around:

"Message passing should always be O(1) but extend is O(n) where n is the number of times the function has been extended."

As I said, the number of extensions of a function is almost always a constant in any given program (and a small one at that). I do believe your constant factor would be smaller than mine, but we can't decide that without runnable code. ^_^

-----

1 point by Pauan 5105 days ago | link

Right, which means you shouldn't be choosing one or the other based solely on efficiency. I mention it for completeness, since we're dissecting the pros and cons of the two approaches. And who knows, somebody might actually find the O(1) approach better for their needs.

However... if message passing can do everything just as good as extend (at least pertaining to data types), then that small efficiency advantage could be enough to tip things over.

-----

1 point by akkartik 5105 days ago | link

"It's not using the function car to get the message... it's using the symbol 'car."

Whoa. Should methods/messages be first-class like functions? Can I pass them around, store them in symbols, generate objects with gensym'd methods? How does 'call by name' interact with all that?

We can provide feedback about these things until we're blue in the face and it won't count for a thing because the first attempt at implementation will run into some blazingly obvious issue that we all missed. Why expend so much collective attention so inefficiently when you can just build it and see what happens?

-----

1 point by Pauan 5105 days ago | link

"Whoa. Should methods/messages be first-class like functions?"

Sure, why not? I actually envision them being implemented as functions, but other ways are possible. In JavaScript, methods are ordinary functions.

As for messages being first-class... they sorta already are, in the sense that the messages are symbols. I don't see much reason to give messages a special type; they can just be symbols.

---

"How does 'call by name' interact with all that?"

Hm... not sure.

---

"Why expend so much collective attention so inefficiently when you can just build it and see what happens?"

Because py-arc is in a pretty sad state right now. :P Once it's spruced up, implementing message passing should be trivial.

-----

1 point by akkartik 5105 days ago | link

"Because py-arc is in a pretty sad state right now. :P Once it's spruced up, implementing message passing should be trivial."

:) I think you're underestimating the difficulty of this. You need isa to somehow simulate a set of messages to an object before deciding that yes, it fits that interface. I have no idea how to do that without causing undesirable side effects. Performance is another concern.

I think interfaces work in Go because it's statically typed.

-----

1 point by Pauan 5105 days ago | link

It doesn't need to simulate anything... if something has a type of 'table, then it uses the table interface. If something has a type of 'stream then it uses the stream interface. Thus, isa could work exactly the same way it does in pgArc[1].

* [1]: Of course, when I try to implement multi-types, then isa would need to change, but that's not strictly speaking necessary right now.

-----

1 point by akkartik 5105 days ago | link

Hmm, so each value in arc would carry around a table mapping messages to functions?

-----

1 point by Pauan 5105 days ago | link

Yes, basically. That means if you want to create a custom table type, you only need to create an annotated function that maps messages (symbols) to behavior (functions), and voila, it all works seamlessly.

---

I find it amusing that your explanation pretty much sums up thousands and thousands of words that I've said. I really am too verbose. ^^;

-----

2 points by akkartik 5105 days ago | link

"I find it amusing that your explanation pretty much sums up thousands and thousands of words that I've said. I really am too verbose. ^^;"

You were probably more intelligible to everyone else; I seem to have been uncommonly dense :) What you're describing is indeed message passing. I think I was misled by your code snippets.

You're basically describing an s-expression syntax for smalltalk.

-----

1 point by Pauan 5105 days ago | link

"You were probably more intelligible to everyone else; I seem to have been uncommonly dense :) What you're describing is indeed message passing. I think I was misled by your code snippets."

I was implementing message passing by using functions + closures, so it's definitely a weird form of it... most languages implement message passing with objects + methods. So, your confusion is understandable.

-----

1 point by akkartik 5105 days ago | link

Once I make the connection to smalltalk, wrapping these objects in the (annotate ..) layer seems redundant. Just get rid of types entirely.

We're probably going to rediscover why a s-expression syntax for smalltalk is a bad idea, but it should be a fun process.

I'm reminded of this thread: http://people.csail.mit.edu/gregs/ll1-discuss-archive-html/m...

-----

1 point by Pauan 5105 days ago | link

"Once I make the connection to smalltalk, wrapping these objects in the (annotate ..) layer seems redundant. Just get rid of types entirely."

So... what... just use duck typing for everything? :P Duck typing is easier, but has a greater possibility for strange bugs and conflicts. But Arc is a LFSP, so that might be a good fit for Arc.

---

Neat! In the link you mentioned, I found this: http://okmij.org/ftp/Scheme/oop-in-fp.txt

Which, in fact, describes something almost completely identical to my message passing idea, except that it's more verbose in Scheme than in Arc. It even demonstrates prototypical inheritance.

The only difference is that they're using a separate function to represent each method... whereas in my examples, I've basically written the methods "inline" so there's only one function. However, I did demonstrate one way to give each method a separate function: http://arclanguage.org/item?id=14368

-----

1 point by akkartik 5105 days ago | link

Are you planning to have all say string values contain a pointer to the same table instance, or give each string value its own copy of the table?

-----

1 point by Pauan 5105 days ago | link

Not sure. Either way could work. That seems to be more of an optimization, though. The interface should work the same either way.

-----

1 point by akkartik 5105 days ago | link

It probably bears thinking about. It'll affect, for example, how you say, "I want all tables to have this new operation."

-----

1 point by Pauan 5107 days ago | link

By the way, you mentioned the "expression problem", and I looked it up. You're right, that is more or less what we're talking about. Except that my proposal is only concerned with creating new values, because I have not seen or thought of a proposal that is fantastic at creating both values and operations...

So rather than trying to completely solve the expression problem by creating a unified system, I'm instead proposing to split it up into two parts: message passing for data, and extend for programs. This is not as elegant as a unified model, but it sure as hell is easier to do, and I don't even know if a good unified model is possible.

---

Also, see this link:

http://www.cs.utexas.edu/users/wcook/papers/OOPvsADT/CookOOP...

I believe that is relevant to the discussion. What I am proposing is message passing (PDA), what you are proposing is extend (ADT). What I'm saying is that both are in conflict with each other, and that sometimes one is better than the other (depending on the situation)...

So rather than using extend for everything (your proposal), how about using extend for some things, and message passing for other things? That is what I'm proposing: using whatever model leads to shorter, more extensible code. I'm focusing on data types because data types are the only area (that I've seen so far) that message passing is significantly better than extend.

---

A quote that I found in the above pdf:

"In 1975, John Reynolds published a paper called “User-defined data types and procedural data structures as complementary approaches to data abstraction” [39] in which he compares procedural data abstraction to user-defined data types. He argued that they are complementary, in that they each has strengths and weaknesses, and the strengths of one are generally the weaknesses of the other. In particular, he found that PDAs offer extensibility and interoperability but obstruct some optimizations. ADTs, on the other hand, facilitate certain kinds of optimizations, but are difficult to extend or get to interoperate"

I have already demonstrated that in an earlier post (http://arclanguage.org/item?id=14261) that using extend for everything leads to problems, because extend isn't a silver bullet. Using a combination between extend and message passing is best.

-----

1 point by akkartik 5107 days ago | link

I enjoyed reading this, thanks.

"doesn't make it easy to make a new operation (message) that acts on existing kinds of values."

I can't imagine how anything could. Isn't this equivalent to "doesn't do your programming for you"?

"Functional programming tends to emphasize making new operations on existing values"

You're saying lisps cover this side, right? So it makes sense to focus on just the other half.

I think Arc already supports both sides, but it almost seems an accidental property of the implementation. Most lisps don't support extensibility, they lock down the primitives. I've complained about this before: http://news.ycombinator.com/item?id=2329680, http://arclanguage.org/item?id=12803.

I speculate that in the dawn of time a lisp introduced equal because they needed to compare lists and couldn't change eq. Or they assumed eq was set in stone since it was in the original McCarthy paper (http://paulgraham.com/rootsoflisp.html). I think of this as the original sin (http://www.arclanguage.org/item?id=13690).

But I'm preaching to the choir here :) I want a better syntax for extend just so that extensibility is baked into the language. And I want macros to be extensible just like functions. extend doesn't do that, but wart's def :case does (http://arclanguage.org/item?id=13790). I'd forgotten this when I called it syntactic sugar (http://www.arclanguage.org/item?id=14247).

-----

1 point by rocketnia 5107 days ago | link

"I can't imagine how anything could. Isn't this equivalent to "doesn't do your programming for you"?"

I demonstrated how 'extend could, and although there'd often be more code to write given languages or core library designs where the extension problem is hard, I wouldn't phrase it in as magical a way as doing your programming for you.

I'm not suggesting that a new operation will work on existing values without you specifying the behavior for those values; I'm suggesting that you're given the ability to specify behavior for existing values in the first place.

---

"You're saying lisps cover this side, right? So it makes sense to focus on just the other half."

It's not something that can be solved one piece at a time. Pauan's prototype system makes it hard for a programmer to tell whether a given value is really a core 'table or just inherits from 'table, and that makes makes it hard to specify new behaviors that only apply to cases low on the inheritance hierarchy.

Using the prototype system, I'm sure we can make a new kind of value with support for certain existing operations, but then it'll be hard to make a new operation with support for that kind of value.

---

"I think Arc already supports both sides, but it almost seems an accidental property of the implementation. Most lisps don't support extensibility, they lock down the primitives."

Right, that's a big reason I still use Arc, rather than some other language that's solved its expression problem. I see that as being kind of an emergent feature of having this particular language be so unrestrictive of the programmer. Intentional or not, it's a good language to explore the kind of extensibility I'm interested in. ^^

---

"I speculate that in the dawn of time a lisp introduced equal because they needed to compare lists and couldn't change eq."

Whatever the case, I use weak hash tables, I'd expect a well-designed weak hash table to do an 'eq comparison (the alternative being for unreachable keys to stick around in case an equivalent one is constructed), and at that point I could implement 'eq like so:

  (def eq (a b)
    (.b:weak-table a t))

-----

1 point by Pauan 5107 days ago | link

"It's not something that can be solved one piece at a time. Pauan's prototype system makes it hard for a programmer to tell whether a given value is really a core 'table or just inherits from 'table, and that makes makes it hard to specify new behaviors that only apply to cases low on the inheritance hierarchy."

Even if we accept prototypes (which aren't necessary for message passing), that's not true. Data types could have a 'type message that would return it's low-level implementation, in the rare event that you actually needed it. Or they could have a 'proto message that would return what their prototype is.

Also, perhaps you are unaware of how prototypes work in JavaScript... it is quite easy to tell (once you know how) whether something is an Array, or simply inherits from Array [1] (it is pretty much never necessary to actually do this, though...) It is also possible to get completely fine-grained control, so you can extend all arrays, or a specific subset of arrays, or an individual array.

But that's beside the point, because I'm not talking about prototypes (http://arclanguage.org/item?id=14305).

---

* [1] Object.prototype.toString.call(foo) will return "[object Array]" if and only if foo is an actual array. If it's a "subclass" of Array, then it will be "[object Object]" Clunky? Yes. But that has nothing to do with prototypes; it's clunky because prototypes in JavaScript suck. Prototypes can be done in a much more elegant way.

I think it should be `foo.constructor === Array`, but as I said, prototypes in JavaScript suck. That wouldn't be a problem in Arc.

-----

1 point by Pauan 5107 days ago | link

"You're saying lisps cover this side, right? So it makes sense to focus on just the other half."

Pretty much. That's what I'm doing: focusing on the other side. Which, in the case of data types at least, makes programs significantly shorter and easier to write.

I'm not proposing we write everything in message passing style. I'm only proposing we use message passing when it makes programs better.

-----

1 point by rocketnia 5111 days ago | link

"Let's suppose you wish to create a custom table type. How would you go about doing that in pgArc?"

I haven't gotten all the way through your post, but I'm pretty sure it's impossible to define callable types in pg-Arc without dropping to Racket, and even then it might be impossible to do it cleanly unless you've loaded ac.scm with Racket's 'compile-enforce-module-constants parameter set to #f. Because of this, I recommend Anarki and Rainbow when it's necessary to have callable types, since they have 'defcall.

But hey, here's an unclean way. ^_^

The only things callable in pg-Arc are functions, tables, strings, and lists, and the only one of those that can have significantly custom behavior is the function. (This is set in stone thanks to 'ar-apply being on the Racket side, where the Racket compiler determines it's a constant.) We'll need to represent custom tables directly as Racket functions that provide the getter behavior.

This doesn't give us an easy way to get the internal representation of the table, or even to distinguish our tables from other functions. Fortunately, we can tack extra metadata onto an encapsulated value like a function by stuffing it into a weak hash table. We just need to make sure our tables aren't 'eq? to any other functions, which in our case should be true thanks to the fact that they capture the local variable(s) holding the storage for this particular table.

(I think pg-Arc does allocate a new function every time a (fn ...) form executes, meaning we don't necessarily need to make sure the function captures something, but Rainbow will compile certain (fn ...) forms into constants, and for all I know Racket may change its behavior in a future version.)

  (mac $ (x) `(cdr `(nil . ,,x))  ; bug exploit to drop to Racket
  
  (= isa-my-table ($.make-weak-hasheq))
  (= rep-my-table ($.make-weak-hasheq))
  
  (def my-table ()
    (let internal-representation (...)
      (let result (fn (key (o default))
                    ...behavior for getting...)
        (= isa-my-table.result t)
        (= rep-my-table.result internal-representation)
        result)))
  
  (extend sref (self val (o default) . rest)
            (and isa-my-table.self no.rest)
    ...behavior for setting...)
At this point, type.x will be 'fn for custom tables. Let's say we don't want that.

We could overwrite 'type, but that doesn't change the type seen internally by 'ac-macro?, 'ar-tag, and 'ar-coerce in ac.scm. They all use 'ar-type directly. Fortunately, nothing uses 'ar-tag and 'ar-coerce but Arc's 'annotate, '+, and 'coerce, all of which we can overwrite on the Arc side. That leaves 'ac-macro? as the one holdout... and that one checks specifically for the type 'mac, so there's nothing lost if it sees 'fn rather than our table type.

It would be interesting to actually write the code to replace 'type, 'annotate, '+, and 'coerce, but I don't have time right now.

-----

1 point by Pauan 5111 days ago | link

"I haven't gotten all the way through your post, but I'm pretty sure it's impossible to define callable types in pg-Arc without dropping to Racket, and even then it might be impossible to do it cleanly unless..."

Yes, that was my entire point all along. The whole point of this post was in fact to demonstrate that it's impossible or obscenely difficult with pgArc, but incredibly easy with methods. That is what I've been trying to do this whole time: demonstrate why methods are awesome, and more extensible than things like (readc) and (readb) functions.

-----

1 point by akkartik 5111 days ago | link

What do you mean by methods?

-----

1 point by Pauan 5111 days ago | link

A callable thing (like a function) that accepts a key parameter (usually a symbol), that then does something. Yes, it's quite vague, intentionally. I'm using the word "method" because the concept is similar to "methods" in OOP languages.

Consider this function:

  (def foo (x y)
    (case x
      'add (+ 1 y)
      'sub (- 1 y)))

  (foo 'add 5) -> 6
  (foo 'sub 5) -> 4
I'm calling this thing a "method" in the sense of "foo has a method add and a method sub". It's not formalized, just a convention. They don't even need to be symbols. To look at it another way... it's inverting a function call. Thus:

  (add foo 5)  -> 6
  (foo 'add 5) -> 6
Sometimes the first form is appropriate, sometimes the second form is. I believe in the case of input streams, that the second form is better. In other words, (stdin 'char) rather than (char stdin), for the reasons already given.

A table could have methods as well:

  (= foo (obj
           add (fn (x) (+ 1 x))
           sub (fn (x) (- 1 x))))
           
  (foo.'add 5) -> 6
  (foo.'sub 5) -> 4
This is analogous to the concept of classes/objects/methods in languages like JavaScript or Python:

  var foo = {
      add: function (x) { return x + 1 },
      sub: function (x) { return x - 1 }
  };

  foo.add(5) -> 6
  foo.sub(5) -> 4
Incidentally, in JavaScript, a "method" is simply a function that is attached to an object. The only difference between a JavaScript "method" and an ordinary function, is the `this` binding.

-----

1 point by akkartik 5111 days ago | link

Perhaps you just want guards?

http://en.wikipedia.org/wiki/Guarded_Command_Language#Guarde... http://en.wikibooks.org/wiki/Haskell/Control_structures#Guar...

Guards work particularly well with case expressions.

http://en.wikibooks.org/wiki/Haskell/Control_structures#Equa... http://arclanguage.org/item?id=13790

-----

1 point by Pauan 5111 days ago | link

That's a potential way to make methods easier to define/use, but doesn't replace the concept. Let me put it this way... where do you define a function?

The traditional answer is "in the global scope" or maybe in a let block or something. That's all well and good, in most circumstances. In the case of a data-type, though, you want the possibility to create custom data types, like a custom table, or a custom stream.

The problem is... in this case, because things like peekc and readc are global functions, you end up needing to do some hacky stuff, extending them in a very verbose way to add in your custom stream type.

So instead, let's think of a different place to define the functions. In this case, the functionality is local to the stream: each individual stream decides how it is read. Thus, rather than defining a global function that operates on streams... we define a 'peek and 'char function on the stream itself. That, is the fundamental concept of methods.

Of course, this is only necessary for the primitive functions like readc and peekc. Functions that build on top of them (like readline) can be written in the normal way. So, I'm proposing that Arc remain essentially the same, with the key difference that data-types (tables, input, output, etc.) are implemented as methods, which makes it way easier to define custom data types.

-----

1 point by akkartik 5111 days ago | link

I'm totally with you, not trying to replace any concept, just suggesting alternative terminology, something that isn't so overloaded.

And pointing out that a language with these 'places to put functions' doesn't need functions anymore. Your 'methods' are just a form of dispatch, and you want programmable dispatch policies. In the simplest case, no dispatch policy. In a language with your methods you need no concept called functions.

Wart already does all this. You can already overload peek or readc. I'd love for you to take a look and see if it does what you're talking about. Don't panic :) it's super brief and readable by all accounts. (or if you've taken a look already just say so and I'll stop with all the PR.)

-----

1 point by Pauan 5111 days ago | link

Alright, alright, I'll take a look. :P

And yes, I'm aware that the term "method" is overloaded, but I still feel it's appropriate. Consider earlier where I showed functions that are placed into a table. I don't think that could reasonably be described as guards, yet they can be reasonably described as methods.

Also, you're right that methods (in OOP languages) can simulate functions, and vice versa, but I feel like functions are "lighter weight". For starters, a method must always be attached to something, it can't be global. I like functions, so I'd like to keep 'em, I just feel like methods are more appropriate in certain (probably rare?) circumstances.

Assuming that we use the function approach rather than the object approach, then "guard" would indeed be a very reasonable name for them. So thanks for that.

-----

1 point by akkartik 5111 days ago | link

Yeah, we can agree to disagree on terminology :) I think case expressions with guards are more general than C++-style vtables because you can dispatch on arbitrary expressions rather than just lookup a symbol in a table. It seems a pity to come so close and give up power you can get for free.

-----

1 point by Pauan 5111 days ago | link

Yeah, except that C style languages (including JavaScript) make `switch` so awful that you end up using if/else chains. A missed opportunity, to be sure. Arc does a better job, which is good.

By the way, I'm curious, how would you go about creating a custom table type in wart? In my hypothetical ideal Arc, it would be like this:

  (annotate 'table (def my-table (x n v)
                     (case x
                       'get ...
                       'set ...)))
Pretend that it's been made pretty with macros. The ... after 'get is called when doing (my-table 'foo) and the ... after 'set is called when doing (= (my-table 'foo) 5)

Incidentally, my hypothetical ideal Arc wouldn't even be particularly difficult to do, but I'm curious if there's a better (shorter or more extensible) approach than what I gave above.

-----

1 point by akkartik 5111 days ago | link

"By the way, I'm curious, how _would_ you go about creating a custom table type in wart?"

Hmm, I didn't really follow your example. Arc already has a table type. Do you mean you want a literal syntax for tables, so that you can say {a 1 b 2} or something?

-----

1 point by Pauan 5111 days ago | link

No... I'm talking about writing something in Arc that behaves like a table, and fits seamlessly into Arc. Consider my earlier prototype post (http://arclanguage.org/item?id=13838). A prototype behaves exactly like an ordinary table, with the only difference being that if it can't find a key, it checks it's parent.

As I explained in that post, it's impossible to do this seamlessly in Arc:

  (= my-proto (clone nil x "foo"))

  (my-proto 'x)           -> "foo"
  (= (my-proto 'x) "bar") -> "bar"
  (keys my-proto)         -> (x)
  (vals my-proto)         -> ("bar")
The above is impossible to do in Arc. What I am describing is a simple interface that makes the above not only possible, but easy. It lets you create new data structures that Arc treats as if they were tables. I will write a post about this ina sec.

-----

1 point by akkartik 5110 days ago | link

Ah, I see. Here's how you do prototypes in wart:

  $ git clone git@github.com:akkartik/wart.git
  $ cd wart
  $ git checkout a5d25c805a97347d1207 #Current HEAD for those trying this in future

  $ cat > 099heir.wart # Just kidding; copy and paste the following lines
                       # until the '^D' into 099heir.wart

  ; A new type: heir (because it inherits from a parent)
  (def clone-prototype(x)
    (annotate 'heir (obj val (table) parent x)))

  ; Extend ssyntax
  (defcall heir h
    [or rep.h!val._
        rep.h!parent._])

  ; Extend assignment
  (defset heir(h key value)
    (= rep.h!val.key value))

  ; Extend keys
  (def keys(x) :case (isa x 'heir)
    (join (keys rep.x!val)
          (keys rep.x!parent)))
  ^D

  # Now start your engines
  $ wart

  wart> ; And try the following expressions at the prompt
  (= a (table))
  (= b (clone-prototype a))

  ; inheriting from parent works
  (= a!x 23)
  (prn b!x) ; => 23

  ; assignment works
  (= b!y 34)
  b!y ; => 34
  a!y ; => nil

  ; keys works
  keys.a ; => (x)
  keys.b ; => (x y)

  ; vals is left as an exercise for the reader :)

  wart> (quit) ; or hit ctrl-d a few times

-----

1 point by Pauan 5110 days ago | link

Yeah, that's more-or-less the way to do it in pgArc as well (but using extend rather than :case).

It's nice that wart makes it so easy to extend stuff (with :case and defcall), but that doesn't help with the situation of having to extend multiple built-in functions (like keys, vals... anything that relies on tables, even user functions!) just to define one new type.

A further benefit of my system is that your custom tables can have a type of 'table, so existing code that expects tables can use them too.

-----

1 point by akkartik 5110 days ago | link

"that doesn't help with the situation of having to extend multiple built-in functions.. even user functions.. just to define one new type."

I can't think of a situation where I'll need to override user functions once I've overridden the basic operations. Can you? I think you just need to override these ops: https://github.com/akkartik/wart/blob/a5d25c805a97347d1207e7...

If user functions are untouched I think one approach is as verbose as the other, whether you say: (= x (fn args (case car.args ...))) or def :case over and over. The latter seems more declarative to me.

-----

1 point by Pauan 5110 days ago | link

The verbosity isn't in the actual declaration. It's in the need to extend built-in functions. My system avoids that completely: no extending.

As for extending user functions... consider this: what about code that expects something of type 'table? Your prototype is of type 'heir... so that won't work. My system does work in that situation. That can be mitigated by having functions coerce to 'table, and then define a defcoerce for heir, but why jump through so many hoops when you can get all that for free?

To put it another way, my system uses duck typing: if it supports these arguments, use it. This means that all you need to do is define something that looks like a table, and Arc automatically treats it as if it were a built-in table. No need to extend Arc's built-ins to convince it that, "yes I really really really want this to be a table."

-----

2 points by rocketnia 5110 days ago | link

"what about code that expects something of type 'table?"

I blame that code for its lack of extensibility. It should just go ahead and use the value as a table, trusting the extensions to be there (and using failcall to recover, of course ^_^ ).

If it must do a boolean check to determine the value's features (perhaps because failcall isn't available, or because it needs to fail fast), it shouldn't do a direct type check. Instead, it should call another global function dedicated to the inquiry of tableness (which is probably too specific an inquiry anyway, most of the time). That way, people can extend the tableness function too.

Extensibility isn't the only benefit of avoiding [isa _ 'table]. It also becomes possible to have a single value that supports table operations and (say) input stream operations at the same time. It's probably a silly combination, but a custom type based on an association list could warrant it.

-----

1 point by Pauan 5110 days ago | link

You're missing the point... the problem is not when the function expects something of type table... it's when the function needs to change it's behavior based on the interface. Consider `each`. It needs to change it's behavior depending on whether it's a table or a cons or a string. How is using (table? foo) better than (isa foo 'table)? Both are the same as far as extensibility goes. And in fact, you can do that right now, in Arc:

  (def table? (x) (isa x 'table))
And you're also missing the whole point of message passing... message passing enables duck typing, which means it's possible to write a function that doesn't care what it's type is, as long as it supports the required message.

You're trying to solve the symptom... I'm trying to solve the problem. What I'm saying is that re-extending all the relevant built-ins is ridiculously inefficient and verbose, and that there's a far better way. Message passing not only drastically reduces verbosity, but it also solves the problem you mentioned.

I suggest you go and read http://arclanguage.org/item?id=14237 which already explains this in more depth.

I think the current way that people think about types is fundamentally wrong, and I'll write a post about that in a moment.

Edit: here's the post http://arclanguage.org/item?id=14261

-----

1 point by rocketnia 5109 days ago | link

"Consider `each`. It needs to change it's behavior depending on whether it's a table or a cons or a string."

Yes. In a world with failcall, it can try treating the value as a table in one rule, as a cons as another rule, and as a string in a third rule. In a world without failcall, it can do the same thing, but with the help of extensible global functions 'atable, 'acons, and 'astring. That's merely a specific example of what I just said.

---

"How is using (table? foo) better than (isa foo 'table)? Both are the same as far as extensibility goes."

Sure, there's not a whole lot of difference between extending 'table? and extending 'isa. However, in my mind 'isa has a clear role as a way to check the most specific possible feature of a value: the kind of internal data it has.

One could go all the way down, making a (deftype wrap-foo unwrap-foo isa-foo) form that eliminates the need for 'annotate, 'rep, 'isa, and 'type altogether. In this example, 'isa-foo would be a function that indicates support for 'unwrap-foo, and 'wrap-foo would be a simple data constructor that makes an encapsulated value with nothing but innate support for 'unwrap-foo. (In a system with failcall, 'isa-foo is pretty much redundant.)

This is basically how the Kernel draft deals with custom types (except without an eye toward extensibility), but I haven't been inspired to go quite as far as this in practice. I think programmers should be free, if discouraged, to break encapsulation of values they don't know the nature of, like Arc programmers can when they use 'type and 'rep.

"And in fact, you can do that right now, in Arc[...]"

Cool, huh? :-p

Obviously pg-Arc hasn't been built from the ground up this way, but Arc Forumgoers have had an awful lot of extensibility ideas since Arc 3.1 was released, and pursuing any one of them in Anarki would change the core so much as to turn it into a separate language. That's why I don't call Penknife a version of Arc.

---

"And you're also missing the whole point of message passing... message passing enables duck typing, which means it's possible to write a function that doesn't care what it's type is, as long as it supports the required message."

That's the whole point of extending global functions too. I don't care what something's type is, as long as all the utilities I use have been extended to deal with it.

As I mentioned above with 'deftype, the 'extend approach can go all the way down to the level of making the 'type function itself merely a reflection tool. Obviously pg-Arc isn't built this way, but we've had an awful lot of extensibility ideas here since Arc 3.1 was released, and pursuing any one of them in Anarki would practically turn it into a separate language. (That's why I don't call Penknife a version of Arc.)

---

"I suggest you go and read http://arclanguage.org/item?id=14237 which already explains this in more depth."

I've already read that, thanks. I just wasn't convinced, and I found more interesting arguments to make over here. ^^

"Edit: here's the post http://arclanguage.org/item?id=14261 "

My next order of business is to read that.

-----

1 point by Pauan 5109 days ago | link

"Yes. In a world with failcall, it can try treating the value as a table in one rule, as a cons as another rule, and as a string in a third rule."

How? I've provided concrete examples where my solution is significantly shorter at creating new data types than the current solution, while still retaining `isa`, while avoiding problems. If you have a solution that can do all that, then please present it.

---

"In this example, 'isa-foo would be a function that indicates support for 'unwrap-foo, and 'wrap-foo would be a simple data constructor that makes an encapsulated value with nothing but innate support for 'unwrap-foo. (In a system with failcall, 'isa-foo is pretty much redundant.)"

Why? My system already does things similar to that, but in a very simple way, that's compatible with pgArc.

---

"However, in my mind 'isa has a clear role as a way to check the most specific possible feature of a value: the kind of internal data it has."

This is precisely what I meant when I said that I think the current view of types is fundamentally wrong. A table is not a hash. It's an interface that could potentially be implemented as a hash... or something else. Arc leaves it unspecified, which I consider to be a Good Thing.

---

"That's the whole point of extending global functions too. I don't care what something's type is, as long as all the utilities I use have been extended to deal with it."

...but as I've already mentioned repeatedly (and demonstrated in a post), that way is significantly more verbose than it has to be. My solution solves that, in a way that is more hackable/extendable/succinct. That is the goal of Arc, yes? Hackability and succinctness? I still have not seen a solution that matches mine for succinctness, let alone extensibility. Please provide one, if you're not convinced that my way is superior. Then I will back down.

-----

1 point by Pauan 5109 days ago | link

By the way, I just noticed something. Check out this link: http://wiki.call-cc.org/Python-like%20generators

Note that they're using the message-passing idea: after an iterator has been created, you can call it with 'status?, 'dead?, 'alive?, or 'kill! to get/set it's private data.

What I'm suggesting is to treat all compound data types like that. You can then create function wrappers around them, which is in fact what they did too: note the definition of `iterator-empty?`

Every individual iterator has it's own private state, so it doesn't make sense to require extending built-in functions every single time you create a new iterator. Instead, you define a function wrapper (like `iterator-empty?`) that accesses the internal data for you. No need to extend anything! That conciseness is what message passing gives you.

-----

2 points by akkartik 5109 days ago | link

"define something that looks like a table, and Arc automatically treats it as if it were a built-in table"

Bringing http://arclanguage.org/item?id=14268 back here, how would you convert code like (isa x 'table) into a method framework? Wouldn't it require getting rid of types altogether?

-----

1 point by Pauan 5109 days ago | link

"how would you convert code like (isa x 'table) into a method framework?"

Haha... that's what I've been explaining all along! That's the whole point of message passing: defining a low-level framework that makes it easy to both create new types, and also extend existing types.

That is, in fact, what this post is describing: http://arclanguage.org/item?id=14237

-----

1 point by akkartik 5106 days ago | link

Go seems to do this really well. Functions can ask for a specific interface, which is defined as a set of operations. But types don't have to declare that they implement the interface. http://golang.org/doc/go_tutorial.html

I still don't see what this has to do with methods, though. (I still can't follow that humongous comment[1]). Both C++ and Java have methods. No language but Go tests for an interface by seeing the functions it supports.

And this isn't the same as duck typing at all. Arc does duck typing just fine. In arc or python there's no way to convey tableness or any other ness. You just use the operations you care about, and hopefully any new types remembered to implement them.

Duck typing has no contract, Java has too rigid and inextensible a contract. Go is just right. And you're hosed in any of them if you write code of the form if (isa x table) ..

[1] I think I've missed half your comments in recent weeks. Either because they're so long and hard to follow, or because there's just too many of them and I forgot to go to the second page of new comments. The latter is a weakness of this forum structure. For the former, well, I think you spend too much time describing methods. Most of us have seen them in other languages. What you suggest is subtly different, and the subtleties are lost in the verbiage.

Which is why runnable examples are useful. You're right that wart differs from arc in subtle ways. That's why I have unit tests, to enumerate all the subtleties. I think you're calling it out (fairly) for lots of gritty details, and then doing the same thing even more egregiously.

Or maybe I just suck at reading comprehension compared to the rest of the group here :)

-----

1 point by Pauan 5105 days ago | link

"Go seems to do this really well. Functions can ask for a specific interface, which is defined as a set of operations. But types don't have to declare that they implement the interface. http://golang.org/doc/go_tutorial.html "

So... Go's interfaces actually look an awful lot like message passing. Here's a comparison:

  type reader interface {  // Go
      Read(b []byte) (ret int, err os.Error)
      String() string
  }


  (annotate 'reader        ; py-arc
    (fn (m b)
      (case m
        'read   ...
        'string ...)))


  (object reader           ; py-arc, with sugar
    read   (b) ...
    string ()  ...)
The idea is, every object is a collection of methods, and also a type. You can just call the methods directly, bypassing the type (this is duck typing), or you can check the type to make sure it implements the interface you want... or you can use coerce to change it's interface. Here is an example of the three ways, assuming foo implements the reader interface:

  (attr foo 'read)                   ; duck typing
  
  (if (isa foo 'reader)              ; interface checking
    (attr foo 'read))
     
  (attr (coerce foo 'reader) 'read)  ; coerce to reader
Message passing is simply the low-level way to create objects + methods... or prototypes. Or rulebooks. Or other things. To put it an other way, objects are just macros that expand to message passing.

Think of it like continuations and iterators. Continuations are lower-level than iterators, and it's possible to create iterators by using continuations. Message passing is the low-level idea, which can be used to create higher-level things, like prototypes, objects, or interfaces.

---

"Duck typing has no contract, Java has too rigid and inextensible a contract. Go is just right. And you're hosed in any of them if you write code of the form if (isa x table) .."

But what if `isa` tested for an interface? In other words, (isa x 'table) would be like saying, "does x support the table interface?"

---

"I think I've missed half your comments in recent weeks. [...] For the former, well, I think you spend too much time describing methods."

Yeah, I've probably been a bit too excitable about this... ^^;

---

"I think you're calling it out (fairly) for lots of gritty details, and then doing the same thing even more egregiously."

I'm not saying it's a bad thing that wart differs from pgArc, I was just explaining why I haven't immediately jumped in and started using wart. I think it's good to try out different approaches. Nor am I expecting people to immediately jump in and start using message passing... especially since message passing probably isn't useful for everybody.

I was presenting my ideas in the hope of getting useful feedback... so that my ideas can be improved, or rejected. I know that you guys are smart, and by presenting my ideas here, you all have already helped me improve them. I was just looking for some feedback (which I'm finally getting, which is good).

Also, I think it's perfectly possible to discuss and improve on the idea without needing a runnable example. You're right that I could have explained the idea better, though.

-----

2 points by akkartik 5105 days ago | link

"I think it's perfectly possible to discuss and improve on the idea without needing a runnable example. You're right that I could have explained the idea better, though."

Agreed. I'm giving you feedback on what would help me understand the idea. Sometimes if an idea has too many subtle nooks and crannies it helps me to see running code. That doesn't compel you to provide it, of course.

"I was just looking for some feedback."

You're not just looking for feedback, you're aggressively/excitably demanding it :) But you'll be more likely to get it from me (as opposed to super boring meta-feedback, and agonized grunts about your writing quality) if you provide runnable code. Pseudocode is no different than english prose.

"Nor am I expecting people to immediately jump in and start using message passing.."

I'm not sure what you're expecting when you say things like "what more do I have to do?". What more do you have to do for what?

(Btw, the answer to the question 'what more do I have to do' is.. give me runnable code :) I usually start out talking about an idea; if it is neither shot down nor embraced I go build it. If it's still neither shot down nor embraced I try to build stuff atop it. Keep doing that until you run into limitations or (slim chance) you build something so impressive people have to pay attention. Platforms need killer apps.)

---

"Go's interfaces actually look an awful lot like message passing."

Argh, here we go again..

  (annotate 'reader        ; py-arc
    (fn (m b)
      (case m
        'read   ...
        'string ...)))
Since you can write this in arc today, it's utterly confusing and distracting to me that you keep harping on it.

"what if `isa` tested for an interface?"

This seems to be the absolute crux of your idea. This needs changes to ac.scm. But it has absolutely nothing to do with message passing. Messages are not types, and your suggestion is to augment the type system with a keyword called interface or object that isa can accept. If you made that change, arc would support 'message passing' according to you. Am I right?

I'll summarize my position again. I buy that interfaces are foundational, but lots of things are foundational. Unification like in prolog, pattern matching like in haskell, lazy evaluation like in haskell, messages like in smalltalk, function calls like in lambda calculus. That something is foundational doesn't necessarily make it a good fit for a specific language. To show that something foundational is a good fit in arc and also backwards-compatible, provide runnable examples that look good/short and behave intuitively. Show us that existing mechanisms in arc can coexist with the new feature, that it occupies a niche that they don't already provide. No hand-wavy ellipses.

---

"I'm not saying it's a bad thing that wart differs from pgArc, I was just explaining why I haven't immediately jumped in and started using wart."

No explanation necessary. I really don't think wart needs to be part of this conversation. Just compare your approach with extend and we don't have to keep on going on about "backwards compatible? at least it runs!" and "I don't care about running it, it's not backwards compatible." :)

-----

1 point by Pauan 5105 days ago | link

"Since you can write this in arc today, it's utterly confusing and distracting to me that you keep harping on it."

Having built-in types like table/cons/etc. written in the same way, rather than as opaque blobs in Racket.

---

"I'm not sure what you're expecting when you say things like "what more do I have to do?". What more do you have to do for what?"

"What more do I have to do to demonstrate that message passing has advantages over extend?"

I asked that because it seemed that people weren't convinced that message passing was actually better than extend. This was unfortunate because:

1) It appeared to ignore the evidence that I was presenting

2) It seemed to basically dismiss message passing, but without explaining why. How am I supposed to know how message passing is flawed, unless people explain?

If the dismissal was caused by not understanding my idea, then it's my fault for not explaining well enough.

---

"[...] This needs changes to ac.scm."

Yup!

---

"But it has absolutely nothing to do with message passing. Messages are not types, and your suggestion is to augment the type system with a keyword called interface or object that isa can accept."

You're right: message passing is the low-level idea. Interfaces are built on top of message passing, but aren't necessary to support message passing.

---

"If you made that change, arc would support 'message passing' according to you. Am I right?"

No, in order for Arc to support message passing (at least in my mind), it would be necessary for the built-in types to also use message passing. As you pointed out, user-created Arc code can already use message passing, but that's not useful if you want to create something that behaves like a built-in type.

---

"[...] provide runnable examples that look good/short and behave intuitively."

Kay. I'll need to patch up py-arc first, though, because right now it doesn't support basic stuff (I'm looking at you, `apply`). Once I get py-arc into a decent enough shape, it should take less than a day to get message passing working.

-----

1 point by akkartik 5105 days ago | link

"Having built-in types like table/cons/etc. written in the same way, rather than as opaque blobs in Racket."

"in order for Arc to support message passing, it would be necessary for the built-in types to also use message passing"

Are you planning to replace (f x) everywhere with (x f)? That hardly seems backwards-compatible. (Forgive me if this is a stupid question. I have zero 'expertise' since I haven't read 75% of what you have written in this thread.)

If you provide a way to handle (f x) as an 'f message to x, then you shouldn't need to implement primitives.

"It seemed to basically dismiss message passing, but without explaining why. How am I supposed to know how message passing is flawed, unless people explain?"

You have to try it :) It's pretty clear that nobody here has tried quite what you're proposing, so what you thought of as dismissal was just people thinking (like me for the past few weeks) that they had nothing to add since they haven't tried it, and wanting to reserve judgement until they had an opportunity to play with an implementation. But maybe that's just me :)

Or did I miss a dismissive comment?

-----

1 point by Pauan 5105 days ago | link

"Are you planning to replace (f x) everywhere with (x f)?"

Nooope. Let's assume my-table is a table. On the left side is the current pgArc interpretation. On the right side is message passing:

  (my-table 'something)       -> (my-table 'get 'something)
  (= (my-table 'something) 5) -> (my-table 'set 'something 5)
  (keys my-table)             -> (my-table 'keys)
...but that's all hidden behind functions, so ordinary Arc code doesn't need to know that. In other words, `apply` would automagically convert (my-table 'something) into (my-table 'get 'something), so Arc code can't tell the difference. That's why it's backwards compatible.

---

"Or are you seeing dismissal in statements rather than silence?"

Mostly rocketnia, but that's fine since they now understand what I'm talking about, so we can actually discuss the idea. :P Their dismissal seemed to be because of a conflict in motivations.

-----

1 point by rocketnia 5105 days ago | link

"So... Go's interfaces actually look an awful lot like message passing."

That's a bit of a tautology. ^_^ If I'm not mistaken, you took the term "message-passing" from its object-oriented "foo.bar( baz )" context to begin with.

---

"Yeah, I've probably been a bit too excitable about this... ^^;"

Well, I (rocketnia) can't blame you for that, and I can't blame you for not having runnable examples either. I tend to be guilty of both those things a lot, and this discussion is a great example. >.>

-----

1 point by akkartik 5110 days ago | link

my system uses duck typing: if it supports these arguments, use it. all you need to do is define something that looks like a table, and Arc automatically treats it as if it were a built-in table.

I see, yeah you're right. The alternative to duck typing would be subtypes. Ugh.

No need to extend Arc's built-ins to convince it that, "yes I really really really want this to be a table."

You're saying more than that, you're saying how. That logic is common to both approaches. Your approach is superior anytime user code has language like if (isa x 'table) ... Otherwise the two are equivalent in power.

I always treated branching on types as a code smell. wart basically grew out of trying to remove every case of that pattern in the arc codebase. You've given me something to think about, thanks.

-----

1 point by Pauan 5110 days ago | link

By the way... I know using (case) is ugly. On the bright side, my system doesn't care how you write the actual function, just so long as you follow the interface. So you could use wart's :case for it as well:

  (def my-table ()
    ...)
  
  (def my-table (m) :case (is m 'keys)
    ...)
  
  (def my-table (m k) :case (is m 'get)
    ...)
    
  (def my-table (m k v) :case (is m 'set)
    ...)
...at least, I think that's how :case works? I actually like the wart style. That's supposed to be equivalent to this:

  (= my-table (fn (m k v)
                (case m
                  'keys ...
                  'get ...
                  'set ...)))
So yeah, it doesn't matter how you define your function (case or :case or `if` or extend or whatever), just so long as it follows the interface.

P.S. My goal here isn't necessarily to increase raw power, it's to increase extensibility and conciseness. Being able to do the same thing, but in a very hacky and verbose way doesn't sound appealing to me. So if wart has equivalent power, that's great! But I want to do better. I want to have equivalent power and make it shorter and make it more extensible.

-----

1 point by akkartik 5110 days ago | link

Yeah that was the half of your point I understood when I was peddling wart :) But yes wart won't handle user code that checks the type of an object if that is important to you.

-----

1 point by aw 5110 days ago | link

My http://awwx.ws/table-vivifier patch sounds like it may be close to what you want, though it goes too far: it extends tables so that if a key isn't found in the table, it calls a specified function to generate the value -- and then inserts the new value in the table.

If it would be useful, I could simplify it to leave off the last part, and then you could specify the action to take if the key isn't found (such as looking it up in a parent, for example).

-----

1 point by Pauan 5110 days ago | link

Pretty cool. But with my proposal, you could write that in Arc itself.

My solution is general... it works on all compound data types: tables, input, lists, etc.

Imagine being able to write cons, car, cdr, instring, peekc, readc, and more, all in Arc! It drastically cuts down on the number of primitives Arc needs.

-----

1 point by akkartik 5111 days ago | link

"C style languages (including JavaScript) make `switch` so awful that you end up using if/else chains. A missed opportunity, to be sure. Arc does a better job, which is good."

I don't follow that at all.

-----

1 point by Pauan 5111 days ago | link

In languages derived from C, including JavaScript, they use `switch` rather than `case`, like so:

  ; Arc

  (case x
    "foo" ...
    "bar" ...)


  // JavaScript

  switch (x) {
  case "foo":
      ...
      break;
  case "bar":
      ...
      break;
  }
As you can see, switch blocks are atrocious, especially because they require a break; statement or they'll fallthrough. So instead of using switch, some developers prefer if/else chains:

  if (x === "foo") {
      ...
  } else if (x === "bar") {
      ...
  }
Which, as you can see, are less verbose. Arc gets it right, JavaScript doesn't.

-----

2 points by thaddeus 5110 days ago | link

I'm going to argue that you're comparing apples to oranges to pears.

Arc and Javascript case functions are not expected to do the same thing. JS does not allow expressions for input arguments while Arc can. And that break; statement is a feature that some would say Arc lacks. ie, what if you want your case statement to fall through? - now Javascript is golden and one could say Arc is lacking.

I also don't believe developers, generally speaking, prefer if/else chains over switch/case statements as they are different tools intended for different purposes.

They both are not perfect, both are missing features that the other could benefit from.

I think Clojure got it right having it all: case + cond + condp.

-----

1 point by Pauan 5110 days ago | link

Not true... JavaScript allows arbitrary expressions in switch statements, so it does behave like Arc's `case`, only it's much more verbose:

  switch (true) {
  case 5 > 1:
      console.log("5 > 1");
      break;
  }
---

On the contrary, switch should not have a break statement! It should have a fallthru; or continue; statement. Why make the common case difficult? This should be uncontroversial... it's been well established that switch's design is poor, and they should have made it explicit fallthru (like with a continue statement) rather than implicit.

As for Arc "lacking" fallthru... can't you implement that with continuations? That would be the equivalent of explicit fallthru.

---

My point was that switch statements are so atrocious that even in the situations that they were intended for, some developers still prefer if/else chains because they're shorter and more readable.

-----

1 point by waterhouse 5110 days ago | link

> Why make the common case difficult?

I think that's because the above semantics correspond more directly to assembly language, which I imagine was done because "switch" was defined back when that was either important or just not seen to be bad. (Perhaps I'm just making that up, though.) Here's how a switch statement would look in x64 assembly:

  switch:
        ;put desired thing in rax, let's say
  case_1:
        cmp rax, val_1
        jne case_2      ;jump if not equal
        <case 1 code>
  case_2:
        cmp rax, val_2
        jne case_3
        <case 2 code>
  case_3:
        cmp rax, val_2
        jne done
        <case 3 code>
  done:
        <whatever>
By default, the machine will just plow through the remaining cases. If you want it to break out after one case, you have to tell it to do so:

  switch:
        ;put desired thing in rax, let's say
  case_1:
        cmp rax, val_1
        jne case_2      ;jump if not equal
        <case 1 code>
        jmp done        ;jump.
  case_2:
        cmp rax, val_2
        jne case_3
        <case 2 code>
        jmp done
  case_3:
        cmp rax, val_2
        jne done
        <case 3 code>
  done:
        <whatever>
Assembly language is kind of awesome, by the way. And so is the Miller-Rabin primality test. Some disorganized code: http://pastebin.com/raw.php?i=wRyQ2NAx

-----

1 point by Pauan 5110 days ago | link

Fine. That's great and all for C, but I dislike how Java copied C word-for-word, and then JavaScript copied Java. It would have been nice if they had said, "hm... using continue rather than break would make a lot more sense."

It's not a huge deal, and developers can simply avoid switch if they don't like it. My point was merely that "JavaScript's switch sucks, Arc's (case) is awesome."

-----

1 point by thaddeus 5110 days ago | link

a reply for one layer down:

> JavaScript allows arbitrary expressions in switch statements

That's right... looks like it was the test data I was thinking of, which isn't entirely relevant, given you can accomplish the same thing in the end.

> Why make the common case difficult?

A fall-through could be nicer than many breaks. Trying to think of all the scenarios required to make this usable, i.e., do you need a break to get out of a fall-through? Maybe that's why they just went with break. either way, it's a good idea.

-----

1 point by Pauan 5110 days ago | link

The current behavior for switch is simple: it will continue until it finds a break statement. Thus:

  switch (true) {
  case true:
      console.log(1);
  case true:
      console.log(2);
  case true:
      console.log(3);
      break;
  case true:
      console.log(4);
  }
...which outputs 1, 2, and 3. If you wanted to have the same behavior using the continue statement, you would use this:

  switch (true) {
  case true:
      console.log(1);
      continue;
  case true:
      console.log(2);
      continue;
  case true:
      console.log(3);
  case true:
      console.log(4);
  }
As far as I know, this would have exactly the same power as using break; but would be much better suited for the very common case of not wanting fallthru. I think it's much more readable, too: makes it very obvious where it falls through. And it's less error prone... no more strange bugs if you accidentally forget to add in a break; statement.

In fact, in my years and years of programming in JavaScript, I have used if/else blocks many many times (some of which could have been switch statements), and only wanted fallthru a handful of times. I think allowing fallthru is fine, but it should be made explicit, rather than implicit.

The only possible argument I can think of in favor of implicit fallthru is using a switch inside a `for` loop:

  for (var i = 0; i < array.length; i += 1) {
      switch (array[i]) {
      case "foo":
          continue;
      }
  }
...but that's actually ridiculous because I'm fairly sure that breaking out of a loop is far more common than continuing. In which case explicit fallthru would be better with loops.

-----

1 point by akkartik 5110 days ago | link

Argh, we're totally talking about different things. And now this thread's gotten hijacked by switch/case/break C whining :)

Part of the problem was that I too was using the wrong terminology. So fuck the names and terms, when I said 'case expressions' I was referring to wart's (def .. :case ..) syntactic sugar to extend any function. This is why I linked to http://en.wikibooks.org/wiki/Haskell/Control_structures#Equa... earlier in the thread. See how I extend keys in http://arclanguage.org/item?id=14244.

-----

1 point by Pauan 5110 days ago | link

Oh, that makes a bit more sense. Technically speaking, Arc's case expression is sorta like guards, though, in the sense of dispatching to different code depending on a condition. You're right that wart's :case is essentially a super nice version of `case`.

Also, what's wrong with `switch` whining? :P This is a JavaScript forum, right? Oh wait...

-----

1 point by akkartik 5111 days ago | link

You echo several of my frustrations. Exensibility is baked into the design of wart: http://github.com/akkartik/wart

---

Vaporware announcement: I've been trying today to make streams and generators duals. So (read stream) and (yield value) would be ways to bounce control between an input stream and its user.

I started playing with this after rocketnia's observation about accum (http://arclanguage.org/item?id=14198), which name had been bothering me ever since I started using waterhouse's accumulate on a regular basis. "Ooh, so I can replace (accum acc ... (acc x)) with just (as list ... (yield x))? Sign me up."

But what about output streams? I'm still thinking..

-----

1 point by aw 5107 days ago | link

There's a problem with using extend here: extend checks to see if it's going to override the original function first, before the original function has a chance to run. That's not what we want for defining custom callables such as for (my-custom-table 'key). The call in extend to (isa f 'table) would go into an infinite loop trying to call isa.

-----

1 point by rocketnia 5107 days ago | link

In my Penknife draft, there's a calling rulebook, extensible just like any other core rulebook. To resolve the paradox, two callable types are built in: Functions and rulebooks. The core calling behavior tries those two cases first, then finally calls the "rb-core-call" rulebook.

If someone replaces rb-core-call with something other than a core function or rulebook, they'll get an infinite loop, and it'll be their own fault.

I think Arc could easily be patched for something like this: Just change the "Function call on inappropriate object" error message so that it calls an Arc function that raises that error message instead. We could call it 'custom-call, or maybe 'defcall-missing. :-p

-----

1 point by Pauan 5107 days ago | link

Would you mind explaining rulebooks more? Or give a link to where you've explained them in the past?

-----

2 points by rocketnia 5107 days ago | link

"Rulebook" may actually be a bad name for it, 'cause there are more complicated notions of rules. I chose the name because I considered it to be a simplified form of Inform 7's rulebooks, but I might have simplified the "rule" part right out of it.

In my Penknife draft, every function can either return a value, raise an exception, or fail with a failure message. Failure means the function doesn't meaningfully apply in this case, and it lets the caller try something else if they they happened to use failcall to provide an alternative. If they didn't use failcall, it just raises an exception.

A Penknife rulebook can be called as a function. It's represented by a sequence of functions that are called one at a time until one returns a value or raises an error. If they all fail instead, then the whole rulebook fails.

A rulebook is like a series of 'extend cases, but being its own type, it's less encapsulated. There could theoretically be rulebook utilities that let you count the number of extensions, etc. While an implementation of 'extend could do this too, it's a little unheard of.

The exact implementation of a Penknife rulebook is kind of up in the air at this point. I was just using a box with a list of functions inside, but recently I've been considering focusing all of a module's mutation onto one easily swappable place, rather than having it spread out over a namespace and an arbitrary number of rulebooks, global tables, and other such things.

-----

1 point by Pauan 5106 days ago | link

So... that sounds an awful lot like message passing + prototypes. So much so that I'm fairly sure you could create rulebooks as syntatic sugar on top of message passing + prototypes.

And if rulebooks are sugar on top of prototypes, and the built-ins are implemented with message passing...

-----

1 point by rocketnia 5105 days ago | link

My reply to this topic is at http://arclanguage.org/item?id=14347 ("Well, the kind of message-passing...").

-----

1 point by rocketnia 5106 days ago | link

Oh, another okay name for 'custom-call or 'defcall-missing is 'ref, for symmetry with 'sref.

-----

1 point by Pauan 5107 days ago | link

Okay, so my example truly is impossible in pgArc. That's what I already suspected, but it's nice to actually know. Using something like defcall would probably work, though.

-----