Applying principles of natural language linguistics to programming languages
What follows is a mildly revised version of a final paper from the linguistics pragmatics course I took last spring.
Programming language theory and theoretical linguistics are two fields
with overlapping histories. However, most programming languages are
not designed to behave like natural language. This post identifies
research in the pragmatics of natural language that programming
language theory researchers could adapt, and vice versa.
Code is for people
To effectively synthesize these fields, we need to remember that
programming languages are for people and not computers. People need to
be able to effectively convey ideas in programming languages, as well
as effectively understand ideas in programming languages.
A common example of this is a programmer working on a team: other
people will look at the code and need to understand it in order
to know how to use it. A less common example, though just as
important, is the diachronic situation: after a long period of time
the programmer returns to the code they wrote and needs to be able to
understand what they previously wrote. Since this is the case,
Grice's cooperative principle applies.
On the other hand, the language is restricted in that it needs to be
convertible into a language the computer can understand and evaluate
efficiently. This limits the kinds of grammars available and the
flexibility of the language designer, but it does not prevent
pragmatics from appearing in the design.
Each of the Gricean maxims can apply to programming languages:
- Quality: The code should work (in most cases). This is usually an
assumption.
- Quantity: Do not write too much (or too little). If too much is
written, it wastes the computer's and, more importantly, the
programmer's time. If too little is written (as in the sed
programming language and 'one-liners'), the intent and function of
the code are unclear.
- Relevance: Do not write unnecessary things. Code that is not
needed for the intent of the program should not be included in the
program.
- Manner: Do not obfuscate. This could be as simple as unclear
variable names, or as complex as using obscure syntactic forms.
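As a small illustration of flouting Manner and Quantity, compare two hypothetical Python versions of the same computation (neither is from any real codebase):

# Flouts Manner (and arguably Quantity): the intent is opaque.
f = lambda l: sum([x * (1 - x % 2) for x in l])

# Cooperates with the reader: same behavior, clear intent.
def sum_of_even_numbers(numbers):
    return sum(n for n in numbers if n % 2 == 0)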
The Pragmatic Programmer applies pragmatism (not pragmatics) to the
workplace practices of software engineers, so it is not as related as
its title suggests. However, it does provide some tips to consider
when designing and writing software, and these tips can be cast in
the frame of the Gricean maxims.
One tip is the DRY principle, which stands for 'Don't Repeat
Yourself'. This corresponds to Quantity, since a lack of repetition
results in a shorter program, but also to Manner, since repetitive
code can be unclear to subsequent programmers, as the sketch below shows.
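A minimal, hypothetical Python sketch of DRY: the repeated tax formula is extracted into a single named function, shortening the program (Quantity) and making the intent clear (Manner):

book_price, food_price = 10.0, 25.0

# Before: the same formula appears twice, so any change must be made twice.
book_total = book_price + book_price * 0.08
food_total = food_price + food_price * 0.08

# After: the formula lives in one place, under a descriptive name.
def with_sales_tax(price, rate=0.08):
    return price + price * rate

book_total = with_sales_tax(book_price)
food_total = with_sales_tax(food_price)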
Another way of thinking about Quantity and Manner is in terms of
efficiency. In programming, we can think of efficiency in a number of
ways: code complexity, time complexity, space complexity. These
correspond to the engineering idiom “Good, Fast, or Cheap: Choose
two!”
Another tip from The Pragmatic Programmer suggests the separation of
concerns. This principle states that each section of code should
only handle one concern. Intuitively, this corresponds to Relevance.
Quality is a harder maxim to identify, since programs are generally
considered correct if they run correctly. Some programming languages
make stronger or weaker claims about their correctness. Quality seems
to be the most heavily weighted maxim for programming languages; the
others can be flouted to satisfy it.
Determiners
Most programming languages do not have a notion of determiners. For
example, even in object-oriented languages where the class or prototype
of an object is also an object, we cannot use a determiner with the
class or prototype of the object to get an instance of that class or
prototype. However, an analogy to determiners and adjectives can be
made in database query languages and in object-relational mapping,
which provides a bijective map between objects and rows in a
relational database.
Let's consider the indefinite noun phrase, “a big red dog”. In SQL, a
database query language, this can be expressed as:
SELECT * FROM Dogs WHERE color='red' AND size='big' LIMIT 1;
The semantics of this statement are similar to the English. In SQL,
there are Tables and Rows. Tables correspond to the general noun,
e.g. dog, whereas Rows correspond to an individual. This statement
just gets some row from the table where the adjectives specified are
satisfied.
For definite determiners, such as “the big red dog”, we want to make
sure there is only one result which satisfies those conditions. In SQL
this can be expressed as:
SELECT * FROM Dogs
WHERE color='red' AND size='big'
AND (SELECT COUNT(*) FROM Dogs
     WHERE color='red' AND size='big') = 1;
Note that this is purely a semantic translation into SQL. What if
there exist other big red dogs, but we can infer from context which
big red dog was being referred to? Pragmatic reasoning should improve
the result, and this should be possible in a programming language: if
there is only one member of a class with those properties in the
current namespace, then that is the referent, and if there isn't one,
check the next enclosing namespace. This is approximately how natural
language works.
There is a slight wrinkle in this suggestion, but it can be resolved
consistently. When I said “a big red dog”, did it create a new object
with those properties or utilize an existing big red dog? If it
created a new object, then later when I say “the big red dog” it
refers back to that same object. However, if it utilized an existing
big red dog (or if it failed to point to anything), then “the big red
dog” escapes out of the current namespace and looks around, trying to
find somewhere else to provide an object to refer to. If we were
walking outside, it might be some dog we see, but there is also
certain background knowledge in deeper namespaces which comes in handy
even when there is no local referent. For example, with “the big red
dog”, you may think of Clifford, the big red dog. This would be
pragmatic reasoning, and it would be implementable in a programming
language.
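As a rough sketch of how such definite reference could work in a programming language, here is a hypothetical Python resolver (the function the and the dict-based namespaces are inventions for illustration): it returns the unique matching object in the innermost namespace and escapes outward otherwise:

# Namespaces are modeled as a list of dicts, innermost first.
def the(predicate, namespaces):
    for scope in namespaces:
        matches = [obj for obj in scope.values() if predicate(obj)]
        if len(matches) == 1:
            return matches[0]  # unique referent found in this scope
    raise LookupError("presupposition failure: no unique referent")

class Dog:
    def __init__(self, size, color):
        self.size, self.color = size, color

local = {"rex": Dog("big", "red"), "fifi": Dog("small", "white")}
background = {"clifford": Dog("big", "red")}

# Resolves to rex: the local namespace has exactly one big red dog.
# With rex removed, the search would escape to the background
# knowledge namespace and find Clifford instead.
dog = the(lambda d: d.size == "big" and d.color == "red", [local, background])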
Anaphora
Some programming languages make use of anaphoric lexemes. However, the
use of anaphora in such languages is very limited. This section explores
current anaphoric structures in programming languages.
Anaphoric lambdas
Anaphoric lambdas are used to create recursive anonymous functions
(Let Over Lambda, Chapter 6). Lambda expressions are used as anonymous
functions in many programming languages, but it is difficult to
represent recursive functions anonymously.
Consider the following. We would like to represent the factorial
function anonymously. Recall that $n! = n * (n - 1) * \dots * 2 * 1$. So
we write $\lambda n . n * (n - 1) * \dots * 2 * 1$. However, this does
not work, since the ellipsis represents recursion. Instead, we provide
an anaphor self which can be used within the lambda expression to
represent recursive calls to itself. With this in mind, we can write
the following anonymous recursive function:
$$factorial = \lambda n . \left\lbrace
\begin{array}{l l}
n & \textrm{if $n = 1$} \\
n * self(n-1) & \textrm{if $n > 1$}
\end{array}\right.
$$
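In Python terms, one might simulate this anaphor with a small wrapper that feeds the function to itself; the names alambda and self_ below are hypothetical, chosen only for illustration:

# An anaphoric lambda: the wrapper supplies the function itself as a
# hidden first argument, so the body can recurse while staying anonymous.
def alambda(f):
    def wrapped(*args):
        return f(wrapped, *args)
    return wrapped

# self_ plays the role of the anaphor inside the lambda body.
factorial = alambda(lambda self_, n: n if n == 1 else n * self_(n - 1))

print(factorial(5))  # 120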
Another use of self (or this) occurs when you want to refer to the
object wrapping the current namespace. For example, when defining a
class in Python, you use self to refer to the eventual instance of
the class:
class Location: pass

class Person:
    location = Location()

class Dog:
    size = str()
    color = str()
    name = str()
    owner = Person()
    location = Location()

    def run_to(self, target):
        # self refers to this dog instance, not to run_to itself
        self.location = target.location

    def fetch(self, stick):
        self.run_to(stick)
        self.run_to(self.owner)
In other languages, the self in the methods would refer to the
method itself, rather than the instance object of the class. This
is more common in prototypal object-oriented languages (as opposed to
classical ones), due to how prototypes work. For example, JavaScript,
which uses this in place of self, has this issue. As a workaround to
this (no pun intended), a common idiom is to create a that variable
which refers to the object above the current one. In Python syntax,
but with JavaScript scoping:
def a(x):
    that = this
    def b(y):
        # inside b, `this` points to b, not a
        print( this === a )  # false!
        print( this === b )  # true
        # but `that` points to a!
        print( that === a )  # true
The last notion of anaphor considered here is called the cascade,
or method chain. Originally implemented in Smalltalk, the cascade
allows for programming language constructions similar to “the owner
threw the stick, told the dog to fetch it, and watched the dog”. Here
is that sentence translated into Smalltalk, where the semicolons send
each successive message to the same receiver:
EmilyElizabeth threw: stick;
    tell: clifford to: #fetch the: stick;
    watch: clifford
In Smalltalk, the cascade sends every message to the original object
(EmilyElizabeth, in our case). Mainstream languages simulate this with
method chaining, in which each method returns its receiver rather than
some other object or an attribute of the object. This may be more
readily understood in a more mainstream syntax:
EmilyElizabeth.threw(stick)
.tell(Clifford, fetch, stick)
.watch(Clifford)
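A minimal Python sketch of this idiom, with hypothetical class and method names: each method returns self, which is what allows the calls to chain:

class Owner:
    def threw(self, thing):
        print("threw", thing)
        return self  # returning self enables the next call in the chain

    def tell(self, animal, action, thing):
        print("told", animal, "to", action, thing)
        return self

    def watch(self, animal):
        print("watched", animal)
        return self

EmilyElizabeth = Owner()
EmilyElizabeth.threw("stick").tell("Clifford", "fetch", "stick").watch("Clifford")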
Hoare triples and presupposition
Another analogy can be made between Hoare triples and presupposition.
Hoare proposed a method of verifying program correctness using the following:
- Pre-condition (Set of possible states)
- Statement(s)
- Post-condition (Set of possible states)
To use Hoare triples, one specifies the expected post-condition for
the program and works backward through each statement to find the
needed pre-condition for the program. Each statement influences the
conditions by changing the set of states, for example, by
assignment. After an assignment statement, everything that was
previously true of the right-hand side of the assignment is now true
of the left-hand side as well.
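For example, working backward through the assignment x := y, a desired post-condition about x becomes a pre-condition about y (a textbook-style triple, shown here for illustration):
$$\{ y > 0 \}\ x := y\ \{ x > 0 \}$$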
The pre-conditions are similar to presuppositions, and this idea later
influenced the dynamic semantics approach to presupposition.
Type inference
In programming languages, typing is controversial, with some
preferring strong static typing for reliability and efficiency, and
others preferring weak dynamic typing for its flexibility.
However, ignoring the social aspects, we may still be able to find a
natural language analogy to type inference. Type inference allows the
computer to infer certain properties about the various objects under
consideration. For example, consider the following subset[^1] of Haskell:
[^1]: This is not actual Haskell, particularly the + operator. I
simplified the language for the sake of demonstrating inference,
but the principles apply to the larger language, too.
x = 3
We would like to assign each entity a type. We find x and we cannot
give it a type. We find 3 and we know it is an integer. Then we can
assign x to be an integer, since it is declared to refer to 3.
inc a = a + 1
Here, there are more things that we need to infer. We see that inc
takes an argument, so we know it must be of a function type, with some
input type and some result type. We see a and we know that it must be
the same as the input type of inc, but we cannot do anything else
yet. We see 1 and know it is an integer. We see + and know that plus
takes two integers and that it takes in a, so a must be of type
integer, and inc must take in and return integers.
g f i = (f i) + 1
Here, we see g and see that it takes two arguments, so it must be a
function that takes some argument and returns another function, which
takes another argument and returns a result. We see f and i and unify
them with the respective arguments in the type of g, but cannot do
anything else yet. We see (f i) and know that f must be a function
from the type of i to some other type. We then see the 1 and know that
it is an integer. We then see the + and know that it takes two
integers and returns an integer. Then we know the return type of f is
integer, and the return type of g is integer. In summary:
g :: (a -> Integer) -> a -> Integer, where a is a polymorphic type
corresponding to i.
This analysis can be done either during compilation (static) or at
runtime (dynamic). Static type inference is similar to what the
speaker of natural language does in sentence formation using
semantics. Dynamic type inference (for example, duck typing) is
similar to what the hearer does using pragmatic inference.
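As a sketch of the dynamic side, consider duck typing in Python (the class and method names here are hypothetical): no type is declared for animal, and whether the call succeeds is resolved at runtime by whether the object happens to support the method, much as a hearer resolves reference from context:

class Dog:
    def speak(self):
        return "woof"

class Duck:
    def speak(self):
        return "quack"

# No declared types: any object with a speak() method is acceptable.
def greet(animal):
    return animal.speak()

print(greet(Dog()))   # woof
print(greet(Duck()))  # quack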
Contracts
Contracts, an extension to type theory inspired by Hoare triples,
provide additional means of software verification. Contracts are
typically used to verify the pre- and post-conditions of a function,
either by checking the arguments and results at runtime or by
statically unifying the contracts of the various functions with their
calling conditions. For a long time, this was difficult to do in
languages with first-class functions, since there would be a recursive
check with no base case. Findler et al. figured out how to include the
base case, extending contracts with the notion of higher-order
contracts.
Instead of typing variables, contracts place limitations on the
properties of the variables. We can use the correspondence between
sets and Boolean functions to efficiently represent and check these
conditions: since conditions are just sets of possible states, each
contract can take a state and return a Boolean value indicating
whether the state is in the set or not.
Contracts therefore must have the contract Any -> Boolean. The Any is
where the base case comes in. Since Any is a contract, it also has the
contract Any -> Boolean, but Any is defined to always return true, so
when checking the contracts of contracts, it ends up here. There is
also a None contract which always returns false, but it is of little
practical use.
There are also contract combinators, which correspond to the set and
Boolean combinators. For example, the contract -2 or Positive takes
the -2 contract, which checks for equality with -2, and the Positive
contract, which checks for a positive number, and unifies them into a
composite contract. This is equivalent to the set union or Boolean or
operators.
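A rough Python rendering of this view of contracts, with invented names (contracts as predicates from any value to a Boolean, plus an or-style combinator; NoneC avoids clashing with Python's built-in None):

# A contract is just a predicate: it takes any value and returns a Boolean.
def Any(value):
    return True   # the base case: always satisfied

def NoneC(value):
    return False  # never satisfied; of little practical use

def Positive(value):
    return value > 0

def equals(constant):
    return lambda value: value == constant

# A combinator corresponding to set union / Boolean or.
def either(*contracts):
    return lambda value: any(c(value) for c in contracts)

minus_two_or_positive = either(equals(-2), Positive)
print(minus_two_or_positive(-2))  # True
print(minus_two_or_positive(5))   # True
print(minus_two_or_positive(-1))  # False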
These contract combinators, along with a minimization function, may
provide a form of pragmatic reasoning for programming languages. A
minimization function would take these contracts and simplify
them. For example, Number and Integer simplifies to Integer, since
anything in Integer is also in Number.
Static and dynamic binding
In programming languages, scope refers to the method of resolving
variables to values. The two primary methods are dynamic and static
scope. Both methods maintain a stack of namespaces, with a new one for
each block: if a variable is not found in the current namespace, it is
searched for up the stack. Where they differ is the searching
method. Dynamic scope searches through every activation record on the
call stack, in the order they were pushed at run time. Static scope
only examines the records of the blocks that lexically enclose the
current block in the source text.
There does not seem to be much of a difference at first glance, but
the implications are numerous. Static scope is unintuitive to
implement, but intuitive to use, understand, and reason about within a
programming language, and thus very few programming languages use
dynamic scope.
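Python is statically scoped, so a minimal sketch (with hypothetical function names) can show the contrast; the comments note what a dynamically scoped language would do instead:

x = "global"

def shadow():
    x = "local"     # a new binding for x in shadow's namespace
    return report()

def report():
    # Static scope: x resolves in the namespace where report was
    # defined, so this finds the global binding.
    # Under dynamic scope, it would find shadow's x ("local"),
    # because shadow's activation record is on the call stack.
    return x

print(shadow())  # prints "global"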
This notion of scope is similar to Karttunen's notion of admittance,
and to later dynamic semantics approaches. Perhaps the typology of
programming languages can be used to inform our understanding of name
resolution in natural language.