Thursday, July 9, 2009

Shipping Container

shipping container buildingImage by dav via Flickr

I had the eyeball ache yesterday. The best part of living in a shipping container is not seeing the world and the jostling and thunderous echoing pounding of sheet metal.
Reblog this post [with Zemanta]

Wednesday, July 8, 2009

This started as a posting about Statistical Causality...

Interesting blog posts back and forth between Don Rubin, Judea Pearl, and Andrew Gelman. I cannot understand the original point of contention between Don Rubin and Judea Pearl. I have no idea what "M-bias", "controlling for pre-treatment versus post-treatment variables" mean.

http://www.stat.columbia.edu/~cook/movabletype/archives/2009/07/disputes_about.html

The Illustrated Sutra of Cause and Effect: 8th...Image via Wikipedia

Happily, the fundamental point of disagreement seems to be getting clear with a comment by Philip Dawid (who offers a very nice on-line document "Principles of Statistical Causality" which covers a lot of concepts in only 94 pages). This is a very clear summary of what is meant by "causal inference":
Like Pearl, I like to think of "causal inference" as the task of inferring what would happen under a hypothetical intervention, say F_E = e, that sets the value of the exposure E at e, when the data available are collected, not under the target "interventional regime", but under some different "observational regime". We could code this regime as F_E = idle.
...
It should be obvious that, even to begin to think about the task of using data collected under one regime to infer about the properties of another, we need to make (and should attempt to justify!) assumptions as to how the regimes are related.
(Emphasis on the turn of phase that caught Andrew Gelman's eye)

What is the nature of the discipline of "statistical causal modeling"? Beats me.

I have the idea of several DAGs (gaps filled in with plausible causal contributors), several multi-dimensional collected sample distributions (gaps filled in with smoothing or naive-Bayesian techniques), several simulations based on plausible natural or un-natural laws (simulations providing multi-dimensional distributions). Between creatures of all three types, we test them against historical data, future data, and intervention experiments. They fight against themselves, and only a few remain.

I have the idea of defining a distribution as the opportunity to open a sample portal to an alternate universe to the same sample stream. You can open sample portals at a certain rate, so the total number of samples available to you has a rate that increases as the square of time.
You desire the sample portals to all have the same correct driving distribution.

Now, obviously, you cannot actually create sample portals to alternate universes. So you create more fake sample portals. Each has the possibility of one or more failure modes.

Taken all together, these form a pessimistic analysis. Human action is inherently optimistic, so, take the opportunity to make explicit biases of personality and human real-world population studies of life outcomes consistent with living a happy and fulfilled life. The third leg of the stool is a model of human effectiveness:
  • rationality,
  • choice,
  • free will,
  • changes in capability,
  • changes in habitual action,
  • goal-directed action,
  • effectiveness,
  • consciousness,
  • self-knowledge/introspection,
  • knowledge of the world,
  • consistency between behavior and professed mental abstractions of desired behavior,
  • and morality -
leading to the outcome of a happy and fulfilled life.

That is what I got right now.

The way to study how "human action is inherently optimistic" is with happiness research (covered in Daniel Gilbert's Stumbling On Happiness, and the sections on happy outcomes in Dan Ariely's Predictably Irrational).

The model of human effectiveness is based on there being only a few chances over time for exercising free-will.

We greatly limit the role of free-will, because human behavior is better explained by:
  • habitual actions,
  • personality,
  • IQ,
  • daily exercised capability,
  • daily exercised responses,
  • environment,
  • etc.
We expect to see meaningful change caused by free-will events over the course of 10 years - just because there are so few meaningful free-will events during those many years.

A sketch of the human brain by artist Priyan W...Image via Wikipedia

Like Julian Jaynes arguing in The Origin of Consciousness in the Breakdown of the Bicameral Mind that consciousness only came into existence 3000 years ago from stresses on growing populations searching and competing for sustaining resources, I don't think that free-will is a capability of all humans since Homo sapiens originated 200,000 years ago. I think meaningful free-will also is a recent phenomenon. And I don't think it is expressed in all people - it only gets exercised under a peculiar stress of personality and environment, when the capability exists.

I reject the idea that free-will is simply random behavior, or the absence of a mechanism for the deterministic prediction of behavior.

It cannot be meaningfully separated from these: rationality, goal-directed action, effectiveness, consciousness, self-knowledge/introspection, consistency between behavior and professed mental abstractions of desired behavior, morality, and other issues.

The reason for the need to consider time periods on the scale of a decade is that free-will events are the residue of actions and behaviors that cannot be explained by simpler means. Because it is a residue, we are only interested if there is a over-riding consistency and progression - because we don't want to give the label of "free-will" to an odd-ball collection of junk.

That's enough for now.

Added: Andrew Gelman explores issues further:

More on Pearl's and Rubin's frameworks for causal inference

He describes the idea of using Minimal Rubin, Full Rubin, Minimal Pearl, Full Pearl as techniques for statistical causality. And this note on the benefit of Rubin's approach:
Be explicit about data collection. For example, if you're interested in the effect of inflation on unemployment, don't just talk about using inflation as a treatment; instead, specify specific treatments you might consider (adding these to the graphs, in keeping with Pearl's principles). This also goes for missing data. ...
I don't understand the section on "Controlling for intermediate outcomes".

Pearl then contributes a long and difficult comment. Gelman and Pearl argue over this: "the correct thing to do is to ignore the subgroup identity" - I just don't understand this at all.
Reblog this post [with Zemanta]

Poopland Duck


Drew this on the back page of the Logitech desktop speaker LS11 Quick-start guide. I like these speakers because they have a headphone jack and over-sized power/volume control knob.

Unlike the image on the Logitech website, they do not hover in space.

USD 19.99
Reblog this post [with Zemanta]

Tuesday, July 7, 2009

Group Selection - Altruism born from warring, genocidal tribes

Article in The Economist June 6th 2009, p. 77. Taking about research of Samuel Bowles (in Science) "Did Warfare Among Ancestral Hunter-Gatherers Affect the Evolution of Human Social Behaviors?"

Science 5 June 2009:
Vol. 324. no. 5932, pp. 1293 - 1298
DOI: 10.1126/science.1168112

(Also work by Mark Thomas "Late Pleistocene Demography and the Appearance of Modern Human Behavior", summarized by NPR "Larger Populations Triggered Stone Age Learning". About population density needed to support technologies, and how population fluctuations caused technologies to be lost, only to be independently discovered later, again and again.)

Obsidian arrowhead.Image via Wikipedia

Group selection takes place by warring, genocidal tribes (a process described as "genetically terminal" for the losers). This group selection was found to be mathematically compatible with a hard "selfish gene" approach to natural selection. With this approach, there is selection for altruistic traits.

Is this the basis for all actual group selection? The more I read, the more I see that a hard "selfish gene" is the correct way to approach natural selection. Is this mechanism of warring, genocidal tribes all we have to make actual group selection?
Reblog this post [with Zemanta]

Searing Acids Of Pain

Original drawing by my daughter. The problems of many acids.

Reblog this post [with Zemanta]

Pajama Railroads

This drawing is a collaboration between my daughter and myself.
Reblog this post [with Zemanta]

Why Dynamic Languages? Why Strongly Typed Languages?

From Reddit commenter "munificent", the best analogy of why productive coders use Dynamic Programming Languages and Strongly Typed Programming Languages (best analogy I have ever seen...):
> > So, the problem is that software developers have poor foresight and a complete lack of self-awareness?

No, the problem is that the success of most software projects can not be predicted.

* {{en}} Picture of Yukihiro Matsumoto, creato...Image via Wikipedia

When you go camping, you just pitch a tent. When you're moving, you build a house. With many software projects, it's impossible to tell beforehand if you're just going camping or will be taking permanent residence.
It doesn't make sense to spend a month building a new house every time you go somewhere, if 90% of the time you end up leaving after a couple of days.

What does make sense, and is becoming a common pattern is this:

1. Stake out a new territory and pitch a tent (v1 in a dynamic language)
2. If it turns out to be a hospitable place, start building a new house next door (migrate back-end services to more strongly-typed languages).
3. Once that's done, ditch the tent and move in (move the production system over to the new back-end).

See: twitter (Ruby -> Scala), Facebook (PHP -> Erlang), etc.

هذا و توّ الليل ماراح نصفه . .Image by ThaRainbowRaider. via Flickr

Yes, a lot of academic computer science completely ignores the economic constraints of software engineering.


Reblog this post [with Zemanta]