Research Abstracts - 2007
Data-Purpose Algebra

Chris Hanson & Gerald Jay Sussman

Data is often encumbered by restrictions on the ways it may be used. These encumbrances may be determined by statute, by contract, by custom, or by common decency. Some of these restrictions are intended to control the diffusion of the data, while others are intended to delimit the consequences of actions predicated on that data.

The allowable uses of data may be further restricted by the sender: "I am telling you this information in confidence. You may not use it to compete with me, and you may not give it to any of my competitors." Data may also be restricted by the receiver: "I don't want to know anything about this that I may not tell my wife."

Although the details may be quite involved, as data is passed from one individual or organization to another and combined with other data, the restrictions on the uses to which it may be put change in ways that can often be formulated as algebraic expressions. These expressions describe how the restrictions on the use of a particular data item may be computed from the history of its transmission: the encumbrances that are added or deleted at each step. A formalization of this process is a Data-Purpose Algebra description of it.
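As an illustration only, the idea of computing restrictions from a transmission history might be sketched as a fold over the steps. This is not the authors' formalism: the `Step` type, the purpose names, and the rule that a step's additions apply before its deletions are all assumptions made for the example.

```python
from dataclasses import dataclass
from functools import reduce


@dataclass(frozen=True)
class Step:
    """One transmission step (hypothetical representation)."""
    added: frozenset = frozenset()    # encumbrances added at this step
    deleted: frozenset = frozenset()  # encumbrances deleted at this step


def apply_step(forbidden, step):
    # Assumption: additions are applied first, then deletions.
    return (forbidden | step.added) - step.deleted


def encumbrances(history, initial=frozenset()):
    """Fold the history into the set of purposes currently forbidden."""
    return reduce(apply_step, history, initial)


# Hypothetical history: the sender imposes two restrictions, an
# intermediary adds a third, and a later step lifts that third one.
history = [
    Step(added=frozenset({"compete-with-sender", "share-with-competitors"})),
    Step(added=frozenset({"marketing"})),
    Step(deleted=frozenset({"marketing"})),
]

forbidden = encumbrances(history)


def may_use(purpose):
    """A purpose is allowed when no surviving encumbrance forbids it."""
    return purpose not in forbidden
```

Here the restrictions are modeled as a plain set of forbidden purposes, the simplest structure in which "added or deleted at each step" has a direct meaning. A real algebra would likely need richer terms, for example restrictions that depend on who the recipient is.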

The purpose of this project is to build systems that track the provenance of data as it is transmitted, aggregated, and distilled, and that compute the allowable purposes for the use of the results at each step. This may be thought of as an end-to-end protocol for accounting for the use of the data.
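A minimal sketch of what such tracking could look like follows. It is an assumption-laden toy, not the project's design: the `Item` type, the `transmit` and `aggregate` helpers, the sample values, and in particular the rule that aggregated data may be used only for purposes allowed by every input are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    """A data value carried together with its allowed purposes and history."""
    value: object
    allowed: frozenset      # purposes for which this item may be used
    provenance: tuple = ()  # record of how the item was produced


def transmit(item, sender, receiver, allowed_by_sender=None):
    """Pass an item on; the sender may narrow the allowed purposes."""
    allowed = item.allowed
    if allowed_by_sender is not None:
        allowed = allowed & frozenset(allowed_by_sender)
    step = ("transmit", sender, receiver)
    return Item(item.value, allowed, item.provenance + (step,))


def aggregate(a, b, combine):
    """Combine two items.

    Assumption: the result may be used only for purposes that both
    inputs allow (set intersection).
    """
    step = ("aggregate", a.provenance, b.provenance)
    return Item(combine(a.value, b.value), a.allowed & b.allowed, (step,))


# Hypothetical example data.
salary = Item(52000, frozenset({"payroll", "audit"}), (("source", "hr"),))
hours = Item(40, frozenset({"payroll", "scheduling"}), (("source", "timeclock"),))

sent = transmit(salary, "hr", "finance", allowed_by_sender={"payroll", "audit"})
hourly = aggregate(sent, hours, lambda s, h: s / 52 / h)
```

Because every derived item carries both its allowed purposes and the provenance of its inputs, the question "may this result be used for purpose P, and why?" can be answered at any step from the item alone.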


