Draft presented at Visual Representations and Interpretations 1998, University of Liverpool, Sept. 22-24 1998. Also available: the final version of this paper, to appear in volume published by Springer-Verlag, 1999.

Words and pictures -- Goodman revisited

John Lee
Human Communication Research Centre
and EdCAAD, Dept. of Architecture
University of Edinburgh
2 Buccleuch Place
Edinburgh EH8 9LW

J.Lee@ed.ac.uk

(All section headers link back to this table)

Goodman's theory of notation

This discussion aims to consider varous aspects of the distinction between linguistic and pictorial representations. It has been suggested (Lee 1997) that the notion of the pictorial is to be identified through the idea of an interest-relative structure-mapping. This depends to some extent on rejecting certain elements of Goodman's well-known views about pictorial representation, which deserve a deeper examination. Here, some of the relevant issues are briefly opened up as a prelude to a more extended discussion.

The locus classicus of comparative study between graphical and linguistic systems is Nelson Goodman's Languages of Art (1968). Goodman is concerned with a general issue about how representation works -- how marks on paper are related to various kinds of things in the world[1] -- in a range of cases such as pictures, music and other kinds of notation. He therefore seeks to establish what distinguishes a "notational symbol system" from other kinds of symbol system. His approach forms the prototype for most later formal theories in this area, in as much as he considers even pictures to be symbol systems which represent not in virtue of any notion such as resemblance, but due to their being subject to certain systematic rules of use.

According to Goodman, there are five basic conditions required for a symbol system to be notational.

1. It must consist of symbols (utterances, inscriptions, marks) which form equivalence classes (characters) on the basis that they can be exchanged without syntactical effect. Alphabets are a prototypical example -- any "a" is as good as any other; they are "character-indifferent", and the characters have to be disjoint, so that no mark qualifies as an instance of more than one character. In general, Goodman takes compound inscriptions (e.g. sentences) to be characters as well.

2. Characters have to be "finitely differentiable" (or "articulate") in the sense that their disjointness is feasibly testable, which rules out e.g. "dense" systems where any two (ordered) characters have another between them.

3. Notational systems must be unambiguous, so that the extension (which Goodman calls the "compliance-class") of an inscription is invariant with respect to time, context, etc.

4. The compliance-classes of all characters must be disjoint. (Also, the system will ideally be non-redundant.)

5. Compliance-classes must also be finitely differentiable. Thus, for example, any system which is "semantically dense", in that its compliants form an ordering such that any two have another between them, is excluded.

Goodman elaborates these points in relation to clocks and pressure gauges. These measure quantities that are infinitely variable, so that the semantic domain can always be seen as dense, and if there are no marks on the dial, then there is no syntactic differentiation of characters, so the representation system is clearly non-notational. It can become syntactically notational if, say, dots are distributed around the dial and each is taken to be the centre of a disjoint region such that the pointer appearing anywhere within that region counts as an inscription of a certain character. If the ranges of pressure correlated with these regions are also disjoint (and articulate, of course), then the system meets the semantic requirements as well, and hence is simply a notation. On an ordinary clock face, the hour hand is typically used notationally in this way, whereas the minute hand may be seen as marking the absolute elapsed time since the passing of a particular mark, and hence is non-notational.

Diagrams, one might think, are typically non-notational. Goodman observes that many topological diagrams are in fact entirely notational. This also applies e.g. to many drawings used in architecture and design, where although there may be a non-notational impression of form, measurements etc. are always given and the use of the drawing becomes mainly notational. Ordinary road maps are a typical example of mixed diagrams, with both notational and non-notational aspects. Where non-notational, Goodman observes, diagrams are equivalent to two-dimensional models, if we narrow the latter term (which in general can mean "almost anything from a naked blonde to a quadratic equation") so that it excludes descriptions and samples. Models or diagrams of molecules, say, are usually entirely notational; others range all the way to being entirely non-notational.

Goodman approaches the difference between diagrams and pictures by introducing a further notion of "repleteness". A symbol is relatively replete if a relatively large number of its properties are involved in its identity as a symbol; something is more a picture, and less a mere diagram, if there is less about it that can be changed without making it into a different picture. This concept recieves more detailed discussion below.

Goodman's general view is summarised as follows:

Descriptions are distinguished from depictions not through being more arbitrary but through belonging to articulate rather than to dense schemes; and words are more conventional than pictures only if convention is construed in terms of differentiation rather than of artificiality. (230-231)

According to his own account, however, Goodman is not here trying to define the pictorial. Writing much later, in "Representation re-presented" (Goodman & Elgin 1988, ch. VIII), he says:

Nowhere in my writing to date have I proposed a definition of depiction, but have only suggested that the everyday classification of symbols into pictures and nonpictures is related in an important way to the line between symbols in a dense or 'analog' system and those in a finitely differentiated or 'digital' system. Being analog seems perhaps a necessary condition for a depiction, but not a sufficient one; an ungraduated thermometer indicates temperatures in an analog system but does not, in common parlance, picture them. (Goodman & Elgin op. cit., 123)

This characterisation is then sharpened up somewhat by noting that the distinction between analog and digital does not depend on the semantics of the system. Considering only the syntactic aspect (called a scheme, where clearly a scheme, being susceptible of having different denotations assigned to it, can belong to more than one system), Goodman notes that digital and analog schemes can be categorised on the basis of differentiation among the symbols in the scheme. Goodman is thus led to claim that the pictorial can be distinguished from the verbal on a purely syntactic basis, despite the apparently paradoxical facts that "all symbols belong to many digital and analog schemes", and "some schemes consisting entirely of pictures ... are digital" (130). The key to resolving this paradox is that one has to consider the comprehensive or full scheme for a whole language (e.g. English) or pictorial system (e.g. our presystematic notion of what a picture is).

Symbol systems in use

There is a tension between this account from ch. VIII of Goodman & Elgin (op. cit.) and ch. VII of the same book. In ch. VII[2], the point is hammered home that our competence to understand novel representations using some system cannot in practice be accounted for on the basis of syntax and semantics alone, scorning "a pair of related misconceptions: ... the conviction that understanding a symbol is an all-or-nothing affair [and] that a symbol has a single, uniquely correct interpretation" (119). Invariably, contextual and often background knowledge is brought into play. "Literal" meaning is ill-defined; metaphor is rife. Language use does not depend simply on the application of rules, and picture use does not depend on our capacity for visual recognition of resemblances[3].

This vehement argument begins to call into question the very existence, or at least definiteness, of the system of rules; the syntax and semantics. The identification -- and hence identity -- of a word, or its location in a grammatical category, becomes open to question. If we look back at Goodman's approach to defining a syntax, we note that it depends on discriminable marks that fall into equivalence classes and are interpreted unambiguously. In fact, few symbol systems in practical use will meet these criteria, and the observations in the previous paragraph serve to emphasise that even when they may appear to this is likely to be an illusion. How, in fact, are the relevant equivalence classes identified? -- By the patterns of use that the symbols are subject to, e.g. what can be exchanged "without syntactical effect". But such effects can only be identified on the basis of a certain amount of theorising, which in generating the distinction between syntax and semantics (and that which is neither) departs from the reality of practice where context and relation to experience are everything. Any distinction so generated is surely to be regarded as bounded and perhaps temporary, certainly subject to revision in the face of different kinds of usage.

In these circumstances, can we really speak of a comprehensive symbol scheme? Difficult as this must be for the symbols of a language, it seems still more so for those constituting a pictorial system. As Goodman himself emphasises, one and the same picture may appear in one situation as a digital character, in another as an analog picture. It seems manifestly implausible that we can tell which is which on purely syntactic grounds, because this requires us to establish when the picture can be substituted by another; and even if this can be found out from an agnostic scrutiny of patterns of usage, it surely still depends on what the picture is taken to represent. On the one hand, it is deeply problematic to identify the system that is at hand when any symbol is being considered; on the other hand, as far as pictures are concerned it appears that when used analogically each is a unique exemplar of a symbol and hence, as Elkins (1993) observes, that "there is very little sense in calling non-notational images 'systems'" (361).

A defence of the syntactic approach is mounted by Scholz (1993, 101-2) on the basis that pictures are common enough which do not denote at all -- e.g. pictures of fictional objects. We can accept this without finding it very helpful. In all symbol systems there's a sense in which what something means is distinct from the question of whether anything corresponds to this. Elgin (1993, 135), responding to Scholz, makes a similar point in observing that reference, as understood by herself and Goodman, encompasses more than denotation, including e.g. exemplification, expression and allusion. For these or other reasons, we surely have to insist that symbols which fail to denote "real world" objects are not thereby shown to lack interesting semantic properties; but also it is hard to see that syntactic properties alone can be enough to distinguish pictures from other symbols.

Goodman worries that

The pictorial is distinguished not by the likeness of pictures to something else but by some lack of effective differentiation among them. Can it be that -- ironically, iconically -- a ghost of likeness, as nondifferentiation, sneaks back to haunt our distinction between pictures and predicates? (131).

The ghost has some substance. Nondifferentiated pictures are not necessarily "like" each other in the sense that they visually resemble each other, but rather in that they have similar uses; and though this use may not be identified through their likeness to something else, it seems difficult to disentangle from their reference to something else.

Structure mappings

If we accept this, we are thrown back once again into the difficult area of determining what is distinctive about the way pictures, as compared with words, secure reference to their objects. We accept that likeness is not, in any simple sense, the answer here, and nor is recognitional capacity (as proposed by Schier 1986; cf. discussion in Lee 1997). The notion of structural mapping, which goes so naturally with the notion of analog (and analogical) representation, seems the most promising direction in which to seek progress.

It can be said that any formal semantics is based on a structure-mapping. Wittgenstein's so-called "picture theory of meaning" is a prototypical way of presenting the semantics of natural language as a relation between the structure of the linguistic expressions and the (logical) structure of the world. More modern versions of the story use mappings between set-theoretic models or algebraic signatures to achieve a similar result. What is emphasised by Wittgenstein's later work, however, is that there's no definitive, given way of doing the mapping. Various kinds of symbol systems come into being and acquire such mappings only by virtue of being used by communities of people for various, typically communicative ends. Conventions evolve that "standardise" to some extent the ways in which this is done, so that people can usefully generalise their understanding from one case to another, but there is always a good deal of latitude. The organisation of symbols into systems emerges from the development of these conventions, but then it also emerges that symbols and systems have many different kinds of properties at different levels of structural abstraction. Not only that, but there are different ways of structuring the "world" onto which symbol structures are mapped: it can be subjected to different schemes of conceptualisation (what Goodman might think of as different "ways of worldmaking"), some of which may be more conventional than others. Following Gurr (1998) and Gurr et al. (in press) we call these abstract world-representations "a-worlds".

The upshot is that we have a mapping between two structures (a-worlds) that are susceptible of the same general kind of formal description, which allows us to examine particular properties of the mapping. One property that seems to be important has been called systematicity (cf. Gurr et al., op. cit.). A mapping between two structures is systematic, crudely speaking, when the mapping involves and preserves properties and higher-order properties (i.e. properties of properties, such as transitivity etc.) that hold among the entities mapped. Thus a family tree can be based on a systematic mapping in that connections by lines represent parenthood relations (which are both intransitive), whereas being above represents being an ancestor of, which are transitive relations. If lines to represent parenthood were drawn in random directions[4], the diagram would still in principle be usable, but a number of useful topological features of trees would no longer be shared by the diagram, and e.g. ancestorhood would have to be inferred by following multiple parenthood links, rather than being represented directly. Relative to an a-world in which the ancestorhood relation is explicit, this diagram would be less systematic than the tree. Systematicity of this kind is important where diagrams are used for reasoning; but it is also relevant to issues of depiction.

Note here that systematicity is a property of the relation between a-worlds, and not of the abstractions themselves. If both a-worlds are very "flat" and contain only first-order relations, then a mapping that only maps these relations may still be maximally systematic (i.e. isomorphic at all levels). We may feel that a set of parenthood relations just inevitably induces the ancestorhood relation. However, this remains a feature of the domain that we might not have included explicitly in our abstraction; in which case its omission is no fault of a diagram intended to communicate that abstraction. Arguably in such a case the tree, with its tendency to be read as illustrating a transitive relation, would be implying too much.

Structure and repleteness

Systematicity can be compared, and to some degree contrasted, with Goodman's notion of relative repleteness. The latter is defined (Goodman 1969, 229f) in terms of a distinction between features of symbols that are constitutive or contingent in a given scheme, i.e. the features that are relevant to distinguishing between symbols. For a given diagram, it might be that most of its geometrical features are irrelevant, hence contingent, and can be varied without affecting its identity, provided that the topology is maintained. In that case, the diagram is less replete than a picture where (Goodman suggests) changing almost any detail will turn it into a different picture. It might thus be argued that the family tree is more replete than the diagram where arrows point in all directions, since the directions of the arrows are constitutive in the one, and can be inconsequentially changed (or changed more) in the other. Since repleteness is a very "flat" notion, in that it relates only to the surface features of a symbol (its first-order properties), it seems somewhat less useful in explanatory terms than the systematicity of a proposed mapping. However, in another sense it might be thought a fuller notion in that it is not apparently relativised to the construction of some particular pair of a-worlds. Being supposedly syntactic, it can be evaluated by simply looking at whether a particular diagram, seen as a symbol, just has more constitutive properties. A picture like the Mona Lisa evidently has far more constitutive properties than a tree diagram. But here we are returned to our earlier difficulty of determining what seen as a symbol might mean. How can one make sense of this, especially for analog symbols, in purely syntactic terms? In fact, characterisation of a range of items, e.g. marks on paper, as a symbol scheme amounts to defining the a-world on one side of a semantic mapping and, as Goodman observes, different such schemes will treat the same marks very differently. In general, and especially for analog schemes, this procedure is only coherent in relation to some other a-world onto which a mapping will be defined. What systematicity requires is that wherever a scheme is relatively more or less replete, so will have to be the a-world description of the domain it represents. The smile of the Mona Lisa is merely contingent if her image is treated as a symbol for any girl -- the symbol has fewer constitutive properties. Although it may be true, in principle, that a scheme with this syntax can be described purely in terms of those properties, it is clearly neither feasible nor useful to do so without adverting to the intended use as a representation of arbitrary girls.

We said: "especially for analog schemes". Repleteness, as Goodman uses it, seems to apply only to analog schemes, but it can also be considered in relation to notations, such as text. Features like spatial layout seem clearly able to have a function. Petre and Green (1992) discuss the concept of secondary notation. Where there exists a well-defined diagrammatic system, diagrams may often be constructed which go beyond the defined system -- prototypically, items in an electronic chip design may be grouped by experienced designers in ways that indicate useful facts about their relationships. By the standards of the simplest parenthood abstraction, use of the vertical direction to induce ancestorhood in family trees could be seen as a case of secondary notational use of the arrow-based representation. However, it would always be possible to define a new a-world with respect to which the secondary notation is well-defined and hence now "primary". This would also be a system entailing a scheme in which more properties were relevant to symbolic identity, and hence more replete. Though Petre and Green speak of diagrams, the idea of secondary notation appears to cover aspects of text, as in the issue of spatial layout raised above. Of course, natural language is not a well-defined system, so let's consider computer programming languages. These are very commonly defined without regard to the nature of the "white-space" characters between the various lexical items, but whether a character is a space, a tab or a newline has a dramatic effect on the visual appearance of the program code (text), as normally presented. The resulting layout is crucial to the usability of the text for a human reader, precisely because there is a relationship, though it may be intuitive, vague and hard to define, between the layout structure and the abstract structure of the program. This must be in some sense implicit in (derivable from) the unformatted code itself, but in that form it's unavailable to the human user. Layout here implies a secondary representation system with a more replete scheme and a systematic mapping to a more explicit abstraction of the domain structure.

For Goodman, secondary notation may often not be notation. Though a programming language is probably as close to a true notation, in his terms, as anything in practical use will get, the various uses of layout are likely to fail the five criteria[5]. But this is perhaps true of all real notations, including Goodman's favourite example, musical notation. Elkins (op. cit.) discusses a Bach autograph score, suggesting in effect (without of course using this terminology) that many of its features may be seen as secondary notation. Aspects of natural language text, such as layout, the use of various fonts, italics, etc. -- and likewise prosody in speech -- seem plausibly to fall under a similar account. Perhaps also, though this is less clear, the approach will extend to those aspects of language known as "iconicity" among linguists (e.g. Haiman 1985; briefly discussed in Lee and Stenning 1998), where for example the sequencing of items in sentences may relate to temporal ordering, etc. The sharp dichotomy that Goodman sets up between the continuous and the discrete is valuable in theory but often very blurred in practice.

The role of the interpretant

Our discussion has emphasised that the relationship between a symbol and what it represents is dependent on a particular way of abstracting a view of the latter. Goodman is indeed keen also to make this point, and it has been seized on by others as a way of responding to his critique of the role of resemblance in representation. Files (1996), for example, draws an instructive analogy with Peirce's tripartite distinction between representation (symbol), representational object and interpretant (interpretation in an interpreting agent). The interpretant corresponds to what has been here repeatedly termed the use of a representation[6]. In non-artificial symbol systems (including e.g. painting and natural language), considerations of use give us our only basis for describing the abstractions that are in play. In artificial systems, as we have seen, secondary uses are likely to usurp the supposedly clean and well-defined abstract semantics which is supposed to account fully for issues of interpretation. Files urges that whereas this framework may explain how something can be a symbol at all, more is required to explain, or ground, what in particular it represents -- its content. He suggests that resemblance plays a role in grounding iconic representations. Our alternative is to ascribe something like this role to structure-mapping in general: it grounds by modulating the use of representations in relation to objects.. Mappings will only affect use if they can be somehow apprehended by the user; to this extent, mappings that coextend with what are usually thought of as (visual or other) resemblances may well be important, but they are accorded no special status. It is not clear that mappings where the resemblance is obscured, e.g. anamorphic pictures which require curved mirrors before their resemblance to their object can be recognised, have any less right to be called "pictures" (cf. Lee 1997), or to be considered any less effectively grounded as representations.

Another view of the tripartite nature of representation is offered by Bull (1994), who combines Goodman's approach with that of Gombrich to produce an interesting emphasis on the notion of a schema, described (in terms that for present purposes are undesirably mentalistic) as "our prior concept of an object's appearance" (214). So we have images, objects and schemata, where the latter form a differentiated symbol scheme which can be used to link images and objects by denoting both. Though taking a very different route, Bull seems to arrive somewhere quite close to Files' position. The schema has very much the role of an interpretant: "We recognise an image correctly if and only if we see it as the schema with which it complies, but the act of recognition does not itself depend on the compliance relationship" (loc. cit.)[7]. What we wish to stress here is that equally the compliance relationship does not depend on the act of recognition. Rather it depends on a structural mapping -- an abstract schema -- that provides for a certain kind of use of the image as a representation. Resemblance and the assistance of visual recognition is just one kind of way in which a mapping can facilitate such use. And this is not to disagree with Elgin (1991) who has noted that

... the scheme/content distinction has come into disrepute, and rightly so. The orders we find are neither entirely of our own making nor entirely forced upon us. There is no saying what aspects of our symbols are matters of conventional stipulation and what are matters of hard fact. For there are few purely conventional stipulations, and no hard facts. (18)

The parallel construction of a-worlds reflects just this kind of mutual interdetermination of our conceptions and our ways of representing them.

Repleteness and relativity

What accounts for the fact that secondary notations emerge? This has to be explained at an extra-systematic level, relative to the original symbol system. There must be a process whereby a new a-world abstraction is (in effect) devised and found to be a proper extension of the original. Alternative such abstractions inevitably exist, and cannot, of course, be evaluated against the original system; instead their evaluation (and indeed the motivation for creating them in the first place) must come from some consideration of the purpose for which they are being used. This may be to do with reasoning, in which case a fairly minimal scheme is likely to be attractive, reducing the danger of unwanted implicatures[8] and other worries. Or it may be to do with aesthetic appreciation. Here, systematicity and repleteness seem again to come apart. I look up and see on the wall a painting by Cezanne which appears to depict a group of women bathers. It is important to my understanding and appreciation of the work that I see it as a picture of such a group, but it does not matter whether there ever actually existed such a group, or whether if so they were very much as depicted. With respect to groups of women, the nature of this painting can be compared to that of a diagram. At the same time, the picture has very many properties that are critical to its appreciation -- line, colour, composition, etc. -- but are of no significant representational interest. In as much as these properties are constitutive of the identity of the painting as an artwork, but largely contingent in relation to what it might depict, we see how thoroughly repleteness is a relative notion: the painting is replete or not only as considered for the time being as a particular kind of symbol in a particular scheme. Here, however, we restate: notwithstanding that the precise semantics is in many respects unimportant, the representational nature of the work in so far as it is considered to be a symbol is central. The relevant scheme cannot be coherently identified except as part of some particular system, and once again the system will ideally exhibit thoroughgoing systematicity.

References

Bull, M. (1994) Scheming schemata. British Journal of Aesthetics 34:3, 207-217.

Elgin, C.Z. (1991) Sign, symbol and system. Journal of Aesthetic Education 25:1, 11-21.

Elgin, C.Z. (1993) Outstanding problems. Synthese 95:1, 129-140.

Elkins, J. (1993) What really happens in pictures: misreading with Goodman. Word and Image 9:4, 349-362.

Files, C. (1996) Goodman's rejection of resemblance. British Journal of Aesthetics 36:4, 398-412.

Goodman, N. (1969) Languages of Art. Oxford University Press.

Goodman, N. & Elgin, C.Z. (1988) Reconceptions in Philosophy and Other Arts and Sciences. Routledge.

Gurr, C. (1998) On the Isomorphism, or Lack of it, of Representations. In Theories of Visual Languages, K. Marriot and B. Meyer eds. 288--301, Springer-Verlag.

Gurr, C., Lee, J. and Stenning, K. (1998, in press) Theories of diagrammatic reasoning: distinguishing component problems. Minds and Machines.

Haiman, J. (1985) Iconicity in Syntax (ed.). John Benjamin.

Lee, J. (1997) Similarity and Depiction. In Proceedings of the Interdisciplinary Workshop on Similarity and Categorisation, M. Ramscar and U. Hahn (eds.) Dept. of Artificial Intelligence, Univ. of Edinburgh. ISBN 0 907330 27 4.

Lee, J. and Stenning, K. (1998) Anaphora in Multimodal Discourse. In Multimodal Human-Computer Communication, Harry Bunt, Robbert-Jan Beun, Tijn Borghuis eds. 250-263 Springer-Verlag.

Oberlander, J. (1996) Grice for graphics: pragmatic implicature in network diagrams. Information Design Journal 8:2, 163-179.

Petre, M. and Green, T.R.G. (1992) Requirements of graphical notations for professional users: electronics CAD systems as a case study. Le Travail Humain 55, 47-70.

Schier, F. (1986) Deeper into Pictures. Cambridge University Press.

Scholz, O. (1993) When is a picture? Synthese 95:1, 95-106.

DRAFT: Last Updated 24.9.98