00 kO
a,
u
(D g
Oi
a>
o
nS u
fH
.o


^ ^»•i-'.T. i-L;-.-.;---••
Volume 20 Number 1 February 1996 ISSN 0350-5596

Informatica
An International Journal of Computing and Informatics
Profile: V.A. Fomichov A Model of Conceptual Items A Logic for Conditional Recomendation Informational Frames and Gestalts The New York Academy of Sciences
The Slovene Society Informatika, Ljubljana, Slovenia
,, h -ti.
Informàtica
An International Journal of Computing and Informatics
Basic info about Informatica and back issues may be FTP'ed from ftp.arnes.si in magazines/informatica ID: anonymous PASSWORD: <yóur mail address> FTP archive may be also accessed with WWW (worldwide web) clients with URL: http://www2.ijs.si/"mezi/informatica.html
Subscription Information Informatica (ISSN 0350-5596) is published four times a year in Spring, Summer, Autumn, and Winter (4 issues per year) by the Slovene Society Informatika, Vožarski pot 12, 61000 Ljubljana, Slovenia. The subscription rate for 1996 (Volume 20) is
-	DEM 50 (US$ 35) for institutions,
-	DEM 25 (US$ 17) for individuals, and
-	DEM 10 (US$ 7) for students
plus the mail charge DEM 10 (US$ 7).
Claims for missing issues will be honored free of charge within six months after the publication date of the issue.
FIIeX Tech. Support: Borut Žnidar, DALCOM d.o.o., Stegne 27, 61000 Ljubljana, Slovenia. Lectorship: Fergus F. Smith, AMIDAS d.o.o., Cankarjevo nabrežje 11, Ljubljana, Slovenia. Printed by Biro M, d.o.o., Žibertova 1, 61000 Ljubljana, Slovenia.
Orders for subscription may be placed by telephone or fax using any major credit card. Please call Mr. R. Mum, Jožef Stefan Institute: Tel (+386) 61 1773 900, Fax (+386) 61 219 385, or use the bank account number 900-27620-5159/4 Ljubljanska banka d.d. Slovenia (LB 50101-678-51841 for domestic subscribers only).
According to the opinion of the Ministry for Informing (number 23/216-92 of March 27, 1992), the scientific journal Informatica is a product of informative matter (point 13 of the tariff number 3), for which the tax of traffic amounts to 5%.
Informatica is published in cooperation with the following societies (and contact persons): Robotics Society of Slovenia (Jadran Lenarčič) Slovene Society for Pattern Recognition (Franjo Pernuš) Slovenian Artificial Intelligence Society (Matjaž Gams)
Slovenian Society of Mathematicians, Physicists and Astronomers (Bojan Mohar)
Automatic Control Society of Slovenia (Borut Zupančič)
Slovenian Association of Technical and Natural Sciences (Janez Peklenik)
Informatica is surveyed by: AI and Robotic Abstracts, AI References, ACM Computing Surveys, Applied Science & Techn. Index, COMPENDEX*PLUS, Computer ASAP, Cur. Cont. & Comp. & Math. Sear., Engineering Index, INSPEC, Mathematical Reviews, Sociological Abstracts, Uncover, Zentralblatt für Mathematik und ihre Grenzgebiete., Linguistics and Language Behaviour Abstracts, Cybernetica Newsletter
The issuing of the Informatica, journal is ßnancially supported by the Ministry for Science and Technology, Slovenska 50, 61000 Ljubljana, Slovenia.
PROFILES
In the last few years, in some countries of the East Europe and the Central-East Europe, dramatic political and social changes have taken place. Scientists of these regions have been confronted with elementary professional, even surviving problems, when large research organizations, being supported by the state, closed their doors, when salaries at universities and research institutes highly decreased, and when in some countries even reduced, salaries were payed with prolongated delays under conditions of rapid inflation. Though new sources for funding scientific researches have been engaged from abroad (e.g.. National Science Foundation in U.S.A., European Communities' Foundations), they have improved the situation only partly, and very often the researchers in these countries need to undertake considerable additional efferts in order to carry out scientific studies, especially fundamental studies.
Professor Vladimir A. Fomichov belongs to the generation of scientists who, notwithstanding all difficulties, have continued to work persistently and fruitfully in the chosen directions. He has an excellent scientific potential which will—as we hope and wish—show the power of new concepts in the formal semantics of natural language. This should afford new important opportunities in the field of information processing concerning language understanding, language generation, knowledge representation, and knowledge transfer. The readers are advised to make them acquainted with a Fomichov's paper in this issue of Informatica entitled A Mathematical Model for Describing Structured Items of Conceptual Level.
Vladimir A. Fomichov (Fomitchev)
Vladimir A. Fomichov^ is a professor of computer science at the Moscow State Aviation Technology and also State University and the Moscow State Institute of Electronics and Mathematics (Technical University). He holds a M.Sc. with honours in mathematics and Ph.D. in mathematical computer
'The transcription of Russian names from the Cyrillic into Latin usually foüows a French transcription. Thè original name B.aaanMiip AjìeKcanapoBHH ^oMiraès, its last part, can be transcribed as Fomitchev, Fomitchov, and/or Fomichov (phonetically).
science (technical cybernetics and information theory) in 1973 and 1983, respectively. Thè title of his thesis was "Development and Application of Methods of Logic-Algebraic Modelling the Mechanisms of Speech-Formation Semantics".
In 1967, he finished a secondary school in Moscow specialized in mathematics with the gold medal. In 1973, he became a laureate of the All-Union competition of students' scientific works.
Besides in the Russian language, he speaks, reads, and writes in English, German, and French, reads in Polish and Italian.
Membership in Scientific Academies and Associations
V.A. Fomichov is an active member of the New York Academy of Sciences (founded in 1817) since December 1994, with the primary interest in computer science, linguistics, and science education.
He is a member of the International Cybernetics Academy "Stefan Odobleja" (head office in Lugano, Switzerland; secretariats in Milano, Italy and Bucharest, Romania) since 1994. He is an active member of the Editorial Board of the international journal Informatica since 1992.
Further, he is a scientific member of the International Association for Cybernetics (Namur, Belgium), and a member of the Moscow Mathematical Society.
In October 1995, he was nominated for inclusion into the 14th edition of Marquis Who's Who in the World, scheduled for publication in December 1996. In January 1996, he passed successfully the initial screening, and his data are scheduled for publication.
V.A. Fomichov has been nominated by the Editorial Board of the American Biographical Institute, Inc. (Releigh, NC) for biographical inclusion in the Fifth Edition of "Five Hundred Leaders of Influence" (Publication: Early 1997).
Present and Previous Working. Positions
V.A. Fomichov is a full professor of computer science at the Moscow State Aviation Technology University since February of 1966, an associate professor of computer science at the Moscow State Institute of Electronics and Mathematics (Technical University)
since 1984, and an associate professor of computer science at the Faculty of Mechanics and Mathematics of the Moscow State University since 1993.
In 1985, he was awarded the title "Associate Professor of Cybernetics" by the Highest Certifying Commission at the Council of Ministers of USSR. In 1973-84, he served as assistant professor of computer science at the Moscow Institute of Electronic Engineering, Faculty of Applied Mathematics, Department of Cybernetics.
His experience in delivering lectures concerns discrete mathematics, theory of formal languages, theory of algorithms and computational processes, theoretical foundations of informational languages, mathematical foundations of natural-language-processing systems, and algorithmic languages and programming.
International Activity in the Last Year
V.A. Fomichov was a member of the Program Committee of the First Workshop on Applications of Natural Languages to Data Bases (Versailles, France, June 1995).
He worked as the chairperson of the Symposium From Ideas of Artificial Intelligence and Cognitive Science to New Theories and Methods of Teaching being part of the 14th International Congress on Cybernetics, in Namur, Belgium, in August 1995.
He was an invited speaker of the First Workshop Algebraic Methods in Language Processing— AMiLP '95 at the University of Twente, Enschede, The Netherlands, in December 1995.
Main Scientific Results and Interests
V.A. Fomichov is the author of 65 scientific papers and technical reports, including one monograph (1988) in Russian (B.A. $oMiiqeB: IIpejncTaBJie-HHe HH^iOpMaHHH Cpe/ICTBaMH K-HCHHCXEeHHH, Mo-CKBa, 1988—Representing Information by Means of K-calculuses, Moscow, 1988).
A. His main innovation is the creation of an appropriate basis for developing a general mathematical theory of natural language (NL) use by intelligent systems (or of NL-communication).
As a consequence, flexible mathematical methods for describing semantics and pragmatics of NL-texts are elaborated which may be effectively applied to and provide unique large opportunities for designing
arbitrarily complicated natural-language-processing systems, including, in particular, biomedical and legal textual (or full-text) databases (TDBs) and TDBs storing administrative, business, or scientific-technological documentation, and for developing powerful and flexible hybrid knowledge representation systems.
B. Proceeding from the ideas of the artificial intelligence theory and cognitive science, he developed together with Olga S. Fomichova a new, highly effective theory of teaching being, in essence, a theory of bridging gaps between the inner world's pictures (or conceptual systems) of a teacher and a learner. That theory, based on new methods of teaching, opens very large new prospects for teaching children and university students, training and retraining of teachers, building Intelligent Tutoring Systems with friendly interfaces.
Fomichov's Main Results and Interests in Computer Science and Mathematical Linguistics
Professor Fomichov developed a new, highly powerful and flexible approach to the formalization of semantics and pragmatics of natural language and to the use of mathematical methods in the design of NL-processing systems. This approach is logic-algebraic and is called Integral Formal Semantics (IFS). The central constituent of IFS is the theory of K-calculuses, algebraic systems of conceptual syntax, and K-languages (the KCL-theory).
Judging by the literature, the KCL-theory is now the only theory providing the possibility to describe effectively structured meanings (SMs) of arbitrarily complicated real discourses pertaining to science, technology, medicine, business, law, etc.
The KCL-theory opens large prospects to develop new, highly powerful and flexible hybrid knowledge representation systems (terminological knowledge representation systems).
The KCL-theory provides unique opportunities for building semantic representations of arbitrarily complicated visual images (drawings of technical objects, etc.) and hence may be considered as a powerful tool to design visual information management systems. A part of the KCL-theory is presented in the author's paper in this issue of Informatica.
The development of the KCL-theory can be divided into two periods, the first one (1975-83) and
the second one (1984 to present).
In the first period, the theory of S-models, S-calculuses and S-languages, T-calculuses and T-languages (STCL-theory) was developed. This theory provided new powerful mathematical means for describing structured meanings of NL-sentences and complicated NL-discourses—in particular, more powerful than the means provided by Montague Grammars, Discourse Representation Theory, Situation Semantics, and Theory of Generalized Quantifiers.
In the second period, the theory of K-calculuses, algebraic systems of conceptual syntax, and K-languages (the KCL-theory) has been created on the basis of the STCL-theory.
The kernel of the KCL-theory is an original mathematical model describing many regularities of structured meanings of NL-sentences and NL-discourses of arbitrarily great length and complexity.
For the first time such a class of formulas is determined that the structured meaning of practically arbitrary NL-text of arbitrary great length can be represented by a compact formula or a set of formulas from this class. The opportunities to describe structured meanings of NL-sentences and NL-discourses afforded by the KCL-theory by far exceed the opportunities of main popular approaches to the formalization of NL-semantics—Montague Grammar and its non-dynamic extensions. Discourse Representation Theory, Situation Semantics, Theory of Generalized Quantifiers, Dynamic Predicate Logic, and Dynamic Montague Grammar. As a consequence, the KCL-theory provided in 1988 much more powerful mathematical methods to describe structured meanings of sentences and discourses than methods stated in the most complete until recently text-book on mathematical linguistics, B.H. Partee, A. ter Me-ulen & R.E. Wall: Mathematical Methods in Linguistics, Kluwer, Dordrecht, 1990.
The expressive power of standard K-languages exceeds the expressive power of Episodic Logic, suggested by L.K. Schubert and C.H. Hwang.
It is highly important that the KCL-theory provides effective mathematical means both for describing structured meanings of NL-texts, interrelations of surface and semantic structures of NL-texts and for representing knowledge about the reality, developing powerful knowledge representation systems, including terminological languages.
The key advantage of IPS consists in providing a widely-applicable framework for mathematical mo-
delling the use of natural languages, i.e. for studying and formalizing the logic of natural language use. The analysis indicates that IPS may be effectively applied to the design of arbitrarily complicated NLPSs, including full-text D Bs.
IPS provides means, principles, and methods for speeding-up the development of the mathematical theory of natural language use. The author is on the way to develop the foundations of a theory of information retrieval in textual databases. A possible direction of studies is also the elaboration of powerful knowledge representation systems and models of knowledge bases.
During the 1980's and 1990"s, V.A. Pomichov has been the head of the research groups (at Technical University) applying the methods and models of IPS to the design of NL-processing systems: NL-text animation subsystems of training complexes for preventing the collision of ships, NL-interfaces to flexible manufacturing systems, expert systems for technical diagnostics, and autonomous intelligent agents (robots). The theoretical basis for these applied studies was provided by several original mathematical models of the correspondences
Text
^a sentence,^ a discourse, a discourse fragment
Knowledge
Semantic, \ ^	. -r ^
, r	,1 Representation of I ext
Informational;
and several algorithms of semantic-syntactic analysis.
The KCL-theory affords new possibilities for the development of powerful and flexible hybrid knowledge representation systems processing and greater expressive power than the language developed in the framework of the known project LILOG^.
The theory of K-calculuses, algebraic systems of conceptual syntax, and K-languages (KCL- theory) is a tool of unique effectiveness from the standpoint of designing textual (or full-text) databases (TDBs). The designers of TDBs for the first time get a universal instrument for formal describing (representing as formulas) the conceptual structures (including re-
^O. Herzog & C.-R. Rollinger (Eds.), Text Understanding in LILOG. Integrating Computational Linguistics and Artificial Intelligence. Final Report on the IBM Germany LILOG-Project, Springer-Verlag, Berlin, 1991.
ferential structures) of analyzed fragments of NL-discourses, the used notions, and knowledge items of other kinds.
Thus, the creation of the KCL-theory means the emergence of a universal effective formal tool for meditating and recording the results of mediating in the course of designing the TDBs.
Besides, Integral Formal Semantics (IFS) creates the preconditions for the development of a mathematical theory of conceptual information retrieval in TDBs as a theory stating logical foundations of constructing TDBs.
Fomichov's Main Results in the Field of AI Theory, Cognitive Science, and Theory of Teaching
Simultaneously with the work in the mathematical theory of NL-processing systems. Professor Fomichov has been developing during several years, together with Dr. Olga Fomichova, the foundations of a theory of teaching called the Theory of Dynamic Conceptual Mappings (DCM-theory) and being motivated by ideas of the Al theory, cognitive psychology, and cognitive linguistics. The DCM-theory bridges gaps between the inner world's pictures (conceptual systems) of a teacher and a learner. On this basis, highly effective methods of teaching children foreign languages have been elaborated, called the Methods of Emotional-Imaginative Teaching (EIT-methods). They may be effectively applied to teaching young children mathematics, other theoretical disciplines, and English as the mother language^.
The obtained results open new, very large and unique prospects to construct Intelligent Tutoring Systems with friendly interfaces helping children and adults to learn foreign languages. DCM-theory and EIT-methods provide new opportunities for training and retraining teachers of varied disciplines.
®See, e.g., V.A. Fomichov and O.S. Fomichova: The Role of the Artificial Intelligence Theory in Developing New, Highly Effective Methods of Foreign Languages Teaching, in PEG '93, Proceedings of the Seventh International PEG Conference on AI Tools and the Classroom: Theory into Practice. Moray House Institute of Education, Heriot-Watt University, Edingjaurgh, 1993, pp. 319329 and V.A. Fomichov and O.S. Fbmichova: The Theory of Dynamic Conceptuai Mappings and Its Significance for Education, Cognitive Science, and Artificial Intelligence. Informatica 18 (1994) [2] 131-148.
Professor Fomichov's Family
The family of V.A. Fomichov is, in a full sense of the word, the academic (scientific) one. His wife, Olga S. Fomichova (Fomitcheva), holds a Ph.D. in philology and is his scientific colleague in one of the fields he works. Her research interest is in the theory and methods of teaching, especially of English as a second language. She speaks Russian, English, and German. She has been nominated by the Editorial Board of the American Biographical Institute, Inc. (Releigh, NC) for biographical inclusion in the Fifth Edition of "Five Hundred Leaders of Influence" (Publication: Early 1997).
Their son Dmitri manifested very early mathematical abilities and skipped three grades after the third grade. In 1994, he finished a secondary school in Moscow specialized in mathematics and entered the Faculty of Computational Mathematics and Cybernetics of the Moscow State University being 13 years old. He likes mathematics, programming, sport swimming and running and is a student of the second year of the Moscow State University.
The mother-in-law, Dr. Ljudmila Udalova, who lives with the family, holds a Ph.D. in biology, and is retired.
Addresses:
Professor vladimir A. Fomichov
Department of Information Technologies, Faculty of Applied Mathematics and Information Technologies, Moscow State Aviation Technology University, Petrovka Street 27, 103767 Moscow, Russia.
Department of Discrete Mathematics, Faculty of Mechanics and Mathematics, Moscow State University, Vorobyovy Hills, 119899 Moscow, Russia, E-mail: vafQnw.raath.msu.su, Phone: -(-7 (095) 930 98 97.
Address for correspondence: Department of Mathematical Provision of Information-Processing and Control Systems, Faculty of Applied Mathematics, Moscow State Institute of Electronics and Mathematics— Technical University, Boi. Tryokhsvyatitelsky pe-reulok, 3/12, 109028 Moscow, Russia, E-mail: rootaonti.miera.msk.su, Fax:-t-7 (095) 916 28 07 (for Prof. V. Fomichov).
Compiled and commented by A.P. Železnikar
A Mathematical Model for Describing Structured Items of Conceptual Level
Vladimir A. Fomichov
An Active Member of the New York Academy of Sciences
Department of Discrete Mathematics, Faculty of Mechanics and Mathematics,
Moscow State University, Vorobyovy Hills, 119899 Moscow, Russia
E-mail: vaf@nw.math.msu.su, Phone: +7 (095) 930 98 97, and
Department of Math. Provision of Information-Processing and Control Systems,
Faculty of Applied Mathematics, Moscow State Inst, of Electronics and Math. - Techn. Univ.
Bol. Tryokhsvyatitelsky pereulok, 3/12, 109028 Moscow, Russia
Keywords: artificial intelligence, natural language processing, natural language semantics, conceptual level, discourse, structured meaning, semantic representation, mathematical model, formal language, integral formal semantics, restricted K-calculus, restricted standard K-language, terminological knowledge representation language, universal metagrammar of conceptual structures, universal stationary metagrammar of deep conceptual syntax
Edited by: Anton P. Železnikar
Received: January 10, 1995 Revised: August 16, 1995 Accepted: August 22, 1995
A ma,thema,tical model is determined and investigated which is destined for describing structured items of conceptua.1 level, in particular, for describing structured meanings (SMs) of natural language (NL) sentences and discourses. It defines a new class of calculuses called restricted K-calculuses and a new class of formal languages called restricted standard K-languages. The model is a modified propositional component of the theory of K-calculuses and standard K-languages being the central constituent of Integral Formal Semantics ofNL—an original approach to the mathematical study of NL use. The formal means provided by the model are convenient, in particular, for describing SMs of discourses with references to the mentioned entities and to the meanings of phrases or larger fragments of texts, with complicated designations of sets, with definitions of concepts. It is shown that the possibilities of the model to describe SMs of sentences and discourses and to represent concepts considerably exceed the possibilities of Montague Grammar, Situation Theory, Theory of Generalized Quantifiers, Discourse Representation Théory, Theory of Conceptual Graphs, Episodic Logic, and several other approaches.
It is pointed out that the results may be used, in particular, for developing mathematical theory of NL use, or mathematical linguocybernetics, creating new hybrid knowledge representation systems, building very powerful and flexible languages of semantic representations of texts. The model is considered as an important contribution to the development of a Universal Metagrammar of Conceptual Structures. The hypothesis is put forward that the model may be interpreted as a variant of a Universal Stationary Metagrammar of Deep Conceptual Syntax.
1 Introduction	ted raising new demands to the theory of natural
language processing (NLP).
In the beginning of the nineties, at least three
. grandiose scientific-technical tasks were formula- The first and second tasks are working up
michov 1981, 1982; Fomitchov 1983, 1984; Fomi-chov 1988, 1992, 1993b, 1994).
If a model is convenient for describing arbitrary conceptual structures of NL-texts and representing arbitrary knowledge about the world, we'll say about a Universal Metagrammar of Conceptual Structures, or a Universal Conceptual Metagrammar (UCM).
The reason to say about a metagrammar but not about a grammar is as follows. A grammar of conceptual structures is to be a formal model dealing with elements directly corresponding to some basic conceptual items (like "physical object", "space location", etc.). An example of such semi-formal grammar is provided by the known Conceptual Dependency theory of R. Schank. On the contrary, a metagrammar of conceptual structures is to postulate the existence of some classes of conceptual items, to associate in a formal way with arbitrary element from each class certain specific information, and to describe the rules to construct arbitrarily complicated structured conceptual items in a number of steps in accordance with such rules (proceeding from elementary conceptual items and specific information associated with arbitrary elements of considered classes of items).
All approaches mentioned above (including Montague Grammar) practically don't give the cues to the construction of a UCM. The same conclusion is valid also as concerns the Theory of Conceptual Graphs (Sowa 1984, 1991; Ellis 1995) and the Theory of Semantic Structures of R.S. Jackendoff (Jackendoff 1990). Conceptual graphs are rather close to the formulas of first-order logic, and its restrictions are well known. Semantic structures suggested by R.S. Jackendoff may be approximated by a special variant of first-order logic with other sorts of entities besides individuals, as it is shown in (Zwarts & Verkuyl 1994).
The present paper continues a long series of works on IPS aimed at constructing a UCM and including, in particular, the works (Fomi-chov 1981, 1982; Fomitchov 1983, 1984; Fomichov 1988, 1992, 1993b, 1994). The monograph (Fomichov 1988) contains the foundations of the theory of K-calculuses, algebraic systems of conceptual syntax, and K-languages (the KCL-theory). The kernel of the KCL-theory is the theory of K-calculuses and standard K-languages (the KCSL-
theory). It appears that the KCSL-theory may be interpreted as the first approximation of a UCM. The papers (Fomichov 1992, 1993b, 1994) set forth many principal ideas of the KCSL-theory. In fact, these three publications state (without many mathematical details) a method to describe structured meanings (SMs) of practically arbitrary (very likely, arbitrary) NL-texts and to represent diverse pieces of knowledge about the world.
The aim of this paper in the narrow sense is to determine and investigate a widely-applicable CM considerably extending the spectrum of mathematical means for describing SMs of NL-texts and knowledge about the world provided by the approaches mentioned above (except IFS). The model to be defined is a modified and simplified propositional component of the KCSL-theory.
The aim of this paper in a broad sense is to overcome the existing "barrier of complexity" on the way of developing a Universal Metagrammar of Conceptual Structures and to make a considerable part of the work on that way; in particular, to suggest a general methodology for solving this problem.
The composition of the paper is as follows. Section 2 contains the task statement. Sections 3-12 introduce mathematical definitions and contain a mathematical investigation of the determined formal objects. Section 13 illustrates some expressive possibilities of defined languages and outlines the directions of extending the expressive power of the constructed model. Section 14 points out the advantages of the model in comparison with a number of related approaches. Section 15 indicates some possible applications of the described model.
2 Task Statement
There are some important aspects of formalizing NL-semantics which were underestimated or ignored until recently by the dominant part of researchers.
First of all (see also Section 3 of Fomichov 1993a), this applies to formal investigating SMs of
(a)	narrative texts including descriptions of sets;
(b)	discourses with references to the meaning of sentences and larger fragments of texts; (c) phrases where logical connectives "and", "or" are used
in non-traditional manners and join not fragments expressing assertions but descriptions of objects, sets, concepts; (d) phrases with attributive clauses; (e) phrases with the words "concept", "notion".
Besides, the most popular approaches to the mathematical study of NL-semantics (mentioned in Section 1) practically don't take into account the role of knowledge about the world in NL comprehension and generation and hence do not study the problem of formal describing knowledge fragments (definitions of concepts, etc.). It should be added that texts have authors, may be published in one or other source, may be inputted from one or other terminal, etc. The information about these external ties of a text may be important for its conceptual interpreting. That's why it is expedient to consider a text as a structured item having a surface structure T, a set of meanings 5 (in most cases S consists of one meaning) corresponding to T, and some values Vi^.. denoting the author (authors) of T, the date of writing (uttering) T, indicating the new information in T, etc. The main popular approaches to the mathematical study of NL-semantics provide no formal means to represent texts as structured items of the kind.
The analysis shows that the limits of traditional mathematical logic are too narrow for taking into account the enumerated restrictions in formalizing the use of NL. That's why the task of creating logical foundations of designating intellectually powerful NLPSs demands not only to extend the first order logic but rather to develop new mathematical systems compatible with the first order logic and allowing us to formalize the logic of NL use.
We'll proceed from the hypothesis that there is only one mental level for representing meanings of NL-expressions, which may be called a conceptual level but not the semantic and the conceptual levels. This hypothesis is advocated by a number of scientists (see, in particular, Meyer 1994).
Let's demand that the formal means of our model allow us:
1. To build the designations of structured meanings (SMs) of both phrases expressing assertions and of narrative texts; such designations are called usually semantic representations (SRs) of NL-expressions.
2.	To build and to distinguish the designations of items corresponding to (a) objects, situations, processes of the real world and (b) concepts qualifying these objects, situations,
■ processes.
3.	To build and to distinguish the designations of (3.1) objects and sets of objects, (3.2) concepts and sets of concepts, (3.3) SRs of texts and sets of SRs of texts.
4.	To distinguish in a formal manner concepts qualifying objects and concepts qualifying sets of objects of the same kinds.
5.	To build compound representations of concepts, e.g., to construct formulas reflecting the surface semantic structure of the NL-expressions like "a person graduated from the Stanford University and being a biologist or a chemist".
6.	To construct explanations of more general concepts by means of less general concepts; in particular, to build strings of the form (a = Des{b)) where a designates some concept to be explained and Des(b) designates a description of some concretization of the known concept b.
7.	To build designations of ordered n-tuples of objects (tz > 1).
8.	To construct: (8.1) formal analogues of complicated designations of sets like the expression "this group consisting of 12 tourists being biologists or chemists", (8.2) designations of sets of tuples, (8.3) designations of sets of sets, etc.
9.	To describe set-theoretical relationships.
10. To build designations of SMs of phrases containing, in particular: (10.l) the words "arbitrary", "some", "all", "every", etc.; (10.2) expressions formed by means of applying the connectives "and", "or" to the designations of (10.2a) things, events, (10.2b) concepts, (10.2c) sets; (10.3) expressions where the connective "not" is located just before a designation of a thing, event, etc.; (10.4) indirect speech; (10.5) the participle constructions and the attributive clauses; (10.6) the words "a concept", "a notion".
11.	To build designations of SMs of discourses with references to the mentioned objects.
12.	To indicate.explicitly in SRs of discourses causal and time relationships between described situations (events).
13.	To describe SMs of discourses with references to the meanings of of phrases and larger fragments of a considered text.
14.	To express the assertions about the identity of two entities.
15.	To build formal analogues of formulas of the first-order logic with the existential and/or universal quantifiers.
16.	To build formulas with logical quantifiers on sets.
17.	To consider non-traditional functions (and other non-traditional relations of the kind) with arguments or/and values being: (17.1) sets of things, situations (events); (17.2) sets of concepts; (17.3) sets of SRs of texts.
18.	To build conceptual representations of texts as informational objects reflecting not only the meaning but also the values of external characteristics of a text: the author (authors), the date, the application domains of the stated results, etc.
This task statement develops the task statements from (Fomichov 1981,1982; Fomitchov 1983, 1984; Fomichov 1988, 1992).
3 Simplified Sort Systems
Let's begin to solve the task posed above. Suppose that we are to describe some application domain and we've decided to consider some entities as elementary entities (people, numbers, events, concepts, etc.). Then determine compound entities for the given domain as such entities which will be considered as sets or 7i-tuples (n > 1) formed by more simple entities. We'll interpret concepts as general designations of entities belonging to some distinguished by people classes of entities.
Objects are determined as such entities which are not treated as concepts. The class of objects
includes, in particular, SRs of texts and sets of concepts.
Suppose that describing a domain begins with choosing some finite set of strings (sorts) denoted by St and containing the designations of the most general concepts qualifying elementary objects from our domain.
Let St contain a distinguished sort P qualifying SRs of propositions (assertions) expressed by sentences and discourses. Assume that some binary relation <—• on St is given, and for s^t £ St the designation s <— t means that either s and t are identical or a concept associated with t is a generalization of a concept associated with s.
Definition 3.1 A triple S of the form {St, P,
(1)
is called a simplified sort system (s.s.s.), if St is a finite set of symbols, P G St, St \ {P} ^ 0, <— is a partial order on St, St ^ {u £ St I u and P are comparable with respect to'^—}, and St doesn't include the symbols t, {,},(,), [t ent], [t ob], [t conc], [ent], [ob], [conc]. The elements of the set St are called sorts, P is called the sort "sense of proposition", <— is called the generality relation. If t, u G St, t <— u, then we'll say that u is a generalization oft. □
Example 3.1 Let Ai be the alphabet consisting of small and capital Latin letters, ciphers 0,1,2,...,9, and the symbol "." (the dot), and = {Pro, sit, nat,int, real, ins, mom}. We'll interpret the elements of Sti as the designations, respectively, of the concepts "semantic representation of a declarative text", "situation", "natural number", "integer", "real number", "intelligent system", "moment". Let the sets Gi, G2, Geni be defined as follows:
Gl = {(u,u)|« e -Sil},
G2 = {{nat, int), {nat, real), {int, real)}.
Geni = Gl U G2.
Then it is easy to see that the triple Si of the form {Sti, Pro, Geni) is an s.s.s. •□
4 Types Generated by a Simplified Sort System
Let us define for any s.s.s. S a set of strings Tp{S) whose elements are called the types of the system
S and are interpreted as characteristics of entities which are considered while reasoning in a selected domain. Agree that if in a reasoning about an entity z it is important that z is not a concept (a notion) then we'll say that z is an object.
Suppose that the strings ["j" ent], [f conc], [t ob] are associated with the terms "an entity", "aconcept", "an object", respectively, or, in other words, these strings are types of semantic items corresponding to the expressions "an entity", "a concept" ("a notion"), "an object".
Formalizing the reasonings, let's agree to proceed from the following recommendations. If the nature of an entity z considered in a reasoning is of no importance then we'll associate with z the type [ent] in the course of reasoning. If all that is important concerning z is that z is an object then we'll associate with z the type [ob]. If, to the contrary, all that is important as concerns 2 is that 2 is a concept (a notion) then we'll associate with 2 the type [conc]. The destination of types [ent], [oò], [conc] may be explained also by means of the following examples.	)
Let El and E2 be, respectively, the expressions "the first entity mentioned on the page 12 of the issue of the newspaper 'The Moscow Times' published on October 1, 1994" and "the first object mentioned on the page 12 of the issue of 'The Moscow Times' published on October 1, 1994". Then we may associate the types [ent] and [oò] with the entities referred in Ei and E2, respectively, in case we haven't read the page 12 of the indicated issue.
However, after reading the page 12 we'll get to know that the first entity and the first object mentioned on this page is the city Madrid. Hence, we may associate now with the mentioned entity (object) a more informative type popul.area (a populated area).
Let E3 be the expression "the notion with the mark AC060 defined in the Longman Dictionary of Scientific Usage (Moscow, Russky Yazyk Publishers, 1989)". Not seeing this dictionary, we may associate with the notion mentioned in £3 only the type [conc]. But after finding the definition with the mark AC060, we get to know that it is the definition of the notion "a tube" (a hollow cylinder with its length much greater than its diameter). Hence we may associate with the notion mentioned in E3 a more informative type ph
(designating the concept "a physical object").
Let's consider the strings [^ ent], [t ob], [I conc], [ent], [ob], [conc] as symbols in the next definitions.
Definition 4.1 Let S be an s.s.s. of the form (1),
Spectp = {[t ent], [f ob], [f conc]}, Toptp = {[ent], [ob], [conc]}.
Then the set of types Tp{S) is the least set M satisfying the following conditions:
1.	Spectp U Toptp U StU {t s \ s e St} C M. The elements of Spectp and Toptp are called special types and top types, respectively.
2.	Ift£M\ Spectp, then the string of the form {i} belongs to M.
3.	Ifn> 1, for i= 1,.. .,n and ti e M\Spectp, then the string of the form {ti,..., tn) belongs to M.
4- If t £ M and t has the beginning { or (, then the string f t belongs to M.
□
Definition 4.2 If S is an s.s.s., then Mtp{S) = Tp{S)\Spectp; the elements of the set Mtp{S) are called main types. □
Let's formulate the principles of establishing correspondences between the entities considered in a domain with an s.s.s. S of the form (1) and the types from the set Mtp{S). The types of concepts, as distinct from types of objects, have the beginning j". With a concept denoted by a string s from St we associate a type t s. The type {i} corresponds to any set of entities of type t. li xi,... ,Xn are the entities of types ti,...,tn , then the type (il,...,tn) is assigned to the n-tuple {xi,.. .,Xn).
Example 4.1 We may assign the types from Mtp{Si) to some concepts and objects (including relations and other sets) with the aid of the following table:
ENTITY
TYPE
the concept "a set"	t {[ewi]}
the concept' "a set of objects"	t {[oö]}
the concept "a set of concepts"	t {[cone]}
the concept "a person"	v	t
Tom Soyer	ins
the concept "an Editorial	t {«»^s} Board"
the Editorial Board of	{ins}
"Informatica" the concept "a pair of integers" t {int, int)
the pair (12, 144)	"	(int, int)
We can also associate with the relation "Less" on integers the type {{int,int)}, with the relation "To belong to a set" the type {{[ent], {[ent]})}, with the relation "An object Y is qualified by a concept C" the type {([oò], [conc])}, and with the relation "A concept D is a generalization of a concept C" the type {{[conc], [conc])}. □
Some types may be particular cases of other types. That's why formulate several definitions in order to determine the notion of a concretization of a type.
Definition 4.3
(V,
Let S be an s.s.s. of the form
4- If t £ Mtp[S), t includes a substring [conc], u G Tc{S), and w is the result of simultaneous replacing each occurrence of the symbol [conc] by the type u, then w is a possible result of applying the transformation tr4 to t.
5. If t belongs to Mtp{S), r,u belong to St, t contains a substring r, u <— r, u ^ r, w is the result of replacing in t some occurrence of r by the sort u, then w is a possible result of applying tr^ to t.
Tc[S) = {i G Tp{S) \ {Spectp U Toptp) \
the symbol t is the beginning of t},
Tob{S) = Tp{S) \ {Spectp U Toptp U Tc{S)).
Then determine the transformations tri,...,tr^ partially applicable to the types of the set Mtp[S) (called main types) in the following way:
1.	If t e Mtp{S), t includes a substring [ent], then ir I is applicable to t and tri {t) is the result of simultaneous replacing in t each occurrence of [ent] by the type [oò].
2.	If t e Mtp{S), t includes a substring [ent], then tr2 is applicable to t andtr^it) is the result of simultaneous replacing in t each occurrence of [ent] by the type [conc].
3.	If t e Mtp{S), t includes a substring [ob], u e Tc{S), u e Tob{S), w is the result of simultaneous replacing each occurrence of the symbol [o6] by the type u, then w is a possible result of applying tr^ to the type t.
□
Example 4.2 If Si is the s.s.s. built in the Example 3.1, ti = [oò], wi = ins, then wi is a possible result of applying the transformation tr^ to ti. If t2 = {{ins, real)}, W2 = {{ins,nat)}, then W2 is a possible result of applying tr^ to Ì2 (since nat <— real).
If S is an s.s.s., then denote by Strings{S) the set of all strings in the alphabet St U Toptp U {',',!/{','}',(,)} (hence Strings{S) contains the symbol "comma").
It is not difficult to prove the following proposition:
Let S be arbitrary s.s.s., i G {l,...,5},i G Mtp{S), the transformation tpi may be applied to the type t, and the string w G Strings{S) be a possible result of applying just one time the transformation tri to the type t. Then w G Mtp{S). □
Definition 4.4 Let S be an s.s.s., t,u £ Mtp{S). Then the type u is called a concretization of the type t, and t is called a generalization of the type u (the equivalent designation is t \- u), if u = t or there are such xi,.. .,Xn G Mtp{S), where n > 1, that xi =t,Xn = u, and for i = 1,... ,n — 1 there is such k £ {1,..., 5} that the transformation trk is applicable to Xi and Xi+i is a possible result of applying trk to Xi. □
Thus, a binary relation h is determined on the set of main types Mtp{S) of arbitrary s.s.s. S.
Example 4.3 Let Si be the s.s.s. built in the Example 3.1. Then it is not difficult to show that
[ent] h [ob], [ent] h [conc], [ob] h ins, [ob] I- {({ins}, nai)}, [ob] h {nat, nat), [conc] h t ins, [conc].h t {ins}.
One can easily show that the following statement is true:
If S is an arbitrary s.s.s., then the relation h is a partial order on the set Mtp{S). □
5 Concept-Object Systems
Let's proceed from the assumption that for describing an application domain on the conceptual level we should choose some sets of strings X and V. The first set is to contain, in particular, designations of concepts, physical objects (people, ships, books, etc.), events, n-ary relations (ra > 1). The set V should consist of variables which will play in expressions of our knowledge representation language the roles of marks of diverse entities and, besides, will be used together with the quantifiers 3 and V.
We'll distinguish in X some subset F containing the designations of diverse functions. Each function / with n arguments will be considered as some set of n -F 1-tuples (a;i,..., a;„, y), where y = /(xi,..., Xn)- Besides, we'll introduce a mapping tp assigning to each element d £ X U V a type tp{d) characterizing the entity denoted by d.
Definition 5.1 Let S be any s.s.s. of the form (1). Then a four-tuple Ct of the form
{X, F, tp, F)
(2)
is called a concept-object system (c.o.s.) for S, iff the following conditions are satisfied:
(1)	X,V^ are countable non-intersecting sets of symbols;
(2)	tp is a mapping from X UV into Tp{S);
(3)	{u 6 y I tp{v) = [ent]} is countable;
(4)	F C {r £ X \ ip(r) is a string with the beginning {{and the ending)}},
(5)	St C X and for each s G St, tp{s) = t s.
The set X is called a primary universe, the elements of V and F are called variables and functional symbols, respectively.
If d £ X \JV, tp{d) — t, then we say that t is the type of d. □
Example 5.1 Construct some c.o.s. Cti for the s.s.s. 5i determined in Example 3.1. Let N be the set of all such strings str formed out of the ciphers 0,1, ..., 9 that the first symbol of str is distinct from "0" in case str contains more than one symbol. Let
Ui = {concept, person, chemist, biologist, tour.gr, expll, /irei, J.Price, R.Scott, N.Cope, P.Somov, Friends, Numb, Less, Cause, Knows, /si, Elem, Subset, Includel, Noiu, Before}.
The strings of Ui denote, respectively: the notions "a concept", "a person", "a chemist", "a biologist", "a tourist group", "an explosion", "a fire"; some concrete persons with the initial and surname J.Price, R.Scott, N.Cope, P.Somov; the function "Friends" assigning to a person 2 the set of all friends of z; the function "Numb" assigning to a set the number of elements in it; the relations "Less" (on the set of integers), "Cause" (on situations), "Knows" ("An intelligent system knows something"), "Isl" ("An object is qualified by a concept"), "Elem" ("An entity zi is element of a set 22"), "Subset" ("To be a subset of a set of entities"), "Includel" ("An intelligent system Zi includes an object Z2 into some set of objects Z3 at the moment 24"), "Before". The string Noiu will be used in SRs of texts for denoting a current moment of time.
Define a mapping ti from Ui into rp(5i) by the following table:
X	h{x)
concept	[t conc]
person, chemist.	t ins
biologist	
tour.group	t {Ì71S}
expll, firel	t sit
J.Price, R.Scott,	ins
N.Cope, P.Somov	
Friends	{{ins, {ins})}
Numb	
Less	{{int, int)}
Cause	{{sit, sit)}
Knows	{{ins. Pro)}
Isl	{(M,[conc])}
Elem	{{[ent], {[ent]})}
Subset	{({M]},{[eni]})}
Includel
Now Before
{(ins, [obi {[ob]},
mom)} mom
{(sit, sit)}
Let
Vent = {u I u = an, a is the letter "x"}, Vset ={v\v = bn, ò is the letter "M"}, Ve={v\v = dn, d is the letter "e"}, Vmom = {u I i; = gn, g is the string "mt"}, Vp= {v \ v = hn, h is the letter "P"}, where n £ N.
We'll interpret the strings from the sets Vent,..., Vp as variables and use them for marking out, respectively, arbitrary entities, sets of entities, situations (in particular, events), moments of time, and SRs of propositions. Let
Vi = Vent U Vset U Ve U Vmom U Vp-, Xi = NuStiUUi,
and the mapping tpi from Xi U Vi into Tp{Si) be defined as follows:
z £ N tpi{z) = nat-, s e Sti tpi{s) = t s; zeUi=> tpi{z) = tl{z);
tpi{v) = [ent]; tpi{v) is the string {[ent]};
v G Vent v G Vset v e Ve =
tpi{v) = sit] v G Vmom tp{v) = mom; v E Vp ==> tpi (u) = Pro.
Let Fx = {Friends, Numb}, Cti = {Xi,Vi, tpi, Fl). Then it is not difficult to verify that Ct^ is a c.o.s. for the S.S.s. ^i. □
6 Systems of Quantifiers and Logical Connectives. Simplified Conceptual Bases
Presume that we define a s.s.s S of the form (1) and a c.o.s. Ct of the form (2) for S in order to describe an application domain. Then it is proposed to distinguish in the primary universe X two non-intersecting and finite (hence non-void) subsets Inti and Int2 in the following manner: we distinguish in St two sorts inti and int2 and suppose that for m = 1,2, Intm = {x e X \ tp{x) =
intm}- The elements of Inti correspond to the meanings of expressions "every", "some", "any", "arbitrary", etc. (and, may be, "almost every", etc.) in situations when these expressions are parts of the word groups in singular. The elements of Int2 are interpreted as semantic items corresponding to the expressions "all", "several", "almost all", "many", and so on; the minimal requirement is that Int2 contains a semantic item corresponding to the word "all". Let Inti contain a distinguished element ref considered as an analogue of the word "some" (when it is associated in a context with singular) in the sense "some quite definite" (but, possibly, unknown). If Ct is a c.o.s. of the form (2), d G X, d denotes a concept, and a SR of a text includes a substring of the form ref d (e.g., the substring sm chemist, if ref = sm, d = chemist), then we'll suppose that this substring denotes some concrete entity (but not an arbitrary one) which is characterized by the concept d. Let X contain the elements =,-1, A, V interpreted as the connectives "is identical with", "not", "and", "or", and the elements V and 3 interpreted as universal and existential quantifiers.
Definition 6.1 Let S be any s.s.s. of the form (1), Ct be any c.o.s. of the form (2) for S, ref G X, the different elements inti, int2,eq, neg, binlog,ext «'"ß some distinguished sorts from St \ {P}, and each pair of these sorts is incomparable with respect to the generality relation i—. Then the 7-tuple Ql of the form
[inti,int2, ref, eq, neg, binlog, ext) (3)
is called a system of quantifiers and logical connectives (s.q.l.c.) for S and Ct, iff the following conditions are satisfied:
1.	For each m = 1,2, the set Intm = {d £ X I tp{d) = intm} is a finite set; ref G Inti.
2.	{=,-., A,V,V,3} C X; besides, tp{=) = eq, ip(-i) = neg, tp{A) = ^p(V) = binlog, tp{\f) = tp{3) = ext.
3.	There are no such d e X \ (IntiU M2U {=,-1, A, V,V, 3}) and no such s G {inti, int2, eq, neg, binlog, ext} that tp(d) and s are comparable with respect to the relation <—.
4- For eàch u G {inti,int2, eq, neg, binlog, ext}, u and the sort P ("sense of proposition") are incomparable with respect to <—.
The elements of Inti and /nÌ2 are called intensio-nal quantifiers, ref is called the referential quantifier, V and 3 are called extensional quantifiers. □
Example 6.1 Let Si = {Sti, Pro^Gem) be the s.s.s. built in Example 3.1, Cti = {Xi,Vi,tpi, Fi) be the c.o.s. for considered in Example 5.1. Suppose that A2 is the alphabet consisting of small and capital Latin letters, ciphers 0,1, ..., 9, symbols
"." (dot),=,->,A,V,V,3; Sr = {qintl, qint2, eql, cnot, bin, exq}, Gen2 — Geni U {(s, s) | s € Är}, St2 = Sti U Sr, S2 = (5^2, Pro, Gena); U2 = {sm, all, =, -1, A, V, V, 3}
where the string sm is interpreted as the semantic item "some" associated with the singular; X2 = Xi \J Sr U U2. Since, obviously, S2 is an s.s.s., determine a mapping tp2 from X2UV1 into Tp{S2) in the following way:
seSr-.
deXi
veVi--
tp2{s) = t s; > tp2{d) = tpi{d)-, tp2{V) = tpiiv)-, tp2{sm) = qintl, tp2{all) = qint2, tp2{=) = eql, tp2{-^) = cnot, tp2{^) = tp2{\/) = bin, . tp2{y) ^ tp2{B) = exq.
Let
a2 = iX2,Vi,tp2,Fi),
Qli = [qintl, qint2, sm, eql, cnot, bin, exq).
Then it is easy to see that Ct2 is a c.o.s. for S2, and Qli is an s.q.l.c. for S2 and Q2; besides, the element sm is interpreted as the referential quantifier of the system Qli. □
Definition 6.2 A triple B of the form
{Sr a, Ql)	(4)
is called a simplified conceptual basis (s.c.b.), iff S is an arbitrary s.s.s., Ct is an arbitrary c.o.s. of the form (2) for S', Ql is an arbitrary s.q.l.c. for S and Ct, and
(xuy)n{v, (, ), (, ),&} = 0.
We'll denote by S{B), Ct{B), Ql{B) the components of an arbitrary s.c.b. B of the form (4). Each component with the name h of the mentioned systems of the forms (1), (2), (3) will be denoted by h(B). E.g., the set of sorts, the distinguished sort "sense of proposition", and the primary universe of B will be denoted by St{B),P{B),X{B). We'll interpret simplified conceptual bases as formal enumerations of
(a)	primary items needed for building SRs of NL-texts and for describing knowledge about the reality,
(b)	the information associated with these items and required for constructing such SRs of NL-texts and forming knowledge fragments.
□
Example 6.2 Let S2,Ct2,Qh. be, respectively, the s.s.s., c.o.s., and s.q.l.c. determined in Example 6.1. Then, obviously, the triple Bi = {S2,Ct2,Qh) is an s.c.b., and
St{Bi) = St2, P{Bi) ^Pro, X(5i) = X2, y(ßi) = Vi.
□
Proposition 1 The set of all simplified conceptual bases is non-void. □
Proof. The truth of this proposition follows immediately from the Example 6.2.
7 The Idea of Determining Three Classes of Formulas Generated by a Simplified Conceptual Basis
The notions introduced above allow us to determine for every s.c.b. B a set of formulas Formrs{B) convenient for describing structured meanings (SMs) of NL-texts and operations on SMs.
Definition 7.1 For arbitrary s.c.b. B, Drs{B) = X{B)uV{B)\j{',\ (, ), (, )}
(hence Drs{B) includes, in particular, the symbol "comma"), Drs+(B) is the set of all non-empty finite sequences of elements from Drs{B). □
The essence of the approach to determining conceptual formulas proposed in this paper is as follows. Some assertions P[0], F[l],..., P[10] will be defined; they are interpreted as the rules of building SRs of NL-texts out of the elements of the primary universe X{B), variables from V{B), and several special symbols, if B is an s.c.b. destined for describing a considered domain. For any s.c.b. B, the rule F[0] provides an initial stock of formulas.
Definition 7.2 Denote by F[0] the assertion
"If d e X[B)uV{B),teTp[S{B)),
tp = tp{B),tp{d) = t, then d £ L{B) and the string of the form dkt belongs to T°{B)"
Let B be any s.c.b., L{B) and T°(jB) be the least sets defined by the assertion -P[0], Lnro[B) = L[B) (the designation "Lnr" is deciphered as "L" numbered). Then, obviously,
Lnro{B) = X{B) U V{B), T^{B) = {6 | b = dkt, de X{B)UV{B), teTp{S{B)), t^tpid)}.
□
E.g., for the s.c.b. Bi defined in Example 6.2, T°(jBi) includes the formulas
J.Price kins., Friends k {{ins, {ins})}.
If 1 < i < 10, then for any s.c.b. B and for A; = 1,..., i the assertions P[Q], P[l],..., P{i] determine by conjoint induction some sets of formulas
LnniB) C Drs+{B), T^{B), Tnr}{B),..., Tnr\(5) and Ynr}(ß),..., Ynr\{B).
The set Lnri{B) is considered as the main subclass of formulas generated by P[0],..., P[i]. The formulas from this set are destined for describing SMs of NL-texts.
If 1 < fc < i, then the set Tnrf{B) consists of strings of the form bkt, where b G Lnri{B), t G Tp{S{B)), and b is obtained by means of applying the rule P[k] to some simpler formulas at the final step of an infererice. It should be added that the inference of b out of the elements of X{B) and V{B) may use any of the rules F[0],..., P[k],..., P[i]; these rules may be applied arbitrarily many times. If s.c.b. B is chosen to describe some domain, then b can be interpreted as a SR of a text or as a fragment of a SR of a text pertaining to the selected domain. In this case, t may be considered as the designation of the kind of the entity characterized by such SR or by a fragment of SR. Besides, t may qualify b as a SR of a narrative text.
E.g., it will be shown below that Lnr^{B{)^ ..., LnriQ[Bi) include the formula Elem{P.Somov, Friends{J.Price)), and Tnr^{Bi), ..., TnriQ{Bi) include the formula Elem{P.Somov, Friends{J.Price)) & Pro, where Pro is the distinguished sort P{Bi) ("sense of proposition").
Each string c € Ynrf{B), where 1 < k < i, can be represented in the form c — ajk ... ka^kb, where ai,...,am,ò G Lnr{{B). Besides, there is such t e Tp{S{B)) that the string bkt belongs to Tnrf{B). The strings ai,..., «m are obtained by employing some rules from P[0], ...,F[i], and b is constructed out of "blocks" ai,..., a^ (some of them can be a little bit changed) by applying just one time the rule P[k]. The possible quantity of "blocks" ai,..., am depends on k. Thus, the set Ynrf{B) fixes the result of applying the rule P[k] just one time. E.g., we'll see below that the sets Ynrl{Bi),.. .,YnrfQ{Bi) include the formula
Elem & P.Somov & Friends{J.Price) k Elem[P.Somov, Friends{J.Price)).
Let for i = 1,...,10,
Tri{B) = T°(J5) U TnrjiB) U ... U Tnr\{By,
Yri{B) = Ynr}{B) U ... U YnrÌ{B);
Formi{B) = Lnri{B) U Tri{B) U Yri{B)
We'll interpret Formi[B) as the set of all formulas generated by the s.c.b. B. This set is the union of three classes of formulas the principal class being Lnri[B). The formulas from these three classes will Jje called ^-formulas, i-formulas, and y-formulas, respectively.
The class of t-formulas is needed for assigning a type from Tp{S{B)) to eacli b e Lnri{B), where i — 1,...,10. The class of y-formulas is considered on technical reasons: for describing (not in this article) an algebra of partial operations on n-tuples (n > 1) formed out of SRs of NL-texts and out of fragments of SRs. It is important that for ž = 0,..., 9, Lnri{B) C Lnri+i{B). The formal language Lnrio{B) is a subset of some standard K-language in stationary basis with empty marking-out of variables. The class of such languages is defined in Fomichov (1988). That's why ^-formulas will be called also K-strings.
The pairs of the form {B, Rls), where B is a s.c.b., Rls is the set consisting of the rules P[0],...,P[10], will be called restricted K-calculuses.
8 The Use of Intensional Quantifiers in Formulas
The rule P[l] allows us to join the intensional quantifiers to the designations of notions (concepts).
Definition 8.1 If B is an s.c.b., then for m = 1,2
Intm{B) = {q e X{B) I tp{q) = intm{B)}, Int{B) = Inti[B) U Int2{B), Tconc{B) = {te Tp{S{B)) \ t has the beginning t} U Spectp
where (see Definition 4-1)
Spectp = {[t ent], [t oh], [f conc]].
□
Using the rules P[0] and P[l], we can build some strings of the form qd, where q G Int{B), d G X{B), tp{d) e Tconc{B). E.g., if Bi is the s.c.b. determined in Example 6.2, then we can construct the ^-formulas sm per son, sm.tour.gr, sm concept, allperson, alltour.gr, all concept generated by Bi-
It is possible also to build more complex strings of the form qa, where q is an intensional quantifier, with the help of the rule P[l] using the rule P[8] (see Section 11) and, may be, some other rules (besides the rules P[0] and P[l]) for constructing a. E.g., it will be possible to build the
^-formulas smtour.gr * (Numb, 12), alltour.gr * {Numb, 12) in the s.c.b. Pj. These formulas are interpreted as SRs of the expressions "some tourist group consisting of 12 persons" and "all tourist groups consisting of 12 persons".
The transition from a Mormula c designating a notion to some ^-formula q c, where q is an intensional quantifier, is described with the help of some special function h.
Definition 8.2 Let B be any s.c.b., S = 5(P).
Then the mapping h : {1, 2} x Tp(S) -> Tp(S) is determined as follows: if u e Tp{S) and the string t u belongs to Tp{S), then
h{l,^u) = u,	u) = {u};
h{l, [t ent]) = [ent],	h{2, [t ent]) = {[ent]};
h{l,[tob])^[ob],	hi2,[tob]) = {[ob]}-,
h{l, [t conc]) = [conc],	h{2, [t conc]) = {[conc]}.
From the standpoint of building SRs of texts, the mapping h describes the transformations of the types in the course of the transition:
(a)	from the notions "a person", "a tourist group" to the SRs of expressions "some person", "arbitrary person", "some tourist group", "arbitrary tourist group", etc. (in case the first argument of h is 1 ) and to the SRs of expressions "all people", "all tourist groups", etc. (if the first argument of h is 2);
(b)	from the notions "an entity", "an object", "a concept" to the SRs of the expressions "some entity", "arbitrary entity", "some object", "arbitrary object", "some concept", "arbitrary concept", etc. (luhen the first argument of h is 1) and to the SRs of the expressions "all entities", "all objects", "all concepts", etc. (when the first argument of h is 2).
□
Definition 8.3 Denote by P[l] the assertion
"Let a e L{B)\ V{B), u e Tconc{B), k e {0,8} and the string aku belong to T''{B). Let m e {1,2}, q G fntm, t = h{m,u), and b be the string of the form qa. Then b G L(B), the string bk.t belongs to T^ [B), and the string of the form qkakb belongs to Y^{B)". □
Example 8.1 Let B be the s.c.b. Bi considered in Example 6.2; L{B), r°(S), T^{B), Y^{B) be
the least sets of formulas jointly, defined by the assertions P[0] and P[l]. Then it is easy-to verify that the following relationships take place:
person G L{B)\V{B), per son k f ins eT°{B), sm G Inti (B),
h(l, t ins) = ins smperson G sm person k ins G T^(B), sm k person k. smperson G all G Int2{B),
h{2,'{ ins) = {ins} allperson G L{B), all person k {ins} G T^{B), all k person kail person G Y^{B); tour.gr G L{B)\V{B), tour.gr k t {«tìs} S T°(B), h{l, t {ms}) =: {ins}, /j.(2,t {ms}) = {{ms}} smtour.gr, alltour.gr G L{B), sm tour.gr k {ins}, all tour.gr k {{ins}} G T^{B), sm k tour.gr k sm tour.gr,
ali ktour.gr kalltour.gr G Y^{B); concept G L{B)\V{B), concept k [t cone] G T^{B), /i(l, [t conc]) = [corac], h[2, [t conc]) = {[conc]} sm concept, all concept G L{B), sm concept k [conč],
ali concept k {[conc]} G T^{B), sm k concept k sm concept,
all k concept k all concept (B).
□
Notation. We'll define in what follows the rules F[2],...,P[10]. For k = 1,...,10, the rule P[k] asserts that some formula b belongs to L{B), some formula bkt belongs to T''{B), where t G Tp{S{B)), and some formula c belongs to Y''{B). If 1 < i < 10, ß is any s.c.b., then the rules P[0], F[l],..., determine by conjoint induction the sets of formulas
LiB),T''iB),T\B),...,T{B), Y'iB),...,Y'iB).
Denote these sets by Lnri{B),T°{B),Tnr}{B), ..., Tnri{B), Ynr} (B), ..., Ynr] (5) and denote the family consisting of all these sets by Globseti{B).
Let n > 1, Zi,...,Zn G Globseti{B), i^i G Zi,...,'Wn, G Zn. Then, if'these relationships for the formulas wi,...,wn are the consequence of employing some rules	. ..,	where
m > 1, we'll denote this fact by the expression of the form B{li,.. .,1^) ^ wi e Zi,..., Wn 6 Zn. The sequence £i,.. .,£m may contain the repeated numbers.
In the expressions of the kind we'll often omit the symbol B in the designations of the sets Zi,...,Zn, besides, we'll use the expressions Wi,W2 G Zi, W3,W4,WS G Z2, and so on. E.g., it was shown in Example 8.1 that
ßi(0,l) allperson, alltour.gr G Lnri[Bi), allperson k {ms}, alltour.gr k {{ms}} G
TnrliBi), ali k person k ali person G Ynrl{Bi), ali ktour.gr kalltour.gr G Ynr\{Bi).
The expression
Si (0,1) ^ ali k person k ali per son G YnrliBi)
is equivalent to the expression
Si (0,1) all k person k all person G Ynrl.
9 The Use of Relational Symbols and Marking Formulas
The rule P[2] allows us, in particular, to construct K-strings of the form /(«i,..., a„), where / is the designation of a function with n arguments ai,..., The rule F[3] is destined for building K-strings of the form (ai = «2)1 where ai and a2 denote the entities characterized by types comparable with respect to the relation h. Using consecutively P[2] and P[3], we can build the K-strings of the form (/(«i,..., a«) = b), where b is the value of / for ai,..a„.
Let's recall that for arbitrary s.s.s. 5, the set of main types
Mtp{S) = Tp{S) \ {[t ent], [t ob], [t conc]} (according to the definitions 4.1 and 4.2).
Definition 9.1 Let B be any s.c.b., S = S{B). Then:
(a)	Ri{B) = {d e X{B) \ there is such t e Mtp{S) that t has no beginning '(' and tp{d) is the string of the form {f}};
(b)	for arbitrary n > 1,
= {d e X{B) I there are such ti,.. .,tn G Mtp{S) that tp{d) is the string of the form {{ti,... ,tn)}}\
(c)	for arbitrary n > 1,
Fr,{B) = F{B)nRn+i{B).
Ifn>l, then the elements of Rn{B) will be called n-ary relational symbols, and the elements of Fn[B) will be called additionally nary functional symbols.
□
It is easy to show that for arbitrary s.c.b. B and arbitrary k,m > 1, it follows from k ^ m that Rk{B)nR^{B) = $.
Definition 9.2 Denote by P[2] the assertion "Let
n>l, /€ F„(5), tp^tp{B), te Mtp{S{B)),
tp{f) = {iui,...,Un,t)}-,
for
j = l,...,n, 0 <kj < i, z j e Mtp{S{B)),aj e L{B),
the string a j & z j belongs to T^^ (B) ; if a j ^ V{B), then uj h zj (i.e. Zj is a concretization of Uj); if aj G V{B), then uj and Zj are comparable with respect to h.
Let b be the string of the form /(ai,..., a„). Then
beLiB),bkteT\B), /&ai& ...ka^kbeY^By.
□
It should be recalled before formulating the next definition that, according to the definitions 6.1, 6.2, the symbol = is an element of the primary universe X(ß)ifor arbitrary s.c.b. B.
Definition 9.3 Denote by P[3] the assertion "Let
ai,a2eL{B), ui,u2e Mtp{S{B)),
ui and U2 are comparable with respect to the relation h.
Let for m = 1,2, 0 < fcm < i, «m & Um € P be the sort "sense of proposition" of B, b be the string (ai = 02). Then
òei(s), bkPeT^{B),
and the string ai & = & «2 & ^ belongs to the set Y^{B)". □
Example 9.1 Let Bi be the s.c.b. built in Example 6.2; i = 3;
öl = Friends{J.Price),
Ò2 = Numb{Friends{J.Price)),
Ò3 = {Numb{Friends{J.Price)) = 2),
Ò4 = Numb {alI concept),
Ò5 = Numb {alI chemist),
be = {all chemist = Mi),
Ò7 = Numb{Mi).
Then one can easily verify the validity of the following relationships (taking into account the agreement in Section 8 about the used notation):
Bi(0) Friends k {{ins, {ins})} e
J.Price kins, Numb k {{{[ent]}, nat)} e T°-,
Bi{l,2)^bik{ins} eTnrl;
Bi{0,2,2)=^ b2 e Lnr3,b2knat e Tnrl,
Numb & & Ò2 G Ynrl]
Bi (0,2,2,3) Ò3 G Inra, Ò3 k Pro G Tnrf;
ßi(0,l,2)^ Ò4,Ò5 G Lnrs,
Ò4 k nat, Ò5 k nat G Tnr^-,
Bi (0,1, 3) 06 G Lnrs, b^ k Pro G Trerf ;
Ml G V {Bi),
tp{Mi) = {[ent]} Bi (O, 2) Ò7 G Lnrs, brk nat G Tnr|.
□
Definition 9.4 Denote by P[4] the assertion "Let
n> 1, reRn{B)\F{B), Ui,...,Un£Mtp{S{B)), tp = tp{B),
tp{r) be the string
for
{(ui,...,w„)}or{ui}forn = 1;
j = l,...,n, 0< fcj <i, Zj e Mtp{S{B)), a, e L{B),
the string a j & z j belongs to T^i {B) ;
if üj ^	then Uj h zj; if aj £ V{B), then
u j and z j are comparable with respect to the relation h.
Let b be the string r(ai,..., a„), P = P{B) be the sort "sense of proposition" of B. Then
b e L{B), bkPeT^{B), and r&ai& ...kan^beV^iBy.
□
Example 9.2 Let Bi be the s.c.b. considered in Example 6.2;
i = 4;
bs, = Less(l{)QQQ^Numb{all chemist)), bg — I«ess(50000, Numb{allconcept)), òjo = Elem[person, all concept), òli = Elem{R.Scott, Friends{J.Price)), bi2 = Subset{all chemist, all per son), 6i3 = Knows{P.Somov,
{Numb{Friends{J.Price)) = 2)), òi4 = Cause{sm exph, sm fire{), Ò15 = iess(10000, Numb{Mi)).
Taking into account the definition of the c.o.s. Ct{Bi) (see Example 5.1) and employing the rules P[0],..., -P[4], we have, obviously, the following relationships:
5i(0,l,2,3,4)^Ò8,...,òi5GLnr4,
Ò8 & Pro,..., 6i5& Pro e Tnrt
Subset & all chemist & all person & Subset{all chemist, all person) G Ynr\.
The rule P[5] is destined, in particular, for marking by variables in SRs of texts: (a) the descriptions of diverse, entities mentioned in a text (physical objects, events, notions, etc.), (b) the fragments being SRs of sentences and of larger parts of texts to which a reference is given in any part of a text. □
Definition 9.5 Denote by P[5] the assertion "Let
aeL{B)\V{B), 0<k< i, k te Mtp{S{B)), akte T^{By, V e vìb), u e Mtp{S{B)), vkueT°{B), uh t,
V be not a substring of the string a. Let b be the string of the form a : v. Then
beL{B), bkteT^B), akvkbe Y^{B)".
□
Example 9.3 Consider, as before, the s.c.b. Bi built in Example 6.2.
Let i = 5, Ol = Ò3 = (Numb(Friends{J.Price)) = 2),ki = 3,ii = Pro = P{Bi). Then, obviously, aiktie Tnr^{Bi).
Suppose that ui = Pi, wi = Pro = P(Pi). Then, according to the definition of Pi,
besides, ui h ti (since ui = ii), and v\ is not a substring of ai.
Let Ò16 = {Numb{Friends{J.Price)) = 2) : Pi. Then, according to the rule P[5],
Ò16 e Lnr^{Bi), b.ekPro e TnrUBi), aikvi k bi6 e YnrliBi).
Let
bir = sm expll : ei, bis = sm firel : el, ^19 = Cause{sm expll : ei, sm firel : e2), Ò20 = Friends[N.Cope) : M^
Then it is easy to see that
Pi (0,1, 5) ^017, Ò18 € Inrs,
buk sit, bis^sit eTnrl',
Pi (0,1, 5,0,1, 5,4) Ò19 G Lnr{h),
big k Pro e Tur j,
Cause k bn k bis ^ biQ G Ynr^-,
Pi(0,2,5) Ò20 G Lnrs,
b2ok{ins} e Tnrl,
Friends{N.Cope) & M2 & 620 G Ynr^.
It should be added that the rule P[5] is very important for building SRs of discourses. It allows us to form SRs of discourses in such a manner that the SRs reflect the referential structure of discourses. The examples of the kind may be found, in particular, in (Fomichov 1992, 1993b, 1994). □
10
The Use of Logical Connectives "not", "and",
"or"
The rule P[6] describes the use of the connective ^ ("not").
Definition 10.1 Denote by P[6] the assertion
"Let a e L{B), t e Mtp{S{B)), 0 < k < i, k be not in the set Forbid, a ht £ T^(jS), b be the string ^a. Then b € L{B), bkt € T®(S), the string of the form k a Sib belongs to Y^{B) □
In this assertion, the expression Forbid designates the set consisting of several forbidden values for k. We'll use in this article the set Forbid = {2, 5,10}. This means the following: if an ^-formula a is inferred in some way with the help of some rules from the list F[0], F[l],..., P[i], then the rule P[2] or P[5] or F[10] can't be applied at the last step of the inference.
Example 10.1 If Bi is the s.c.b. determined in Example 6.2, i = 6, Forbid = {2, 5,10}, then one can easily verify that
Si (0,6) ^ -ichemist G Lure, -'Chemist k f ins £ T|; Bi (0,6,4,4) Know s {P. Somov, Isl{N.Cope,-ichemist)) € Lnrß-, Bi (0,4,4,6) ^Knows{P.Somov, Isl{N.Cope, biologist)) £ Lnr^.
We'll interpret the built ^-formulas as possible SRs of the NL-expressions "not a chemist", "P.Somov knows that N.Cope is not a chemist", and "P.Somov doesn't know that N.Cope is a biologist" .
The rule P[7] describes the manners to use the connectives A ("and") and V ("or") while constructing SRs of NL-texts. □
Definition 10.2 Denote by P[7] the assertion "Let
n > 1, te Mtp{S{B)), for m = l,...,n, Q <km<i-, am e L{B), a^kt e T'^-'iB), s e {A,V}.
Let b be the string of the form
Then
□
{aisa2s ... sün).
beL{B), bkteT^{B), skaik ... kankb eY^B)".
Example 10.2 Suppose that Bi is the s.c.b. considered in Example 6.2, i = 7, Forbid = {2,5,10}. Let
bi = {R.Scott A N.Cope),
62	= [chemist V biologist),
63	= {{Numb{Friends(J.Price)) = 2) : Pi A Know s {P. Somov, Pi) A
-I Know s {P. Somov, Isl{J.Price, [chemist ybiologist)))), Ò4 = Know s[P.Somov, EI em [[R.Scott A
N.Cope), Friends[J.Price))), Ò5 = [Elem[R.Scott, Friends[N.Cope) : M2) A ->Elem[P.Somov, M2)).
It is not difficult to show that
bi k ins, Ò2 & t ins e Tnrl,
Pi (0,2,2,3,5,4, 7,4,4,6,7) Ò3 S Lnrj,
Pi (0,7, 2,4,4) ^ Ò4 e Lnrj,
bik Pro e Tnr^-,
Pi (0,2, 5,4,4, 6,7) e Lnrr,
Ò5 k Pro e Tnrl.
□
11
Building Compound Designations of Concepts and Objects. The Use of Extensional Quantifiers. Representing n-tuples
Consider the rule P[8] destined for constructing compound representations of concepts, e.g., such
as
text.book * [Field, biology), concept * [Name.conc, ^^molecule"), tourist.group + [Number.of .persons, 12)[Composition, [chemist y biologist)).
Together with the rule P[l] and other rules, it will enable us to build compound designations of things and sets of things in the form qdes, where q is an intensional quantifier (see Section 6), des is a compound designation of a notion formed with the help of P[8] at the last step of the inference. E.g., the formulas
all person * {Age, IS.year),
some person * {Age, 18.year),
some tour ist.group * {Number.o/.persons, 12)
etc. It should be recalled that for arbitrary s.c.b. B (according to Definition 8.1),
Tconc{B) = {te Tp{S{B)) \ t has the
beginning t} U Spectp, where Spectp = {[t ent], [f ob], [t cono]}
Each element c of the primary universe X{B) such that tp{c) e Tconc{B) is interpreted as a designation of a concept.
The set R2ÌB) consists of binary relational symbols (some of them may correspond to functions with one argument); F{B) is the set of functional symbols. The element ref = ref{B) from X{B) is called the referential quantifier (see Section 6) and is interpreted as a semantic item corresponding to the meaning of the word "some" in expressions in singular ("some book", etc.). P{B) is the distinguished sort "sense of proposition" of s.c.b. B.
Definition 11.1 Denote by P[8] the assertion "Let
a e X{B), tp = tp{B), t = tp{a), t e Tconc{B), P = P{B), ref = ref{B).
Let n > 1, for m = 1,.. .,n, rm € R2{B), Cm be the string of the form ref a, and d^, h^n £ L{B);
if rm e R2{B) n F{B), then let h^ be the string of the form {rm{cm) = dm) and, besides, hmkPeT^{B);
ifvm e R2{B)\F{B), then let
hm = rm.{cm,dm) and h^ 'k P £ T'^{B).
Let b be the string of the form
a* (ri,cži) ...(r„,d„).
Then
beL{B), bkteT^{B), akhik ... khnkb EY^{B)".
□
The rule P[8] is a weakened formulation of the rule Ps from (Fomichov 1988).
Example 11.1 Suppose that Bi is the s.c.b. defined in Example 6.2, i = 8, Forbid = {2,5,10}. Then consider a possible way of constructing the formulas 61 and Ò2 (defined below) corresponding to the notion "a tourist group consisting of 12 persons" and to the expression "some biologist from a tourist group consisting of 12 persons". We'll use for this the denotations introduced in the end of Section 8. Let
a = tour.gr, t = tp{a) = t {ms}, P = Pro, ref = sm, n = 1, ri = Numb, cy = ref a = sm tour.gr, di = 12,
hl = (ri(ci) = c/i) =
{Numb {smtour.gr) = 12).
Then
Si(0,1,2,3)=» /11 e Lnr3{Bi), hi k Pro e Tnrl{Bi).
Let bi = a * {ri, di) = tour.gr * {Numbj 12). Then it follows from the rule P[8] that
bi e Lnr8{Bi),
61 & t {ins} e Tnrl{Bi).
Let Ò2 = sm biologist*{Elem, smtour.gr*{Numb, 12)). Then
ßi(0,1,2, 3,8,1,1,4,8,1) e Lnrs{Bi), b2kins e Tnrl{Bi).
□
Example 11.2 Proceeding from the same assumptions as in Example 11.1, consider a way to build a possible SR of the phrase "N.Cope has included J.Price into a tourist group consisting of 12 persons". Let
03= {Includel{N.Cope, J.Price, smtour.gr* {Numb, 12), mil)A Before{mtl, Now)).
Then
ßi(0,l,2,3,8,l,4,4,7)=» 63 € Lnr8{Bi), b3kPro£Tnrl{Bi).	^
The rule P[9] describes how to join extensional quantifiers V and 3 to the SRs of propositions. E.g., we'll be able to build a SR of the sentence "For each country in Europe, there is a city with the number of habitants exceeding 50000" in the form
\lxi{country + {Location, Europe)} 3x2{city)ilLocation{x2i a;i)A Less(50000, Numb{Habitants{x2)))) ■
Here the expression country * {Location, Europe) restricts the domain where the variable ."Ci can take values.
This rule mentions the mapping h : {1,2} x Tp{S{B)) Tp{S{B)) introduced in Definition 8.2, a subset of types Tconc{B) (see Definition 8.1), and the relation h (see Definition 4.4). The expression Numb used in the rule P[9] designates the set consisting of numbers of some rules from the list P[l],..., P[i]. In this paper, we'll consider for each 1 < « < 10, the set Numb = Numb{i) = {3,4, 9} fl {1,..., i). □
Definition 11.2 Denote by P[9] the assertion "Let
qexe{i,3}, a^L{B) \ V{B), P = P{B), k e Numb, akPe T'=(P), t e Mtp{S{B)), veV{B), tp = tp{B), tp{v) = t,
V be a substring of a and, besides, a don't include the substrings of the form s v, where s is one of the symbols "i", '3".
Besides, let des G L{B) \ V{B), v be not a substring of des, u be a type from Tconc{B); for some m G {0,8}, the string des k u belong to T"'{B); w = h{l, u), t\-w.
Let b be the string of the form qex v {des) a. Then
beL{B), bkPeT^{B), qexkvkdeskakbe Y^{B)".
□
Example 11.3 Construct a possible SR of the phrase T = "There are such a moment mtl and
a tourist group M3 consisting of 12 persons that N.Cope included J.Price into the group M3 at the moment mtl". Let Pi be the s.c.b. determined in Example 6.2,
i = 9,
Por&if/= {2,5,10}, Mimò = {3,4,9}, Ò4 = 3 mtl {mom) 3 M3 {tour.gr * {Numb, 12))
{Includel {N.Cope, J.Price, M3, mil) A Before {mtl, Now)).
Then it is easy to show that
Bi (0,4,4,7,0,1, 2,3,8,9,9) 64 G Im-g, Ò4 k Pro G Turi
The formula Ò4 is to be interpreted as a possible SR of T. □
Remark. If a basis B is chosen in such a way that the primary universe X{B) includes an element conc.ent having the type tp{conc.ent) = [f ent] (corresponding to the conceptual item "an entity"), then for the formulas Bv{conc.ent)a and MV{conc.ent)a the domain for v is the set of all entities of the considered application domain.
The rule P[10] is destined for building representations of n-tuples {n > 1) in the form {ai,..., an), where ai,..., an denote the components of a n-tuple.
Definition 11.3 Denote by P[10] the assertion "Let
n>l,
for m— 1,..., n, am e L{B), Um e Mtp{S{B)),
0 < km <i,
amkumeT'"-{B).
Let t be the string of the form {ui,..b be the string of the form {a\,..., an). Then
beL{B), bkteT^°{B), aik ... kankbe
□
Example 11.4 Let Bi be the s.c.b. from Example 6.2, i = 10, Forhid = {2,5,10}, Numb = {3,4,9}, and h = {Elem{xA,Mi) = {[x^ = {sm int : Xi, sm int : X2)) A (Z/ess(a;i, X2) V {xi = X2)))), where the string Ò5 is to be interpreted as a possible formal definition of the binary relation "Not greater" on the set of integers. Then one can easy show that
(0,4,1,5,10,3,4,3,7,7,3) Ò5 G Lnrw, h^kPro G Trarfo
□
12 Restricted K-Calculuses, Restricted Standard K-Languages, and Mathematical Investigation of Their Properties
Definition 12.1 Let B be any s.c.b.,
Forbid = {2,5,10}, 1 < i < 10, {3,4,9} n{l,...,0
and the sets of strings
L{B),
are the least sets determined jointly by the rules .P[0],P[l],...,F[i]. Then denote these sets, respectively, by
LnniB),
T%B), Tnr](5),..., Tnrf (5),..., Tnr\{B),
Ynr}{B),...,Ynr>i{B),...,Ynr\{B)
and denote the family consisting of all these sets by Globseti{B).
Besides, let for i = 1,..., 10,
Tri{B) = T°{B) U Tnr}{B) U ...	(5) DTnriiB),
Yn(B) = YnrI(B) U .. .UYnrj(B),	(6)
Formi (B) = Lnn (B) U Tn (B) U Yn (B).	(7)
Definition 12.2 If B is any s.c.b., then
Lrs{B) = Lm'io(ß), Trs{B) = Tr.oB), Yrs{B) = yrio(S), Formrs{B) = Formio(S), Krs{B) = {B,Rls),
(8) (9) (10)
where Rls is the set consisting of the rules P[0],P[1],...,P[10].
The ordered pair Krs[B) is called the restricted K-calculus in the s.c.b. B; the elements of the set Formrs{B) are called inferred formulas of the restricted K-calculus Krs{B). The formulas from the sets Lrs{B), Trs{B), and Yrs{B) are called, respectively, Mormulas, t-formulas, and y-formulas. The set of £-formulas Lrs{B) is called the restricted standard K-language in the s.c.b. B. □
Proposition 2 If B is an arbitrary s.c.b., then:
(a)	the set Lnro{B) 0;
(b)	/ori = l,...,10,
Lnri^i{B) C Lnri{B).
a
Proof, (a) For arbitrary s.c.b. B, X{B) includes the non-empty set St{B) (see the Definitions 3.1 and 5.1). Then it follows from the rule P[Q] that St{B) C Lnro{B). (b) The structure of definition 12.1 and assertions P[0], F.[l],..., P[i] shows that the addition of the rule P[i] to the list P[0],...,P[i — 1] either enlarges the set of ^-formulas or doesn't change it (if the set of functional symbols F{B) is empty, then Lnri{B) = Lnr2{B)). Thus, we have the validity of Proposition 2b.
Taking into account Proposition 2, it is easy to see that Sections 8-11 provide numerous examples of i- ,t-, and «/-formulas of restricted K-calculuses and, as a consequence, of the strings of restricted standard K-languages.
Proposition 3 If B is any s.c.b., then
Lrs{B),Trs{B),Yrs{B) ^ (D.
□
□
Proof. Let B be as.c.b. Then Lrs{B) / 0 according to the Proposition 2 and the relationship (8). It follows from Definition 5.1 that the set of variables V{B) includes a countable subset of such variables v that = [ent\. Let vi and V2 be two variables from that subset, b be the string {vi = V2), c be the string & = kv2k{vi = V2), P = P{B) be the sort "sense of proposition" of B. Then, according to the rules P[0] and P[3], bkPe Tnrf{B), c € Ynrf{B) for each i = 3, ...,10. Hence [with respect to (5), (6), (9),(10)] Trs{B) and Yrs{B) are non-empty.
Proposition 4 If B is a s.c.b., then:
(a)	if w £ Trs{B), then w is a string of the form akt, where a € Lrs{B), t e Tp{S{B)), and such a representation depending on w is unique for each w;
(b)	if y e Yrs{B), then there are such n > 1 and ai,...,a„, b G Lrs{B) that y = ai & ... & a„ &; Ò; besides, such a representation depending on y is unique for each y.
a
Proof. The structure of the rules P[0],..., P[10] and the definitions 6.2, 12.1, 12.2 immediately imply the truth of this proposition.
Proposition 5 If B is a s.c.b., d G ^(ß) U V{B),. then there are no such k, where 1 < k < 10, n > 1, and oi,..., a„ G Lrs{B) that
aik ... kankdeYnr'lo{B) (11)
Proof Assume that there are such k € {1,...,10}, n > 1, 01,...,a„ e Lrs{B) that the relationship (11) takes place. For arbitrary m G {1,..., 10}, the set ynrJ^(S) may include a string aik ... ankd, where d contains no occurrences of the symbol k, only in case d is obtained out of Ol,..., a„ by means of applying one time the rule P[m]. It follows from the structure of the rules P[l],..., P[10] that d must contain at least two symbols. But we consider the elements of the set X{B) U V{B) as symbols. Hence we get a contradiction, since we assume that deX{B)U V{B). Q.E.D.
It would be not difficult also to prove the following two propositions, but this goes beyond the scope of the present paper.
Proposition 6 Let B be an s.c.b., b G Lrs{B) \ {X{B) U V{B)). Then there is one and only one such a sequence (fc, n, a^,..., a„), where 1 < k < 10, n > 1, ai,...,a„ G Lrs{B), that aik ...kankbeYmioiB). □
Proposition 7 Let B be a s.c.b., b G Lrs{B). Then there is one and only one such type t G Tp{S{B)) thatbkt G Trs{B). □
This proposition is a direct consequence of Proposition 5 and Proposition 6. Proposition 7 enables us to define a mapping tl from Lrs{B) into Tp{S{B)) in the following way: for each b G Lrs{B), tl{b) is equal to such t G Tp{S{B)) that bkt e Trs{B). Obviously, for each b G X{B) U V{B), tl{b) = tp{b). Hence each string of a restricted standard K-language will be associated with some type.
13 Expressive Possibilities of Restricted Standard K-Languages
For formulating and explaining numerous definitions in preceding sections, very simple examples were deliberately chosen. The collection of these examples is far from demonstrating the real power of the constructed model. That's why let's consider several additional examples illustrating some important possibilities of restricted standard K-languages.
Suppose that E is some NL-expression, Semp is a string of the restricted standard K-language in some s.c.b. and, besides, Semp is a possible SR of E. Then we'll say that Semp is a K-representation (KR) of E.
Example 13.1 Let Ti = "Somebody hadn't switched off a knife-switch. As a result, the Laboratory No. 11 was burnt down". Then the string
(3 mtl {mom * {Before, Now)) -t Switch.off {smperson •. xi, sm knife-switch : X2,mtl) : PiA Descr.sit{Pi, ei)A 3 mt2 {mom * {Before, Now)) Burn.down{sm lab* {No, 11) : xg, mt2) : P2 A Descr.sit{P2, 62) A Cause{ei, 62))
is a possible KR ofTi., Here Pi and P2 are variables of the type being the distinguished sort "sense of proposition" ; ei and 62 are to be interpreted as variables designating, respectively, two situations (events) and having the type being the sort "situation". □
Example 13.2 Let T2 = "Yves said to Mary that he was occupied with rowing and painting. The new for Mary was that Yves was occupied with painting". Then we may consider the formula
3 mtl {mom * {Before, Now)) {Say{{Agent, some person* {First name, "Yues") : yi), {Addressee, some person* {First name, "Mary") : y2), {Moment, mtl), {Info, Be.occupied{yl, {rowing A painting)) : Pi)) A New{y2, Pi, mtl. Be.occupied{yi, painting)))
as a possible KR of r2. □
Example 13.3 Let T3 = "The notion "a molecule" is used in physics, chemistry, and biology". One can determine such an s.c.b. B that the set of sorts St{B) includes the elements areal and string, the primary universe X{B) includes the elements
areal, string, concept, "molecule''\ Is.used, Name.conc, physics, some, chemistry, biology, tp{concept) = [l conc], tp{"molecule") = string,. tp{physics) = tp{chemistry) =
tp{biology) = areal, tp{Is.used) = {{\conc\, areal)}, tp{Name.conc) = {{\conc\, string)}.
some is the referential quantifier of B, Is.used and Name.conc are not functional symbols. Let
51	= Name.conc{some concept, ^^molecule'^),
52	= concept * {Name.conc, ^^molecule")^
53	= Is.used{some concept *
{Name.conc, "molecule"), {physics A chemistry A biology)).
Then
B{0,1,4) Si e Lrs{B); B(0,1,4,.8)=^S2 € Lrs{B)-, ß(0,1,4, 8,1,7,4)^ S3 € Lrs{B).
The formula S3 is a possible KR of T3. □
Example 13.4 Let T4 = "Teenager is a person having the age from 12 to 19 years". Let u be the string
{{teenager = person * {Age, xi)}A -'Less{xi, 12.year)A -I Greater{x 1,19.year)).
Then u is a possible KR of r4. □
Example 13.5 Nebel and Peltason (1991) formulate the following definition: a small and medium enterprise (sme) is a company with at most 50 employees. This definition may have the KR
Definition{sme, V^i {companyl) (/sl(a;i, sme) =
-1 Greater {Number {Employèes{xi)), 50)))).
or the KR
{{sme = companyl * {Description, Pi)) A {Pi = ^ xi{companyl) {Isl{xi, sme) =
-I Greater {Number {Employees{xi)), 50))))
□
Example 13.6 We can build complex descriptions of diverse objects and sets of objects. E.g., we can build the following KR of the description of "Informatica" :
some int.se. j our nah [Title, ''Informatica") [Country, Slovenia) [City, Ljubljana) [Fields, [artif.intelA cogn.science databases)) : k225
where k225 is the marli of the knowledge module with the data about "Informatica". □
Example 13.7 In a similar way, we can construct a knowledge module stating the famous Pythagorean Theorem and indicating also its author and field. For instance, such a module may be the following expression of some restricted standard K-language:
some textual.object* [Kind, theorem) [Fields, geometry) [Authors, Pythagoras) [Meaning, Vxi [geom)yx2[geom) \lxz[geom)'ixi[geom) If-then[[Isl[xi, right-triangle) Hypotenuse[x2, Xi)h
[Square[Length[x2)) = Sum[Square[Length[x3)), Square[Length[x4)))))) : A;81.
□
Example 13.8 The meaning of the question "Is Ghent located in Belgium ?" may be represented by the Mormula
Questionl[Location[Ghent, Belgium))
where Questioni is a special unary relational symbol (r.s.) with the type P, and P is the sort "sense of proposition". □
Example 13.9 The question "What and whom has John told?" may have a KR
Question2[[x2 A x3), [Tell[{Agentl, some person* [Name, "Jo/in") : a;i), {Content, X2), {Moment, X3))A Before[x3, Now)))
where xi,x2,x3 are variables of the type [ent], Question2 is a binary r.s. with the type {[[ent],P)}. □
Example 13.10 The phrase "Professor P. Jones advised M. Smith to enter the Stanford University and prepare a Ph.D. thesis on physics" may have a deep KR
[Pose-goal [{Agenti, some person* [Name, ''P.Jones'') [Quali f, prof) : xi), {Adressee, some person* [Name, "M.Smith'') : x^), {Form, advice), {Goal, [Enteringl*
[Inst, Stanford.Univ)A Preparingl* [Goal-product, certain ph.d.thesis * [Fieldl, physics))) : X3}, {Moment, 0:4)) A Before[xi, Now)).
In a similar manner, one can build KRs of phrases with the verbs "to order", "to request", etc.
Thus, restricted standard K-languages are convenient, in particular, for:
(a)	reflecting casual relationships in SRs of discourses (Example 13.1); one can reflect time relationships between described situations in a similar way;
(b)	explicating references to the meanings of phrases or larger parts of discourses (example 13.2);
(c)	building SRs of phrases with the words "notion", "concept" (example 13.3);
(d)	explicit indicating in SRs conceptual cases, or semantic cases, or deep cases (examples 13.2, 13.9, 13.10);
(e)	building formal definitions of notions (examples 13.4, 13.5);
(f)	building complex designations of diverse objects (example 13.6);
(g)	storing information pieces in object-like knowledge modules representing both the meaning of an information piece (a notion, a • theorem, a rule, an abstract, a patent, etc.) and the values of its external characteristics: the authors, the date of inputting into computer system or publishing a piece, fields of application, etc.(example 13.7);
(h)	constructings SRs of questions with the answers "Yes" or "No" (example 13.8) and with the interrogative words (example 13.9);
(i)	building semantic analogues of complicated goals and SRs of phrases with orders, advices, etc. (Example 13.10).
□
Numerous other examples illustrating the expressive power of standard K-languages and as a consequence, of restricted standard K-languages (in many cases) may be found in (Fomichov 1992, 1993b, 1994).
The analysis of the constructed model and of the examples considered in Sections 3-13 enables us to draw the conclusion that the task stated in Section 2 is solved.
Introduce an informal notion of a stationary NL-text. Assume that K is some knowledge base (k.b.). If T is an arbitrary NL-text, then we'll say that T is a stationary text with respect to K in case T doesn't contain fragments introducing new concepts and/or relationships (but T may include phrases explaining such concepts and/or relationships that the considered k.b. contains their designations as informational items).
It appears that there are reasons to put forward the following
Hypothesis. The expressive possibilities of restricted standard K-languages are sufficient and these languages are convenient for representing on some deep conceptual sublevel the structured meanings (SMs) of such arbitrarily complicated real sentences and discourses which are considered as stationary texts with respect to any k.b. □
In other words, there are reasons to conjecture that the constructed model may be interpreted as a variant of a Universal Stationary Metagrammar
of Deep Conceptual Syntax. Naturally, it is necessary to carry out further studies in order to prove or to correct the formulated hypothesis.
It was shown above that the built model affords the opportunity to reflect in SRs many peculiarities of surface semantic structure of texts. But one can essentially extend its expressive possibilities in this direction proceeding from the ideas set forth in (Fomichov 1992, 1993b, 1994). The second hypothesis is as follows: modifying the rule P[8] and adding the rules Pll, P12, F13, P14 shortly characterized in (Fomichov 1992), we are able to develop a variant of a Universal Stationary Metagrammar of Conceptual Structures (or of Conceptual Syntax).
At last, the third hypothesis is that proceeding from the ideas stated in this paper and in (Fomichov 1992, 1993b, 1994), we can construct a variant of a Universal Metagrammar of Conceptual Structures (being convenient for describing both stationary and dynamic semantics of NL-texts).
The initial version of the model described in this paper was published in (Fomichov 1988). That monograph was also the basis for preparing the articles (Fomichov 1992, 1993b, 1994). That's why it seems that the work (Fomichov 1988) provides the first variants of metagrammars mentioned in the formulated second and third hypotheses.
14 Related Approaches
In comparison with Montague Grammar (MG), Theory of Generalized Quantifiers (TGQ), Situation Theory (ST), Discourse Representation Theory (DRT), Dynamic Predicate Logic (DPL) set forth in (Groenendijck & Stokhof 1991), the constructed model has a lot of common advantages as concerns formal describing SMs of NL-texts and representing knowledge. These common advantages are the properties 2-9, 10.2, 10.3, 10.6, 12, 13, 16-18 formulated in the task statement in Section 2.
Comparing the model with the Theory of Conceptual Graphs (TCG), we can see that the advantages of the model are the properties 3-5, 7-9, 10.2, 10.3, 10.6, 12, 13, 16-18.
The Theory of Semantic Structures (JackendofF 1990) is stated in an absolutely non-mathematical form. But in (Zwarts & Verkuyl 1994) a mathe-
matical interpretation of this theory is suggested. In comparison with that mathematical interpretation, the elaborated model possesses additionally, in particular, the properties 3-9, 10.2, 10.3, 10.6, 12, 13, 16-18.
The constructed model is an important component of Integral Formal Semantics (IFS)—a powerful approach to mathematical studying the use of NL; the basic principles and composition of IFS are set forth in (Fomichov 1994). The model enables us to approximate all manners suggested by the mentioned approaches to build SRs of NL-texts and represent knowledge and provides numerous additional expressive possibilities. However, both the model and IFS as a whole are very far from all enumerated approaches to investigating NL-semantics if we take into account the posed goals and used mathematical methods.
Meanwhile, there is one approach to the study of NL-semantics with the help of formal methods which is rather close to IFS as concerns the general goals of researches (but far from the standpoint of technical aspects). It is Episodic Logic (EL) developed by L.K. Schubert and C.H. Hwang (Schubert & Hwang 1989, Hwang 1992, Hwang & Schubert 1993a, 1993b, 1993c). Both approaches advocate the need of highly expressive (natural-language-like) formal systems of semantic and knowledge representations as the ground of a comprehensive framework for developing the theory of NLPSs capable to understand complicated NL-discourses. EL is qualified by its authors as an extended first order, intensional, highly expressive logic; the structure of its formulas called logical forms (LFs) and quasi logical forms (QLFs) reflects many peculiarities of NL-texts.
It is possible to show that the elaborated model allows us to approximate all formulas of EL considered in (Hwang & Schubert 1993b). One of the principal ideas how to do this underlies the example 13.1. Besides, the model has a number of advantages: the properties 2, 3.1 (concerning complicated designations of sets), 3.2, 3.3, 4-9, 10.2, 10.3, 10.6, 13, 16-18.
It may be pointed out also that the principal idea of EL as a tool for building NL-like semantic and knowledge representations of texts was anticipated as far back as in the works (Fomichov 1981, 1982; Fomitchov 1983, 1984) setting forth the theory of free S-models, restricted 5-languages of
types 1-4, 5-calculuses and 5-languages of types 1-5 (see also Fomichov 1994).
Moreover, the expressive power of restricted Slanguages of type 4 (Fomichov 1982), 5-calculuses and 5-languages of types 4 and 5 (Fomitchov 1983, 1984) exceeds the expressive power of EL. In particular, one can show that expressive possibilities of LFs and QLFs considered in all examples in (Hwang & Schubert 1993b) may be modelled by formulas of restricted 5-languages of type 4 (Fomichov 1982) or by formulas of 5-calculuses and 5-languages of type 4 or 5 (Fomitchov 1983, 1984).
It should be added that the model has at least three global distinctive features as concerns its structure and destination in comparison with EL. The first feature is as follows. In fact, the purpose of this paper is to represent in a mathematical form a hypothesis about the general mental mechanisms (or operations) underlying the formation of complicated conceptual structures (or semantic structures, or knowledge structures) out of basic conceptual items. EL doesn't undertake an attempt of the kind, and 21 Backus-Naur forms used in (Hwang 1992) for defining the basic logical syntax (b.l.s.) rather disguise such mechanisms (operations) in comparison with more general 11 inference rules of the model determined above and 15 rules described in (Fomichov 1988, 1992).
The second global distinctive feature is that this paper formulates a hypothesis about a complete collection of operations of some deep conceptual sublevel providing the possibility to build effectively the conceptual structures corresponding to arbitrarily complicated real sentences and discourses pertaining to science, technology, business, medicine, law, etc.
The third global distinction is that the form of describing in EL the b.l.s. is not a strictly mathematical one. E.g., the collection of Backus-Naur forms used for defining b.l.s. in (Hwang 1992) contains the expressions
{1-place-pred-const) := happy \
person \ certain \ probable | ..., {1-fold-pred-modifier-const) plur | very \ former \ almost \ in-manner | ...
The only way to escape the use of three dots in productions is to define some analogue of the notion of a simplified conceptual basis introduced in the present paper.
A large stock of formal expressions for describing surface semantic structure of texts is provided by the Core Language Engine (CLE)—a domain independent system for translating a wide range of English sentences into formal representations of their literal meanings (Alshawi & van Eijck 1989; Alshawi 1990; Alshawi (Ed.) 1992). The CLE has two representation languages, their expressions are called logical forms (LFs) and quasi logical forms (QLFs). The model described in this paper enables us to approximate all expressive possibilities of LFs and QLFs. The model possesses additionally the properties 3.2, 3.3, 48, 10.3, 10.6, 16-18. Besides, the model allows us to build much more complicated designations of sets (the property 3.1). So the model has essential advantages from the standpoint of practice. As for theoretical aspects, the orientations of the model and CLE are very different, and the model has the same three global distinctive features in comparison with CLE as in comparison with EL.
The model described in Sections 3-13 has important advantages in comparison with the terminological knowledge representation language (TKRL) ìlilog developed by a number of German universities and research institutes (Herzog & Rollinger (Eds.) 1991). First, it is possible to show that the model allows us to approximate all expressive possibilities of Xlilog (Section 5 of Fo-michov 1994 may be of help for this). Second, the model has the advantages 3.2, 3.3, 4, 7, 10.6, 13, 16, 17 and enables us to build much more complicated definitions of concepts and designations of objects.
It appears that the most important integral advantage of the constructed model in comparison with all approaches discussed above is that only the model presented in this paper together with the ideas set forth in (Fomichov 1988, 1992, 1993b, 1994) provides a sound mathematical basis for studying the regularities of conveying information by arbitrarily complicated real NL-texts pertaining to diverse areas of human activity and formalizing arbitrary kinds of NL-dialogue.
15 Some Possible Applications of Results
15.1	Theoretical Applications
In (Fomichov 1992, 1993a; Martin-Vide 1993) it is pointed out at the necessity of mathematical studying NL in a broader context than it is being done traditionally: at the expediency of creating a mathematical theory of NL-communication between active intelligent systems.
The ideas described above and in (Fomichov 1988, 1992, 1993a, 1993b, 1994) provide the ground for developing mathematical lingu-ocybernetics, or mathematical theory of natural language use, or mathematical theory of NL-communication—in other terms, mathematical foundations of designing arbitrarily complicated text-based intelligent systems (full-text databases, etc.) and robust NL-interfaces to autonomous intelligent agents and other applied intelligent systems. In particular, the results may be applied to formalizing abductive inference (see Hobbs, Stickel, Appelt, & Martin 1993) as inference aimed at the construction of the best explanation of conceptual ties between fragments of a discourse.
In comparison with the state of affairs reflected in (Partee, ter Meulen, & Wall 1990, ICML'93), this paper considerably extends the limits of mathematical linguistics, opens a door to constructing generative algebraic models of complicated natural-language conceptual structures in addition to analytical algebraic models of language characterized in (Marcus 1993).
The constructed model provides very rich formal means to describe knowledge about the world and, as a consequence, to describe various situations and relations between situations. That's why it seems that the model may give a new strong impulse to the studies in Situation Theory.
The results of this paper may be used also for building models of knowledge bases, formalizing common-sense reasoning and heterogeneous reasoning, integrating multiple knowledge sources.
15.2	Significance for Practice
The model enables the designers of applied intelligent systems to work up formal languages providing one or several of the following possibili-
ties: to build SRs of complicated real NL-texts pertaining to diverse areas of human activity; to represent the intermediate results of the conceptual analysis of NL-texts; to describe background knowledge; to construct high-level conceptual representations of complicated visual images; to describe lexical semantics; to construct ontologies of application domains; to develop more powerful and flexible hybrid knowledge representation systems. The model opens a lot of new prospects in all these directions.
16 Conclusions
The model determined and studied in this paper enriches in a leap-like manner the stock of formal means for describing structured meanings (SMs) of NL-texts and representing other conceptual items. Combining this model with the ideas set forth in English in (Fomichov 1992, 1993b, 1994), we have also an effective method of constructing formal models being convenient for describing SMs of practically arbitrary (very likely, arbitrary) real discourses and for representing knowledge about the world. That method was published in full in Russian in the monograph (Fomichov 1988), which, it seems, provided the first variant of a Universal Conceptual Metagram-mar.
The results stated above bridge a gap between formal semantics of NL and the demands of the theory of text-based intelligent systems and of several other subfields of computer science. They open a way for mathematical studying the regularities of conveying information by arbitrarily complicated real discourses pertaining to science, technology, medicine, business, law, etc. Besides, it may be hoped that obtained results will be effectively used for studying a number of actual theoretical and practical problems in computer and cognitive science.
Acknowledgements
I thank two anonymous referees for constructive comments helped to improve the presentation style of the paper.
I am deeply grateful to Prof. A.P. Zeleznikar for the kind help (going beyond the scope of the Editor's duties) in improving the submitted Lal^
file of this paper.
References
[1]	Aczel, P., Israel, D., Katagiri, Y., & Peters, S. (Eds) (1993): Situation Theory and Its Applications, Vol. 3. CSLI Lecture Notes No. 37. Stanford: CSLI Publications.
[2]	Ahrenberg,L. (1992): On the integration and scope of segment-based models of discourse. In Papers from the Third Nordic Conf. on Text Comprehension in Man and Machine, Linkoeping Univ., 1-16.
[3]	Alshawi, H. & van Eijck, J. (1989): Logical Forms in the Core Language Engine. In Proc. 27th Ann. Meeting of the ACL, Vancouver, Canada, 25-32.
[4]	Alshawi, H. (1990): Resolving quasi logical forms. Computational Linguistics, 16(3), 133-144.
[5]	Alshawi, H. (Ed.) (1992): The Core Language Engine, MIT Press, Cambridge, MA.
[6]	A Plan (1992): A Plan for the Knowledge Archives Project, The Economic Res. Inst.; Japan Soc. for the Promotion of Machine Industry; Systems Res. & Dev. Inst, of Japan, Tokyo, March, 1-78.
[7]	Barwise, J. (1989): The Situation in Logic. CSLI Lecture Notes No. 17. Stanford., Calif.: CSLI.
[8]	Barwise, J. (1992): Information links in domain theory. In Proc. of the Math. Foundations of Programming Semantics Conf. (1991), ed. S. Brookes et al.. Lecture Notes in Computer Science, No. 598, Springer, 168-192.
[9]	Barwise, J. (1993): Constraints, channels, and the flow of information. In Aczel, Israel, Katagiri, & Peters (1993), 3-27.
[10] Barwise, J. & Cooper, R. (1993): Extended Kamp notation: a graphical notation for Situation Theory. In Aczel, Israel, Katagiri, h Peters (1993), 29-53.
[11]	Barwise, J. k Etchemendy, J. (1990): Information, infons, and inference. In Situation Theory and Its Applications 1, ed. R. Cooper, K. Mukai, and J. Perry, CSLI Lecture Notes No. 22. Stanford, Calif: CSLI, 33-78.
[12]	van Benthem, J. (1992): Fine-structure in categorial semantics. In Rosner & Johnson (1992), 127-157.
[13]	Cooper, R. (1991): Three lectures on Situation Theoretic Grammar. In Natural Language Processing, ed. M. Filgueiras et al. Berlin etc.: Springer-Verlag, 102-140.
[14]	Devlin, K. (1991): Logic and Information. Cambridge: Cambridge Univ. Press.
[15]	Ellis, G. (1995): Compiling conceptual graphs. IEEE Trans, on Knowl. and Data Eng., 7(1), 68-81.
[16]	Fenstad, J.E., Langholm, T., & Vestre, E. (1992): Representations and interpretations. In Rosner & Johnson (1992), 31-95.
[17]	Fomichov, V.A. (1981): To the theory of logic-algebraic modelling the speech-forming mechanisms of conceptual level. 1. Task statement and the idea of an approach to its resolving. Moscow Inst, of Electronic Eng., 85 pages.—Paper deposited in AUISTI of Ac. Sc. USSR (All-Union Inst, of Sc. and Techn. Information; in Russian—Vsesoyuzny Institut Nauchnoy i Technicheskoy Informatsii, or VINITI) 27/10/81, No. 4939-81 Dep.— The abstract 2B1386DEP in the Abstracts J. "Mathematica" of AUISTI, 1982, No. 2.
[18]	Fomichov, V.A. (1982): Development and application of methods of logic-algebraic modelling the mechanisms of speech-formation semantics. Ph.D. dissertation. Moscow: Moscow Inst, of Electr. Eng.
[19]	Fomitchov, V.A. (1983): Formal systems for natural language man-machine interaction modelling. In Int. Symp. on Artif. Intel. IFAC, Vol. 1. Leningrad: Ac. Sc. USSR, 223-243 ( in two variants: in Russian and in English). Paper presented at the First Symp. of the Intern. Feder, of Autom. Control on Artif. Intell.
[20]	Fomitchov, V.A. (1984): Formal systems for natural language man- machine interaction modelling. In Artificial Intelligence. Proc. of the IFAC Symp. Leningrad, USSR, 4-6 Oct. 1983, ed. V.M. Ponomaryov (IFAC Proc. Series, 1984, No. 9). Oxford, UK; New York, etc.: Pergamon Press, 203-207.
[21]	Fomichov, V.A. (1988): Representing Information by Means of K-calculuses. Text-book. Moscow: The Moscow Institute of Electronic Engineering Press (in Russian).
[22]	Fomichov, V. (1992): Mathematical models of natural-language-processing systems as cybernetic models of a new kind. Cyber-netica (Belgium), XXXV (1), 63-91.
[23]	Fomichov, V.A. (1993a): Towards a mathematical theory of natural-language communication. Informatica. An Intern. J. of Computing and Informatics (Slovenia), 17(1), 2134.
[24]	Fomichov, V.A. (1993b): K-calculuses and K-languages as powerful formal means to design intelligent systems processing medical texts. Cybernetica, XXXVI (2), 161-182.
[25]	Fomichov, V. (1994): Integral Formal Semantics and the design of legal full-text databases. Cybernetica, XXXVII (2), 145-177.
[26]	Groenendijk, J. & Stokhof, M. (1991): Dynamic Predicate Logic. Linguistics and Philosophy, 14(1), 39-101.
[27]	Herzog, O. & Rollinger, C.-R. (Eds.) (1991): Text Understanding in LILOG. Integrating Computational Linguistics and Artificial Intelligence. Final Report on the IBM Germany LILOG-Project. Berlin etc.: SpringerVerlag.
[28]	Hobbs, J.R., Stickel, M.E., Appelt, D.E., & Martin, P. (1993): Interpretation as abduction. Artif. Intelligence, 63(1-2), 69-142.
[29]	Hwang, C.H (1992): A Logical Approach to Narrative Understanding. Ph.D. Dissertation, U. of Alberta, Edmonton, Canada.
[30]	Hwang, C. H. & Schubert, L.K (1993a): Meeting the interlocking needs of LF-computation, deindexing, and inference: An
organic approach to general NLU. Proc. 13th Int. Joint Conf. on AI (IJCAr93). Cham-bery, France, 28 Aug.-3 Sept. 1993.
[31]	Hwang, C.H. & Schubert, L.K. (1993b): Episodic Logic: a comprehensive, natural representation for language understanding. Minds and Machines, 3 , 381-419.
[32]	Hwang, C.H. & Schubert, L.K. (1993c): Episodic Logic: a situational logic for natural language processing. In Aczel, Israel, Kata-giri, & Peters (1993), 303-338.
[33]	ICML'93 (1993): 1 Intern. Conf. on Math. Linguistics. ICML'93. Tarragona (Catalonia, Spain), March 30-31, 1993. Ed.: Carlos Martin-Vide.
[34]	Jackendoff, R.S. (1990): Semantic Structures. MIT Press, Cambridge, Mass.
[35]	Kamp, H. & Reyle, U. (1993): Introduction to Modeltheoretic Semantics of Natural Language, Formal Logic and Discourse Representation Theory. Dordrecht: Kluwer Academic Publishers.
[36]	Markus, S. (1993): The status of research in the field of analytical algebraic models of language. In ICML'93, 47-48.
[37]	Martin-Vide, C. (1993): How to infer nonformal conclusions from mathematical models in the analysis of natural language: a preliminary case study in communication theory. In ICML'93, 49-50.
[38]	Meyer, R. (1994): Probleme von Zwei-Ebenen-Semantiken. Kognitionswissenschaft, Band 4, Heft 1, 32-46.
[39]	Nebel, B. k Peltason, C. (1991): Terminological reasoning and information management. In Information Systems and Artif. Intel.: Integration Aspects. First Workshop. Ulm, FRO, March 1990, Proc., ed. D. Ka-ragiannis. Berlin etc.: Springer-Verlag, 181212.
[40]	Partee B. H., ter Meulen, A., & Wall, R. (1990): Mathematical Methods in Linguistics. Dordrecht etc.: Kluwer.
[41]	Rosner, M. & Johnson, R. (Eds) (1992): Computational Linguistics and Formal Semantics. Cambridge: Cambridge Univ. Press.
[42]	Schubert, L. K. & Hwang, C. H. (1989): An episodic knowledge representation for narrative texts. Proc. 1st Int. Conf. on Principles of Knowledge Repres. and Reasoning (KR'89), Toronto, Canada, 444-458.
[43]	Sowa, J.F. (1984): Conceptual Structures: Information Processing in Mind and Machine. Addison-Wesley Pubi. Comp.: Reading, MA.
[44]	Sowa, J.F. (1991): Toward the expressive power of natural language. In Sowa, J.F. (Ed.), Principles of Semantic Networks. Explorations in the Representation of Knowledge. Morgan Kaufman Pubi., Inc.
[45]	Wah, B.W., Huang, T.S., Joshi, A.K. et al. (1993): Report on Workshop on High Performance Computing and Communications for Grand Challenge Applications: Computer Vision, Speech and Natural Language Processing, and Artificial Intelligence. IEEE Trans, on Knowl. and Data Eng., 5(1), 138154.
[46]	Železnikar, A.P. (1993): Mission and Research Reports. Informatica, 17(1), 81-100.
[47]	Zwarts, J. & Verkuyl, H. (1994): An algebra of conceptual structure; an investigation into Jackendoff's conceptual semantic. Linguistics & Philosophy, 17(1), 1-28.
A Logic for Conditional Recommendation
S. K. Das
William Penney Laboratory, Imperial College, London SW7 2AZ Phone: +44 171 594 8424, Fax: +44 171 594 8449 E-mail: skd@doc.ic.ac.uk AND
P. Hammond
(
Department of Computer Science, Brunei University, Middlesex UB8 3PH Phone: +44 1895 203393, Fax: +44 1895 251686 E-mail: P.Haiiimond@brunel.ac.uk
Keywords: conditional, decision support system, logic, recommendation Edited by: Johann Eder
Received: November 22, 1994 Revised: December 15, 1995 Accepted: December 22, 1995
A decision support system is a computerized system which utilizes knowledge about a particular application area to help decision makers by recommending suitable actions. An action is conditionally recommendable when a set of conditions associated with it is expected to be brought about in the state resulting from performing the action. Conditional recommendations occur frequently in practical decision-making contexts but their formal treatment is yet to be explored. Our approach is to develop a modal propo-sitional logic Ccr for reasoning about conditional recommendations. The semantics of the logic is studied in terms of possible worlds where transition from one state to another is made by performing an action. The soundness and completeness of the logic is established. We also discuss the exploitation of Ccr to formalize update and integrity constraint verification in knowledge bases.
1 Motivation	narrow window between efficacy and toxicity of
many of the treatments, but especially in chemo-
The formalization of conditional recommenda-	therapy [9].
tion is important in the implementation of	^ concerned with the selection of a
safety-critical systems, those decision support sy-	Particular treatment plan, or protocol, can be
stems [3] where malfunction or operator error	expressed as a high-level recommendation [6]:
may give rise to severe injury, death or finan-	jf p^^i^j^^ operable bone cancer
cial/environmental catastrophe with the potential	^^en consideration of treatment under
for legal liability. The' work presented here arises	BQ03 ^^^^ ^e recommended, from our involvement in a project addressing the
technology and legal implications of knowledge- This kind of sentence has the general form based decision support systems in safety-critical
domains. To illustrate and motivate the abstract	^^ condition <f> (the patient has operable
notions, we give examples in the domain of deci-	bone cancer) holds in the present state
Sion support in oncology, a specific interest of one	^f the knowledge base then an action a
of the project partners.	(consideration of treatment under pro-
^ ,	,	r	r	tocol B003) can be recommended.
Because of the complexity of treatment performed and the amount of data captured during on- Symbolically these sentences are of the form cology trials, computerized decision support is essential. The safety-critical aspect arises from the	-i- 7Z(a)
where %{&) is read as "action a is recommenda-ble".
The condition cj) is sometimes complex and involves conjunctions and disjunctions. For example, consider the following recommendation of an action:
If the patient has operable bone cancer and the patient's renal function is adequate then the inclusion of the drug cisplatin in chemotherapy can be recommended.
In this case the condition (j) is a, conjunction of two subconditions that the patient has operable bone cancer and the patient's renal function is adequate. The action a is the inclusion of the drug cisplatin in chemotherapy. On the other hand, sentences like
If the patient is eligible for treatment under protocol B003 and folinic acid rescue and prehydration schemes are instituted before chemotherapy and posthydration afterwards and the patient is available for subsequent toxicity monitoring then the inclusion of high-dose methotrexate in chemotherapy can be recommended.
involves an action with two associated conditions. The general form of such sentences is
If a set of conditions (j) (the patient is eligible for treatment under protocol B003 and folinic acid rescue and prehydration schemes are instituted before chemotherapy) holds in the present state of the knowledge base then an action a (inclusion of high-dose methotrexate in chemotherapy) can be recommended provided another condition C (posthydration is instituted and the availability of patient for toxicity monitoring) is brought about in the state obtained by taking the action.
Symbolically these sentences are of the form
(j) cn{a, C)
where CTZ{a, C) is called a conditional and read as "action ot is conditionally recommendable provided condition C". These sentences are of the
most general kind. The conditions ^ and C will be called respectively the precondition and the postcondition for the recommendation of a. We aim to formalize the behaviour of postconditions by developing a logic £cr, an extension of propositi-onal logic. The reason for ignoring preconditions and concentrating on postconditions is that a precondition can be verified on the current state in a straightforward manner using a theorem pro-ver. On the other hand, the assumption of a postcondition is a prediction and therefore its study beforehand is important.
Now if the postcondition in a conditional is a property (for example, the patient's availability or low temperature) then it can be brought about by performing, in general, more than one type of action in the transformed state (see figure 1). For example, if the patient's low temperature is to be brought about after an operation, then for each type of treatment for bringing down temperature a state exists in which the patient does have low temperature. On the other hand, if an action (for example, posthydration) is to brought about, then it can be implemented by performing the action itself and another state is obtained after performing the action (see figure 2). It is our assumption that if a property is to be brought about then there must exist at least one action whose execution in the present state will make the property true in the transformed world.
The rest of the paper is organized as follows. Sections 2, 3 and 4 present respectively the syntax, axioms and semantics of £cr- The soundness and completeness result of £„■ is presented in section 5. Section 6 provides a detailed guideline for formalizing update and constrains verification in a knowledge base. Throughout the rest of the paper, unless otherwise stated, conditions in conditionals will be denoted by C, D, ...; property symbols by p, q, ...; action symbols by a, ß, ...; formulae by F, G, ...; worlds by w, w', ... .
2 Syntax
In this section, we present the syntax of the language of Ccr which makes provision for representing conditionals of the form discussed in the previous section. The language of Ccr is the usual proposi-tional one extended with a binary modal operator.
Due to the dynamic nature of the applications

Present world (w) A{a,w)
property symbol T (true). The modal operator of Ccr is CTZ. The formulae T of £cr are as follows:
-	All propositional formulae are modal formulae.
-	An atomic conditional C7l{a,C) is a modal formula, where a is an action and C is a formula.
A(a2,A(a,w))
-^(f), cß A are modal formulae, where ij) are modal formulae.
and
A{an, A{a,w)) Permissible worlds Figure 1: Relationship between the present world and permissible worlds in the presence of the conditional C'R{a,j)), where p is a property and A(q;, w) is the world obtained from w by executing a.
a
ß
Present world (w) A{a,w)
A{ß,A{a,w))
Figure 2: Relation between the present world and the final world in the presence of the conditional C7l{a,ß), where ß is an action.
we have dealt with, the description of the universe of discourse of Ccr is considered to have two aspects: static and dynamic. The static aspect of the universe of discourse is described by properties (e.g., the patient has a bone cancer) and dynamic aspects by occurrences (e.g., the patient is given an injection). For a detailed discussion of properties, occurrences, etc., readers are referred to [1]. An occurrence is either an event or a process. Some occurrences involve animate agents (i.e., decision makers) performing actions. We shall not consider occurrences where animate agents do not perform any actions. In view of this, the set of propositional symbols Pq, Pi, ... is partitioned into two disjoint subsets as V and A to represent respectively properties and actions.
The domain of propositions includes the special
Note that we consider formulae that are more general than the form <p CTZ{a, C) as suggested in section 1. We take ± (false) to be an abbreviation of -iT. Other logical connectives are defined using -1 and A in the usual manner. C'R{a,C) is read as "action a is conditionally recommendable provided condition C" and the formula CTZ{a,T) is simply written as 7^(a;) and is read as "action a is recommendable". If /3 is an action then the conditional CTZ{a,ß) states that the action a is conditionally recommendable provided ß is performed in the state obtained from the current state by performing a (figure 2). Note that the presence of an action ß in the knowledge base represents the occurrence that ß has been taken.
3 Axioms
To present the axioms of £cr, consider first the propositional logic and we have
all instances of propositional tautologies (1)
The above set includes instances of propositional tautologies that may involve any number of the modal operators, for example, CTZ{a, Cj —> C7Z(a,C). The modus ponens rule
MP: if H F and h F ^ G then h G
is the only rule of inference, where 'h' denotes deri vability in Cj-cv As a first step of extending pro-positional logic to meet our objectives, we have to cover situations where no action can bring J. about. Therefore, we consider the following axiom as part ol Ccr

(2)
For any action a, equivalent conditions C and D guarantee the conditionals C7^(a, C) and ClZ{a, D) are equivalent. This can be represented by the following equivalence inference rule:
EQ: if h C f^ ö then h C7^(a, C) H- C7Z(a, D)
Consider the possibility of replacing the above inference rule by a more general kind of inference rule "if D ^C then I- C7^(a, C) ^ Cn{a, D)". This can be justified by the fact that if bringing about D in a state implies bringing about C then a being conditionally recommendable provided C will yield that a is also conditionally recommendable provided D. We exclude this possibility for the following reason. If D C holds then bringing about D in a state implies bringing about at least C and possibly more, if C —> D does not hold. Correspondingly, the additional actions may affect the objective for which a is recommended.
Readers may have observed the similarity between axiom (2) and inference rule EQ and the axiom and inference rule of minimal deontic logic [2, 4], which can be obtained by replacing each CTZ{a,C) by the obligation 0{C). Later, in section 5, we shall take advantage of this to develop the soundness and completeness of £cr in line with that of minimal deontic logic. The major difference is the variation of the modal operator CTZ for every action. Also, we have two more axioms in Ccr and they are presented below.
In any particular state of a knowledge base more than one kind of action can be conditionally recommendable and we choose the most suitable one to execute. Therefore, in a particular state and for a particular action a we may be able to derive CTZ{a, C), CTZ{a, D), and so on. Thus, to say that a is conditionally recommendable provided condition C and a is conditionally recommendable provided condition D is equivalent to saying that a is conditionally recommendable provided condition C or D. This statement provides the following axiom of Ccr-
Cn{a,C)ACn{a,D)^Cn{a,CyD) (3)
The above axiom provides us with a view of alternative conditions available to satisfy when a specific action is conditionally recommended. Note that, if we have CTZ{a, C) and CTZ{a, -iC) then the axiom (3) together with the inference rule EQ generate CTZ{a,T), that is, n{a). This is quite expected because either C or -iC holds in any state and thus a can simply be recommended without any postcondition.
In our knowledge base we may have actions in different names which are equivalent to each other. By an equivalence of two actions a and ß (written asa ß) we mean that the cost, benefit, outcome, etc., involved in executing a is exactly the same as that of ß. For example, if the school is located 100 metres in a northerly direction and the only road to the school is in this direction then the action of walking to the school and the action of walking 100 metres north are equivalent. This motivates us to consider the following inference rule as part of Ccr-
{a ^ß)^ (C7^(a, C) ^ Cn{ß, C)) (4)
The axiom states that if two actions are equivalent then the recommendability of one action provided some condition is equivalent to the recommendability of the other provided the same condition. The above axiom generates equivalent conditionals with respect to actions. The axiom also provides us with a way of renaming actions in a knowledge base.
This completes our presentation of axioms and inference rules of £„•• Now we show that the system is consistent by means of the following proposition.
Proposition 1 The axiom system of Ccr is consistent. In other words, ^Ccr
Proof The prepositional subset of is consistent. Axiom (2) is the source of only one conditional. Therefore, the resultant system is still consistent. Inference rule EQ fails to generate a positive conditional of the form C7^(a, C) which could have been inconsistent with axiom (2). Therefore consistency is still preserved. We now add axiom
(3).	This axiom does generate a positive conditional of the form C7^(a, C V D) given ClZ{a, C) and CTZ{a,D). But neither C nor D is equivalent to L due to the presence of axiom (2) and inference rule EQ. Therefore the addition of this axiom cannot result in inconsistency. Finally, when axiom
(4)	is added to obtain Ccr, we are still unable to derive an atomic conditional. □
We now provide an example to illustrate the formalization and the deduction process within
Example 1
Consider the following sentence of conditional recommendation which has already been cited in section 1:
If the patient is eligible for treatment under protocol B003 and folinic acid rescue and prehydration schemes are instituted before chemotherapy then the action of the inclusion of high-dose methotrexate in chemotherapy is recommended provided posthydration is instituted and the patient is available for toxicity monitoring.
Consider another sentence which is a rule of the knowledge base:
If the patient has satisfactory diagnosis for protocol then the patient is eligible for treatnient under protocol.
The symbolic representations of the above two sentences are as follows:

C7^(a,7Ar)
where a, ß and 7 are action symbols whose meanings are described as follows:
a - inclusion of high-dose methotrexate. ß - institution of fol. acid rescue and prehyd. 7 - institution of posthydration scheme.
The symbols p, q and r represent different properties of patients as described below:
p - has satisfactory diagnosis for protocol. q - eligible for treatment under protocol, r - patient is available for toxicity monitoring.
Now, consider the following state of a knowledge base:
{p-^q, qAß^ Cn{a, 7 A r) }
Suppose we have evidence that the patient has a satisfactory diagnosis for the protocol and folinic acid rescue and prehydration scheme are instituted. Then the above state is changed to the following state:
{P, ß,P^ q,qAß^CTZ{a,jAr) }
Now CTZ{a,'yAr) becomes derivable which conditionally recommends a. If we execute a, the state of the knowledge base is changed to the following:
{ a, p, /?, p ^ g, ^ A /3 C7^(a', 7 A r) }
Since a has been executed,	7Ar) now acts
as an obligation on 7Ar. In other words, 7 should be executed and r should be brought about. If these are carried out, the final state without any obligation is transformed to the following:
{'y,r,a,p,ß,p-^q,qAß^ CH{a, 7 A r) }
Suppose we have s V 7 A r as the postcondition instead of 7Ar and s has been brought about. The following would have been the final state without any obligations:
{s,a,p, ß,p-^q,qAß ^ Cn{a, S V 7 A r)}
4 Semantics
The description of the universe of discourse of Ccr is interpreted as a conceptual model which consists of a set of possible worlds [10]. In this section we study conceptual models of Ccr in terms of minimal models.
Definition 1 A minimal model for the logic Ccr can be defined as
where W is a set of possible worlds and B maps a proposition P to a set of worlds for which P is true. Each R^ is a mapping that selects a set of formulae, for each world w, such that, if C e Ra{w) then a is conditionally recommendable in w provided condition C.
A proposition or a formula can be uniquely characterized by a set of worlds in each of which it is true. Therefore, the mapping Ra can be defined as
Ra-.w n(n(w^))
where n(v\') is the power set of X. The set of worlds from W for which C is true is a member of n(n(W')) and will be denoted as || C or simply II C II, when the name of the model is clear from the context. On the other hand, a member 5ofn(n(W)) is of the form || C where C is
the member of T which is uniquely characterized by 5.
For every w in W, the set	satisfies the
following three properties:
PI: 0^Ra(w).
P2: if Si, 52 e	then U 52 € RaM-
P3: if II a II II /? II then Ra(w) = Bfl(w).
These three properties are due to axioms (2), (3) and (4) respectively. Property PI makes each Ra{w) a collection of non-empty sets. Property P2 makes Ra{w) closed under union.
Not every action can be recommended in every world. Each world w £W has a certain number of defined actions and each defined action when executed on w transforms it to another world. When an action a is undefined in a world w then Ra{'w) is empty. If an action a is defined in every world then each Ra{w) contains T.
The rules for determining truth values of formulae in Mcr and a world w are as follows:
1 tu -p
hlt^^P ì^weBip)
NXt.-^ iff
\=ltJTZ{a,C)ì^\\C\\eRa{w)
Definition 2 A formula F is said to be true in model Mcr, written as	if and only if
\=M	world w in W. A formula F is
said to be valid, written as \= F, if F is true in every model.
5 Soundness and Completeness
This section contains the soundness and completeness results for Ccr using the concept of minimal model [4]. Ccr is said to be sound with respect to a class of models V if and only if for every model A^ in F and for every sentence F ^ T^ii hF .then \=j^F. Ccr is said to be complete with respect to F if and only if for every Af in F and for every sentence F G J", if F then hF. The following two propositions prove that the validity in a class of models is preserved by the use of the rule of inference and the axioms of Ccr-
Proposition 2 Let Mcr = {W,{Ra]aeA^B) be a minimal model. Then,	C ^ D implies
Proof If	^ D then || C || = || D ||. The-
refore, for any world u; in Mcr^ || || S Ra{w) iff II -D II G Ra{w). So for any world w in Mcr,	iff	Thus,
Proposition 3 For every formula F and G the axioms of Ccr are valid, that is,
1.	\=^CTZ{a,l.).
2.	\=CTZ{a,C)ACn{a,D)-^Cn{a,CyD) S. \= {a ^ß)-^ {cn{a, C) CTliß, C)).
Proof Let Mcr = {W,	B) be a model
and w G W.
1.	By property PI of A^^r, 0 ^ Ra{w). Thus, ||±|| ^ Ra{w); otherwise, Ra{w) would contain the 0 to satisfy ||±|| G	Thus,
that is	Jl).
2.	If	Cn{a,C) A C7^(a,D) then
and
Therefore, for any world w, || C ||, II -C II G Ra{w) and thus, by property P2 of Mcr, II C II U II £) II G Ra{w). This means that HCVDjl G Ra{w), that is,
3.	If	ß then, by property P3 of Mcr, Ra(w) = Rß{w). Therefore,' hXi, C7^(a,C) iff II C II G Ra{w) iff lie II G Rß{w) iff	Therefore, □
Propositions 2 and 3 establish the basis of the soundness result. In order to prove the completeness result, the following class of models is relevant.
Definition 3 A minimal model Mcr — {W,{Ra}a^^,B) for the logic Ccr is called a canonical minimal model, written as M^^, if and only if
-	W = {w.- w is a maximal consistent set in logic Ccr}- (N.B. The maximal consistent sets exist because Ccr is consistent).
-	For every w in W, C7Z{a,C) G w if and only if\ C \c„ e Ra{w), where | C (or simply
I C is the set of maximal consistent set of sentences containing C in logic Ccr -
For each n = 0, 1, 2, ..., the set of all possible worlds for which the proposition Pn is true is
In the above definition of the canonical minimal model we have considered the set of worlds W as the set of maximal consistent sets of sentences. Therefore, we should make sure that when an action a is performed on a maximal consistent set w, leads uniquely to another world A{a, w) which is also a maximal consistent set. The mapping A in this case can be defined as follows. A(a, w) is constructed initially from a itself and all non-atomic sentences of w. An atomic proposition P (resp. -iP) from w is added to w) if it is consistent with A{a,w); otherwise, -iP (resp. P) is added to A(a, w).
Proposition 4 Let M^^ = {W,	B) be a
canonical minimal model for Ccr ■ Then, for every weW and F e	iff F e w.
Proof Since each tu G T^ is a maximal consistent set of then	iff F € w for every F €
T without any occurrences of CR. Consider the conditional C7^(a,C). Then, by induction hypothesis, we have
NXfg C iff c G t..
So for any w G W^ II C II G Raip) iff I C I G Since is a minimal model, \=l^cjn{a,c)m\\c\\eRAw).
Therefore, from the last two sentences,
ifF|C| GižcH-
By the definition of the canonical minimal model,
cn{a,c)ewifi\c\e Ra{w).
Therefore, from the previous two sentences,
C) iff C7^(a,C) G w. Therefore, for every F £ !F,
Therefore, the worlds in a canonical minimal model for a system of modal logic will always verify just those sentences they contain. In other words, the sentences which are true in such a model are precisely the theorems of the system. Thus, we have the following theorem.
Theorem 1 Let M% = {W,	B) be a
canonical minimal model for Ccr- Then, F if and only if hF.
Proof Follows from the above remark. □
Theorem 2 If M% is a canonical minimal model of Ccr then the model satisfies properties PI, P2 and P3.
Proof To show that M^^ satisfies property PI, if possible, let 0 G Ra[w). This means that ||1|| G Ra{w), that is, CTZ{a,l.). By proposition (4), C7^(a,±) G w. Also,	1), that
is, ^CTZ{a, ±) G w, which yields a contradiction as w is consistent.
To prove that Mcr satisfies property P2, suppose II Ci II, II C2 II G Ra{w), that is, and	C2). By propo-
sition (4), C7^(a,Cl), Cn{a,C2) G w. Since w is maximal consistent, C'R,[a,Ci) A C7^(q:, C2) G w. Now, axiom (3) is a member of w and w is maximal consistent. These suggest that CJl{a,C y D) £ w, that is, by proposition (4), \=l^cCn{a,CWD). By definition, || Ci V C2 || G w, that is, II Ci II, II C2 II G w.
To establish that M.% satisfies property P3, suppose II a II = 11/3 II, that is, |=Xic a ß. By proposition (4), a <-)■/? G Now, axiom (4) is a member of w and w is maximal consistent. These suggest that Cn{a, C) O C7^(/?, C) G w. Again, by proposition (4), |=X,c^C7^(a, C) f^ C7^(/?,C), that is,	C) iff \='ji,cCTZ{ß, C). Thus,
by definition' || C || G Raiw) iff || C || G Rß{w). Therefore, Rc{w) = Rß{w). □
Suppose r is the class of all models satisfying the properties PI, P2 and P3. Then the following soundness and completeness theorem establishes the fact that Ccr is determined by P..
Theorem 3 For every formula F in Ccr, ^F if and only if |=F.
Proof The first half follows from propositions 2 and 3. To prove the converse, suppose |=F. Theorem 1 implies that there is a canonical minimal model Mcr, satisfying properties PI, P2 and P3, such that hF if and only if Na^c^F. Now, theorem 2 establishes that M^r € F. Thus, implies	Therefore, hF. □
6 Discussion
In this section we discuss a special case of Ccr to handle updates to knowledge bases formally
and the corresponding integrity constraint verification. Following the traditional approach in databases, a knowledge-based decision support system can be defined as a knowledge base together with a set of integrity constraints [5]. Each integrity constraint must be satisfied by every state of the knowledge base. Following an update to the knowledge base the relevant constraints are evaluated; this can be achieved through meta-Ievel reasoning. An alternative way of handling this is by the notion of what we call conditional possession. A property p can be possessed conditionally by a knowledge base provided condition C must hold in the state after possessing the property. This is expressed as CTZ{p,C).
There are some differences between a conditional possession and a conditional recommendation. The latter is more general than the former. First of all, conditional recommendation of an action implies conditional possession of a set of properties or occurrences which reflect the change due to the action. Secondly, if a conditionally recommended action is executed then it cannot be reversed. On the other hand, a conditionally possessed property can be retracted. Finally, in the case of a conditional possession, the condition C in a conditional must hold or must be brought about in the state after possessing the said property; otherwise, it does not make sense to continue using the possessed property and therefore the property must be retracted. In contrast, in the case of conditional recommendation, the condition C in a conditional is to be brought about, failing which it may result in undesirable consequences.
Consider an example of an integrity constraint that a person's height must be less than 3 metres. If the property to be possessed is the person's height then the condition is that for every person the height must be less than 3 metres. If we enter 3.2 as a person's height then we shall be able to derive ± from the knowledge base and this contradicts axiom (2). If we continue using values contradicting constraints we may arrive at an absurd drug dosage or result such as a probability greater than one. The former arises because drug dosage is typically computed according to height and weight. Therefore the possessed property must be 'etr acted.
Consider another example where ± can be de-
rived after possessing the property and we bring about the condition and therefore restore consistency by another possession. Consider the conditional CTZ{p, C) where the property is a student's record and C is the integrity constraint that for every student there must exist a supervisor. If we now possess a new student record before assigning its supervisor then the knowledge base becomes inconsistent, that is, ± is derivable from the knowledge base. To bring back the consistency, that is, to bring about C, we have an option to enter a supervisor for that student rather than retracting the student's record.
What has just been discussed reflects the sensitivity of a property to be possessed by allowing conditionals of the form CTZ{p,C), that is, the conjunction of relevant integrity constraints C (which may be the empty condition T) to be evaluated upon possessing the property. If every possible property to be possessed is annotated with its conjunction of relevant integrity constraints then the knowledge-based system can be viewed as a single entity consisting of sentences involving propositions as well as conditionals.
Although we have named the primitive construct C'il[a^C) of Ccr as conditional, there is no similarity between a conditional logic and Ccr-(for an account of conditional logics, see [11]). The construct CTZ{a, C) will not be equivalent to the traditional type of conditional a > C. The idea of embedded implication [7, 8] in N-Prolog is similar to our concept of conditional possession discussed in this section. An embedded implication D G, where G is a goal and D is a set of clauses, succeeds from an N-Prolog program P if G succeeds from the enlarged program Pu{D}. On the other hand, condition C arising from a conditional C'}Z{p, C) due to the possession of p is said to have been fulfilled in the context of a knowledge base KB provided C succeeds from KBu{p}.
7 Conclusion
We have developed a modal logic extending pro-positional logic for reasoning with conditionals. It would be more natural to consider an underlying logic more general than propositional such as temporal first-order and then introduce the conditional modal operator on it. We have shown that a special case of the logic is useful to forma-
lize update and integrity constraint verification in knowledge bases.
Acknowledgements
SKD completed this work while working at the Computer Science Department of Queen Mary and Westfield College, London. Similarly, PH was previously at the Advanced Computation Laboratory, Imperial Cancer Research Fund, London. The authors would like to thank their colleagues in the RED project and two anonymous reviewers for helpful comments. The project was supported under the DTI/SERC project ITD 4/1/9053: Safety-Critical Systems Initiative.
References
[1]	J. F. Allen. Towards a general theory of action and time. Artificial Intelligence, 23:123154, 1984.
[2]	L. Aqvist. Deontic logic. In D. Gabbay and R. Guenthner, editors, Extensions of Classical Logic, volume 2 of Handbook of Philosophical Logic, pages 605-714. D. Reidel Publishing Company, 1985.
[3]	R. H. Bonczek, C. W. Holsapple and A. B. Whinston. Development in decision support systems. Advances in Computers, 23:123154, 1984.
[4]	B. Chellas. Modal Logic. Cambridge University Press, 1980.
[5]	S. K. Das. Deductive Databases and Logic Programming. Addison-Wesley, 1992.
[6]	S. K. Das and J. Fox. A logic for reasoning about safety in decision support systems. In Clarke et al., editors, Proceedings of the Second European Conference on Symbolic and Quantitative Approaches to Reasoning and Uncertainty, pages 79-84. Springer-Verlag, November 1993.
[7]	D. M. Gabbay. N-Prolog: an extension of prolog with hypothetical implications. II. logical foundations and negation as failure. Journal of Logic Programming, 4:251-283, 1985.
[8]	D. M. Gabbay and U. Reyle. N-prolog: an extension of prolog with hypotheticahimpli-cations. I. Journal of Logic Programming, 4:319-355, 1984.
[9]	P. Hammond, A. L. Harris, S. K. Das and J. C. Wyatt. Safety and decision support in oncology. Methods of Information in Medicine, 33:371-381, 1994^
[10]	S. A. Kripke. Semantical analysis of modal logic I: normal modal prepositional calculi. Zeitschr. f. math. Logik und Grundlagen d. Math., 9:67-96, 1963.
[11]	D. Nute. Conditional logic. In D. Gabbay and R. Guenthner, editors, Extensions of Classical Logic, volume 2 of Handbook of Philosophical Logic, pages 387-440. D. Reidel Publishing Company, 1985.
Rule-Generating Abduction for Recursive Prolog
Kouichi Hirata
Kyushu Institute of Technology, Kawazu 680-4, lizuka 820, Japan
Phone: +81 948 29 7884, Fax: +81-948-29-7600
E-mail: hirata@ai.kyutech.ac.jp
Keywords: rule-generating abduction, inductive logic programming, generalization
Edited by: Rudi Murn
Received: June 14, 1995	Revised: September 9, 1995 Accepted: October 31, 1995
The rule-generating abduction is a kind of abduction which generates a rule and proposes a hypothesis from a surprising fact. In general, there may exist infinitely many rules and hypotheses to explain such a surprising fact. Hence, we need to put some restriction on the class of rules. In rule-generating abduction, only one surprising fact is given. Hence, we also need to generalize the concept of a surprising fact. When we deal with such generalizations we must avoid over generalization. It should be determined whether or not a generalization is overgeneral by an intended model. However, it is hard to give such an intended model in advance in our rule-generating abduction. In this paper, we introduce a syntactical formulation of generalization, in which it can be determined whether or not a generalization is overgeneral by the forms of atoms and substitutions. On the other hand, by the restriction of rules, it sufEces to consider only two types of terms, constants and lists, and two types of substitutions with these two terms. By using the above generalizations and substitutions, we design an algorithm for rule-generating abduction, which generates rules and proposes hypotheses in polynomial time with respect to the length of a surprising fact. The number of rules and hypotheses is at most the number of common terms in a surprising fact. Furthermore, we show that a common term in some argument of a surprising fact also appears in the same argument of the proposed hypothesis by this algorithm.
1 Introduction	In general, the above inference schema is depicted
by a syllogism:
C. S. Peirce, who was a philosopher, scientist and logician, asserted that every scientific inquiry consists of three stages, abduction, deduction, and in-
C C^A A
duction (Peirce 1965, Yonemori 1982). According	In computer science, especially in computati-
to him, abduction is an inference which begins	onal logic and logic programming, many resear-
with an observation of a surprising fact, and pro-	chers have extensively studied abduction from va-
poses a hypothesis to explain why the fact arises,	rious viewpoints. Plotkin (1971) has studied ab-
Then, an inference schema by abduction is de-	duction together with inductive generalization. It
scribed by the following three steps (Peirce 1965,	is considered that Shapiro's model inference sy-
Yonemori 1982).	stem (Shapiro 1981) and inductive logic programming (Ling 1989, Ling & Ungar 1989, Muggle-
1.	A surprising fact C is observed.	^on 1992) are the extensions of Plotkin's work.
2.	If A were true, then C would be a matter of ^^^^^ (^^^l) and Genest et al. (1990) have sugge-
sted abduction for explanation-based generaliza-
CÜU.1 DC.
Hon, which is an efficient technique for obtaining
3.	Hence, there is reason to suspect that A is a general concept from examples and a backgro-true.	und knowledge. Thagard (1988) has introduced
analogical abduction, which is a kind of abduction incorporating with analogical reasoning. They are also related to machine learning and knowledge acquisition.
On the other hand, Pople has given another direction for the researches of abduction (Inoue 1992, Kunifuji 1987). It is considered that Poole's Theorist (Poole 1988, Inoue 1992, Kunifuji 1987), Kunifuji's hypothesis-based reasoning (Kunifuji 1987), and abductive logic programming (Dung 1991, Eshghi & Kowalski 1989, Kakas k Mancarella 1990, Kakas et al. 1992) are the extensions of Pople's approach. They are also related to knowledge representation.
In order to systematically understand these various researches of abduction, we have classified abduction into five types by the interpretation of syllogism (Hirata 1993); rule-selecting abduction, rule-finding abduction, rule-generating abduction, theory-selecting abduction, and theory-generating abduction. They are characterized as the following inference schema. More detail would be found in (Hirata 1993).
rule-selecting abduction :
C: surprising fact wrt P Select a rule C A in P Propose a hypothesis A in P
rule-finding abduction :
C: surprising fact wrt P Find a rule C ^ A in P' P) Propose a hypothesis A in P
rule-generating abduction :
C: surprising fact wrt P
Generate a rule C A in P «
Propose a hypothesis A in P
theory-selecting abduction :
C : surprising fact wrt B
Select a theory A such that A makes C true
Propose a hypothesis A
theory-generating abduction :
C: surprising fact wrt B
Generate a theory A such that A makes C true Propose a hypothesis A
By this . classification, the above researches are placed in the following positions (Hirata 1993): Abductive logic programming (Dung 1991, Eshghi k Kowalski 1989, Kakas &; Mancarella 1990, Kakas et al. 1992) is a sort of rule-selecting abduction. Abduction for explanation-based generahzation (Duval 1991, Genest et al. 1990) and analogical abduction (Thagard 1988) are a sort of rule-finding abduction. The constructive operators such as V and W operators (Muggleton 1992) in inductive logic programming are a sort of rule-generating abduction. Poole's Theorist (Poole 1988) and hypothesis-based reasoning (Kunifuji 1987) are a sort of theory-selecting abduction. Shapiro's model inference system (Shapiro 1981) and inductive logic programming (Ling 1989, Ling Sz Ungar 1989, Muggleton 1992) are a sort of theory-generating abduction.
In this paper, we investigate the rule-generating abduction. In the inference schema of rule-generating abduction, only one surprising fact is given. Hence, we need to generalize the concept of a surprising fact.
A generalization is an important tool for inductive logic programming and program synthesis. Plotkin (1970, 1971) has introduced and developed the least generalization and the relative least generalization. Arimura et al. (1991) have developed Plotkin's least generalization as the minimal multiple generalization. Note that all of them are researches on the generalization of at least two atoms. Thus, the following problem arises: Is the generalization of one atom worthwhile or not? Hirowatari and Arikawa (1994) have introduced the partially isomorphic generalization and answered this problem affirmatively in the framework of analogical reasoning.
When we deal with generahzations, we should avoid overgeneraHzation. It should be determined whether or not a generalization is overgeneral by an intended model. However, it is hard to give such an intended model in advance. Hence, we introduce a syntactical generalization of one atom, called a safe generalization. In general, an
atom is regarded as a relation between its arguments. Then, for safe generalizations, common ground terms are replaced by common variables.
In rule-generating abduction, if the class of definite programs is not restricted to some subclass, there may be infinitely many meaningless hypotheses. Hence, we introduce the subclass of definite programs, called single recursive programs on lists. Many typical Prolog programs are included in this class. However, without any heuristic, the number of single recursive rules on lists also increases exponentially with respect to the length of the surprising fact. In order to obtain the hypotheses efficiently by using safe generahzations, we design an algorithm for rule-generating abduction in single recursive programs on lists.
This paper is organized as follows: In Section 2, we introduce the concepts of single recursive rules on lists and partially isomorphic generalization necessary for the following sections. In Section 3, we formulate a safe generalization, which is based on the forms of atoms and substitutions instead of an intended model, and show some properties of safe generalizations. In Section 4, we show that the number of hypotheses in single recursive rules on lists also increases exponentially with respect to the length of the surprising fact. In Section 5, by using the above generalizations and substitutions, we design an algorithm for rule-generating abduction, which generates rules and proposes hypotheses in polynomial time with respect to the length of the surprising fact. The number of rules and hypotheses is at most the number of common terms in a surprising fact. Furthermore, we show that a common term in some argument of a surprising fact also appears in the same argument of the proposed hypothesis by this algorithm. In Section 6, we discuss the several examples for this algorithm.
language C.
For a term t, denotes the length of t. In particular, for a list Z, the length \l\ of Z is defined as follows: \l\ = 0, if Z is an empty list [ ]. Otherwise \l\ = n + 1, if t is a list	and \list\ = n.
First, we introduce the subclass of definite programs in order to reahze rule-generating abduction.
Definition 1 Let P be the following definite program:
P =

Then, P is called single recursive on hsts if P satisfies the following conditions:
1.	for any i {1 < i < n), ti is of the form either [Wj|Xi] or Xi, where Wi is a variable,
2.	for at least one j, tj is of the form
3.	for any i and j, (1 < i,j <n,i^ j), Xi and Xj are mutually distinct,
4- for any i, Si is of one of the forms [ ], X, [y] or
A rule p{ti, ■ ■ ■ ,tn) p{Xi, ■ ■ ■ ,Xn) is called single recursive on lists if it satisfies the above conditions 1, 2, and 3.
Example 1 The following Prolog rules in Sterling and Shapiro (1986) are single recursive rules on lists:
member(X, [W\Y]) ^ memhev{X,Y), prefix([VF|X],[-py|F]) ^ prefix(X,r), sufiix(X,	sufRx(X, Y),
append{{W\X],Y,[W\Z]) ^ append(X, F, Z), conc&i{X, \W\Y],\W\Z]) ^ concat(X, Y,Z).
2 Preliminary
For lists, a constant symbol is necessary to terminate recursion. This "empty list" is denoted by [ ]. We also need a functor [ | ] of arity 2, called a list constructor. Note that	is a list ad-
ding W to the top of a fist X. Historically, a list [W\x] is formulated by a dotted pair	In
this paper, we assume that a fist constructor [ | ] and an empty list [ ] are included in first order
Note that the rules of member and suffix have the same form. The first argument of member is a constant symbol, while that o/suffix is a list.
Let a, ß, and 7 be atoms. Then, ß is said to be more general than a, or a generalization of a if there exists a substitution 6 such that ßd = a. Furthermore, ß is called a maximal generalization if there exists no atom 7 such that — ß for any substitution 6. Also ß is called the greatest generalization if there exists a substitution 6 such
that ß9 = ^ for any atom 7 with the predicate symbol of ß.
Hirowatari and Arikawa (1994) have introduced the concept of a partially isomorphic generalization, which is the special generalization of one atom and is a useful tool for analogical reasoning. Let a be an atom. A term i is a replaceable term of a if i is a constant symbol or a term f{Xi, • • •, Xji), where / is a function symbol and each Xi is a variable which does not appear in the other terms in a. For a replaceable term t of a, let (x[t] be an atom obtained by replacing each t in a by a new variable Z which does not appear in a. Then we write ß ^ a when Q.[t\ is a variant of ß. We define as the reflexive and transitive closure of —
Definition 2 (Hirowatari & Arikawa 1994) Let a and ß be atoms. Then, ß is a partially isomorphic generalization of a if ß —a.
For a set of atoms 5, let [5] be the set of all equivalence classes of atoms in S. In particular, for any a € [5] and /3 € [5], a is a variant of ß.
Hirowatari and Arikawa (1994) have shown the following two theorems.
Theorem 1 (Hirowatari & Arikawa 1994) Let a be an atom and S be the set of all partially isomorphic generalizations of a. Then, [5] is a lattice whose partial order is —join operator is the greatest instantiation, and meet operator is the least generalization.
Theorem 2 (Hirowatari &; Arikawa 1994) Let a
be a ground atom p{ti, ■ • • ,tn) and k = H-----h
Then, a partially isomorphic generalization of a can be computed in 0{k'^) time.
3 Safe Generalization
It is an important problem to avoid overgenera-lization when we deal with generalizations. In general, whether or not a generalization of a is overgeneral is determined by an intended model. For an atom a, suppose that ß6 = a and M is an intended model. Then, /3 is an overgeneralization of a if there exists a ground atom 7 such that V/3 h 7 and M ^ 7 for M. However, the decision problem of whether or not there exists such ground atom 7 is undecidable. In rule-generating
abduction, only one surprising fact is given, and it is hard to give in advance an intended model. To overcome these difficulties, we introduce a syntactical generahzation of one atom.
Let 0 be a ground substitution, that is, 9 = := ti} and every term ti is ground. Let a be a ground atom and ß be an atom such that ß6 = a. Note that, throughout this paper, if ßQ = a, then a variable Xi € dom{9) appears in ß and ti ^ []. A substitution 9 is well-defined if, for any ti, there exists no term t j which is a subterm of ti.
Example 2 Let a be a ground atom p([a, ò], [ò]). Let ßi be the following atoms (1 < i < 5):
ß,=p{[a,X],[b]), ß2=p{[a\X],[b]), ß,=p{[X\Y],Y), ß, = pi[a\X],[Y]), ßs=pi[a\X],Y).
For any ßi, there exist the following substitutions 9i such that ßi9i = a (1 < i < 5).-
9, = {X := b), 92 = {X-.= [6]}, 03 = {X:=a,y:= [Ò]}, 04-{X := [b],Y:=b}, 9, = {X := [b],Y:= [Ò]}.
Then, 9^,92, and 9^ are well-defined, while 9^ and 95 are not.
Let a be a ground atom and ß be an atom such that ß9 = a. If a substitution 9 = Uf^^fXi := ti} is well-defined, then we can define a reversal =	:= Xi}. Note that, if 9 is well-
defined, then, for any ti and Xi, there exists no term tj such that tj is a subterm of ti and no variable Xi such that Xi = Xj{j ^ i). However, even if 9 is well-defined, ß is not always
Example 3 For ßi and ß2 in Example 2,
a9^' = p{[a,b],[b]){b
= p{[a,X],[X])^ß,, a9^' = p{[a,b],[b]){[b] ■.= X} = p{[a\X],X)j^ß2.
On the other hand,
a9^' = p{[a,b],[b]){a X, [b] :=Y} = p{[X\Y],Y) = ßs.
For the reversal 9"^, the following lemma holds.
Lemma 1 Let a be a ground atom and ß be an atom such that ß9 = a. Suppose that a substitution 6 = lJ2-i{Xi := ti] is well-defined. Then, ß = ad"^ if and only if no term ti appears in ß.
Proof Suppose that ß =	By the definition
of	is an atom which replaces all the
terms ti in a with the variable Xi. Then, no term ti appears in = ß.
For simplicity, suppose that 9 = {X := t}. If a term t appears once in a, that is, a has the form oi p{- ■ -t - ■ •), then	= p{- ■ - t- ■ :=
X} = p{- ■■ X ■ ■ •). Since a is ground and ß9 = a, = ß.
If a term t appears at least twice in a, that is, a has the form oi p{- ■ - t ■ ■ ■ t ■ ■ ■), then ß has the form of • ■ X • • • F • • •)■ If X ^ F, then, by ß9 = a,9 has the form of {X := t, Y := t}. This 9 is not well-defined. Hence, it is a contradiction. Then, X = Y, 9 = {X := t], and ß has the form oi p{- ■ ■ X ■ ■ ■ X ■ ■ ■). Hence, ß = aß-^ □
A ground term i [ ]) is a common term in a if i appears at least twice in a. In particular, if a common term is a ground hst, it is called a common list. Then, we formulate a safe generalization.
Definition 3 Let a be a ground atom, 9 be a substitution U^-i{Xi := ti}, and ß be an atom such that ß9 — a. An atom ß is a safe generahzation of a if (/3,6) satisfies the following safeness conditions.-
1.	9 is well-defined,
2.	ß = a9-^, and
3.	if there exists a common term in a, then there exists a ground term t j €	such that tj is a common term in a.
Let a be a ground atom and ß be an atom such that ß9 — a. Let t be some common term in a. If 9 is well-defined and has the form of X}, then ß is safe on a.
Example 4 Let a be a ground atom p([a, and ßi he an atom such that ßi9i = a (Q <i <8). Then, the common terms in a are [6] and b.
1. Let ßß be an atom p{X,Y) and 9q be a substitution {X := [a, b],Y := [ò]}. By the safeness condition 1, ße is not safe on a.
2.	Let ß-j he an atom ò],[y]) and 9j be a substitution {X := a,Y := b}. By the safeness condition 2, ßj is not safe on a.
3.	Let ßs be an atom p{X, [&]) and 9s be a substitution {X :— [a, 6]}. By the safeness condition 3, ßs is not safe on a.
For the above a, atoms p{[a\X], X), p{[a,X], [X\), p([F,X],[X]), andp{[Y\X\,X) are safe on a.
In general, an atom is regarded as a relation between its arguments. Thus, the syntactical generalization of one atom should be obtained by replacing common terms with common variables. The safe generahzation is an example of such generalizations. On the other hand, in single recursive rules on lists it suffices to consider only two types of terms, constant symbols and lists. Then, we also define the following two types of substitutions.
Let Ö be a substitution	:= ti}. Then,
0 is a constant substitution {resp., a list substitution) if every ti is a constant symbol {resp., a ground hst) without an empty list [ ].
In particular, a constant substitution is related to partially isomorphic generalizations which have been introduced by Hirowatari and Ari-kawa (1994). Note that, though a replaceable term includes an empty list [ ], the definition of a replaceable term is independent of the proof of Theorem 1. Let RT be the set of all replaceable terms of a and T C RT. Then, we can reformulate a partially isomorphic generalization by using T instead of RT, and show that Theorem 1 also holds for the set T of replaceable terms. Let a be a ground atom, 9c be a constant substitution, and ß be an atom such that ß9c = a. Thus, we assume that ß is a partially isomorphic generalization of a, whose replaceable terms are all constant symbols in a except an empty list [ ].
In Section 5, we apply rule-generating abduction to a list substitution 9i and a constant substitution 9c in the following way: Let a be a ground atom, that is, a surprising fact. First, by using a list substitution, we obtain an atom ß such that ß9i = a and ß is safe on a. Secondly, by using a constant substitution, we obtain an atom 7 such that 70c = ß and 7 is safe on /3. By the above assumption, 7 is also a partially isomorphic generahzation of ß.
48
Informatica 20 (1996) 43-55
K. Hirata
Unfortunately, 9c9i is not always well-defined, and 7 is not always safe on a. For example, let a be a ground atom p{[a,b,c],[b,c],[a,b,c]). Then, there exist the following atoms ft such that ßiöi = a and is a well-defined list substitution (9 < i < 11):
ß,=p{[a,b\X],[b\X],[a,b\X]), 09 = {X := [c]};
ß,o=p{[a\Y],Y,{a\y]),
6,0 = {Y := [b,c]y, ßn=piZ,[b,c],Z), en = {Z := [a,b,c]}.
Then, there exist no generalization 7 of /3 and substitution 17(7^ s) such that 7er = ßn and ffucr is well-defined. Note that there does not exist the greatest Hst generalization of a.
Let q; be a ground atom. Let ß and 7 be atoms such that ßdi = a and 70c = ß- Suppose that both {ß,6i) and (7,0c) satisfy the safeness conditions. Then, the following two theorems hold.
Theorem 3 If 6c9i is well-defined, then ^ is a safe generalization of a.
Proof Suppose that Qcßl is well-defined. Then, {liGcGi) satisfies the safeness condition 1.
Since both {ß, 9i) and (7,9c) satisfy the safeness conditions, (7, öc^z) satisfies the safeness condition 3.
By the supposition, ß = and 7 = ß9~^. The list substitution 0; replaces the common lists in a by variables. The constant substitution 0c replaces the same constant symbols in ß by the same variables and other constant symbols by other variables. Hence, the composition 0c0; replaces the common lists in a by variables, the same constant symbols in a except common lists by the same variables, and other constant symbols by other variables. By Lemma 1 and since 9c9i is well-defined, then {'y,9c9i) satisfies the safeness condition 2. □
Theorem 4 Suppose that any constant symbol appearing in common lists in a does not appear elsewhere in a except in the lists. If ß = and 7 = ß9~^, then 9c9i is well-defined. Hence, 7 is a safe generalization of a.
Proof Suppose that 0j =	:= k}, where
li is a common list in a. For any j-th argument's term tj of a, if t j includes k, then
h = i I
ai, a
[ai, 02-'
■	constant symbols
- ■ ■ ,<J-n- appear in k. Then, 0c does not include the binding X := c such that c appears in li. Hence, 0c9i is well-defined. By Theorem 3, {li9c9i) satisfies the safeness conditions. Then, for 7 such that ^9c9i = a, 7 is a safe generalization of a. □
Hence, we can conclude that the condition of Theorem 4 is regarded as one of a good surprising fact for the discussion in Section 5. In other words, we consider that the atoms which does not satisfy the condition, such as p([a], [a], [a, a]) and p([a, a], [6, a]), have no enough information to determine which constant symbols or common lists should be generalized.
4 Number of Hypotheses
An inference schema of rule-generating abduction is described by the following three steps:
1.	A surprising fact C is observed.
2.	A rule C <— A is generated in P.
3.	A hypothesis A is proposed.
In this paper, we deal with the most simple rule-generating abduction such that P = (j). Suppose that a ground atom p(ti, • • •, is given as a surprising fact. Then, by rule-generating abduction, the following single recursive rule on lists
is generated and the hypothesis p(si,---,s„) is proposed. Here,
The above inference schema is depicted by the following syllogism:
p{h, ■■■ ,tn)

Pis'l,-P{si,-

In definite programs, if the class of programs is not restricted to some subclass, then there are also infinitely many meaningless hypotheses.
In this section, we investigate the number of rules generated by rule-generating abduction. Unfortunately, the following Theorem 5 claims that,
even if the generated rule is single recursive on lists then the number of hypotheses increases in exponential order with respect to the length of an atom.
Theorem 5 Letp{ti, ■ ■ ■ ,tn) be a surprising fact. Then, the number of single recursive rules on lists which satisfies the above syllogism is at most 3".
Proof See Appendix. □
By using safe generalizations from Section 3, we design an algorithm to generate single recursive rules on lists as follows: Suppose that a ground atom a is given. First, by generalizing a with a list substitution di, we obtain an atom ß such that ßOi = a and ß is safe on a. We call such a generalization ß a list generalization of a. Secondly, by generalizing ß with a constant substitution 0ci we obtain an atom 7 such that = ß and 7 is safe on ß. We call such a generalization 7 a constant generalization of ß. Note that 7 is also assumed to be a partially isomorphic generalization of ß. For this algorithm, the number of rules is at most the number of generalizations. Then we investigate the number of generalizations, in particular, the number of maximal generalizations.
Let a be a ground atom p{ti,- ■ •, For all common lists in a, we can classify them by the suhlist relation. For example, let a be the following ground atom:
p{[a,b,c,d\, [c, d], [ò, c], [c], [Ò, c, d])
{= p{tl,t2,t3,t4,t5)),
and ti be the i-th argument's term of a. Then, t2,t4, and are common lists in a. By the sublist relation, we classify them into {^2,^5} and {Ž4}.
The number of maximal generahzations is characterized as follows.
Theorem 6 Let a be a ground atom p{ti, ■ ■ ■, tn) and I be the number of classes in ti by the sublist relation. Then, the number of the maximal generalizations is at most
(i^T
Even when 1 = 1, thè number of the maximal list generalizations of a -is at most [V^j ■
Proof See Appendix. □
By Theorem 6, the number of single recursive rules on lists, even if we adopt the maximal list generahzations, increases exponentially with respect to n. Hence, in the next section, we restrict the forms of generahzations, and design the algorithm for rule-generating abduction whose number of hypotheses is at most n.
5 Algorithm PROPOSE
In this section, we design an efBcient algorithm for rule-generating abduction, which is called PROPOSE. In the algorithm PROPOSE illustrated in Figure 1, we restrict the reversal for list generalizations to the form of {f := X}, where t is both a common list in a and some argument's term of a. Obviously, the list generahzation a{X := t} is' safe on a. Then, the following lemma holds.
Lemma 2 The number of single recursive rules on lists generated by the algorithm PROPOSE is at most n.
Proof The number of terms that are both a common term and some argument's term is at most n. Then, the number of elements of L is at most n. Hence, the number of hypotheses is also at most n. □
An important feature of the algorithm PROPOSE is that, if the i-th argument's term is some common list in a, then the i-th argument's term of the head of the generated rule is a variable; otherwise, it is a list. Furthermore, by the algorithm PROPOSE, the rules in Example 1 are constructed from one ground atom.
In Figure 1, rs-abd{fact, head body, hyp), which is rule-selecting abduction, is a procedure to propose a hypothesis hyp such that {head)a = fact and {body)a = hyp for some substitution a.
For the algorithm PROPOSE, the following two theorems hold.
Theorem 7 Let a be a ground atom p{ti, ■ • ■ ,tn) and k	-[-••• + |in|. Then, the algorithm
PROPOSE computes the rules and hypotheses in 0{P) time.
Algorithm PROPOSE input a = p{ti,- • • ,tn) : a ground atom output head <— body : a rule 6 : a hypothesis
L := {ß \ ß = a{ti := Vi},U: common list in a}
U{q;}; /* ß : safe on a */ while L ^ (f) do select /? € L;
7 := the greatest constant generalization
p(si,---,s„) of /?; for ž = 1 to n if Sj = [ ] then
output 7 true output true halt;
else if Si is a variable then head-argi := Xi; /* Xi is a new variable */ else /* Si ^[Wi, ■■■] */ head-argi :=
/* Xi is a new variable */ end end
head := p{head-argi, • ■ •, head-arg^)] /* head-arQi = [WÌ\Xi] or */ body—	• •
output head body rs-abd{a,head body,6)] output s L:=L-{ß}-, end
Figure 1: Algorithm PROPOSE
Proof For any ti, it can be determined whether or not ti is a sublist of tj in 0(|ti|). Then, for any i, it can be determined whether or not ti is a sublist of any tj(j i) in 0{{n — l)|ti|). Hence, the set L in the algorithm PROPOSE can be constructed in 0((n — l)k).
For the selected ß in L, the greatest constant generahzation of ß is also a partially isomorphic generahzation of ß. By Theorem 2, a partially isomorphic generahzation 7 of /3 can be computed in 0{k'^). Since the procedures in the for-loop can be computed in 0(n), the for-loop terminates in O(n^). Then, the procedures in while-loop can be computed in	-h v?). Since the number
of elements in L is at most n by Lemma 2, the while-loop terminates in	+ Hence, the
algorithm PROPOSE terminates in 0((n - l)k + k'^n + n^).
Since n < k, the algorithm PROPOSE computes rules and hypotheses in 0{k^) time. □
Theorem 8 Let a be a ground atom p{ti, ■■■ ,tn) and 6 be the proposed hypothesis p{si, ■ ■ ■ , Sn) by PROPOSE. If there exists a selected common list I in a by the algorithm PROPOSE and I appears in ti, then I also appears in Si.
Proof Let I be the selected common list in a by PROPOSE. If the i-ih. argument's term ti of a is I itself, then the i-th argument's terms of both the head and the body of the generated rules are variables Xi. Then, the i-th argument's term Sj of 6 is also I.
Suppose that I appears in another argument's term of a, and ti has the form of [a^, a^, • • •, By the algorithm PROPOSE, the z-th argument's term of the head of the generated rule is a list \Yl\Xi], where Y^ is a variable corresponding to a\, while one of the body is a variable Xj. Then, for the hypothesis 6, the i-th argument's term Si of 6 has the form of [a\, • ■ •, a^.
Hence, the selected common list I also appears in the 2-th argument's term Si of 6. □
Theorem 8 claims that, if a given ground atom satisfies the relation on common lists, then the proposed hypothesis by the algorithm PROPOSE also satisfies it.
6 Examples
In this section, we discuss the several examples for the algorithm PROPOSE.
Example 5 Let a be a ground atom p{[a,b],[c,d],[a,b,c,d]). The list [c,d] is both a common list in a and the second argument's term of a. By the construction of L, ß = p([a, Ò], V2, [a, ÒIV2]) is a safe list generalization of a and 7 = p{[X,Y],V2,\X,Y\V2\) is the greatest constant generalization of ß. The first argument of 'j is a list which begins with X. The second argument is a variable V2, and the third argument is also a list which begins with X. By the for-loop in PROPOSE, we obtain a head p{[X\X,],X2,[X\Xs]) and a body piXi,X2,X3). Hence, PROPOSE generates a rule
p([X|Xi],X2, [XIX3]) ^ X2, X3),
and proposes a hypothesis p{\b],[c,d],[b,c,d]). The predicate p means the append in Example 1.
Since L includes a, then ß = a, and 7 = p{\X,Y],\Z,W],[X,Y,Z,W]). The first argument of 7 is a list which begins with X, the second argument is a list which begins with Z, and the third argument is also a list which begins with X. By the for-loop in PROPOSE, we obtain a head p{[X\Xi],[Z\X2],[X\Xz\) and a body p{Xi,X2,Xz). Hence, PROPOSE generates a rule
and proposes a hypothesis p{[b], [d], [ò, c,d]). The predicate p means that all arguments' terms are lists and the first argument's term is the prefix of the third argument's one.
Furthermore, each of the rules and hypotheses by PROPOSE respectively satisfies the following syllogisms:
p{[a,b], c,d], [a, 6, c,d )
p{[X\Xy ,X2,[X|X3])
p{[bUc,dl[h,c,d])
p([a,&],[c,d],[a,ò, c,d])_
[^1X2], [XIX3]) ^ p(Xx, X2, X3) pi[b],[d],[b,c,d})
Example 6 Let a be a ground atom p{a,[a,b]). Since there exist no common lists in a, then ß =
p(a, [a,ò]), and 7 = p{X, [X, Y]). The first argument of 7 is a variable X, and the second argument is a list which begins with X. By the for-loop in PROPOSE, we obtain a head p{Xi, [X\X2]) and a body p(Xi,X2)- Hence, PROPOSE generates a rule
p{Xu[X\X2])^p{Xi,X2),
and proposes a hypothesis p{a, [ò]). Note that the predicate p means the member in Example 1.
Let a be a ground atom p([a], [a, 6]). Since there exist no common lists in a, ß = p([a], [a,^]) and 7 = p([X], [X, y]). The first argument of 7 is a list which begins with X, and the second argument is also a list which begins with X. By the for-loop in PROPOSE, we obtain a head p{[X\Xi],[X\X2]) and a body p{X 1^X2). Hence, PROPOSE generates a rule
p{[X\X,},[X\X2])^p{Xi,X2),
and proposes a hypothesis p{{ ], [6]). Note that the predicate p means the prefix in Example 1.
Let a be a ground atom [a, &]). The list [6] is both a common list in a and the first argument's term of a. Then, ß = p{Vi, [a|Vi]) and 7 = p(Vi, [X|Fi]). The first argument ofj is a variable Vi, and the second argument is a list which begins with X. By the for-loop in PROPOSE, we obtain a headp{Xi, [XIX2]) and a body p(Xi, X2). Hence, PROPOSE generates a rule
and proposes a hypothesis [6]). Note that the predicate p means the suffix in Example 1. On the other hand, since L includes a = p{[b], [a,b]), then ß = a, and J = p([X], [F, X]). The first argument of 7 is a list which begins with X, and the second argument is a list which begins with Y. By the for-loop in PROPOSE, we obtain a head p([X|Xi],[y|X2]) and a body p{Xi,X2). Hence, PROPOSE generates a rule
p{[X\Xr],[Y\X2])^p{X^,X2),
and proposes a hypothesis p{[ ], [ò]). Note that the predicate p means the defining lists, that is, all arguments' terms are lists.
If a = p{[a, 6], [c, d], [a, b, c, d]), then PROPOSE generates a rule
PÌ[X\Xi],X2,[X\X^]) ^ piXi,X2,X:i),
and proposes a hypothesis p([ò], [c, d], [6, c,d]). If a = p{[a, è], [a, b, c, d], [c, d]), then PROPOSE also generates a rule
p{[X\Xi], [X\X2],Xs) ^ p{XuX2,Xs),
and proposes a hypothesis p{[b],[b,c,d\,[c,d\). Hence, the algorithm PROPOSE is independent of the order of arguments.
Furthermore, by the construction of member and suffix in Example 6, the algorithm PROPOSE is also independent of the types, such as lists and constant symbols, of argument. In other words, PROPOSE needs no types of arguments.
We can realize the algorithm PROPOSE in a Prolog program. The predicate propose returns a generated rule as the third argument. It also returns a hypothesis proposed by the generated rule as the second argument.
For the following five surprising facts
p{{a,b],[c,d],[a,b,c,d]), p(a,[a,6]), p([a], [a, Ò]), [a, i»]), P([a, b, c], [b, c], [a, b, c|),
the results of propose are as follows:
: ?- propose(p([a.b],[c,d],[a,b,c,d]),X,Y). X = p([b],[d],[b,c.d]),
Y	= p([_2|_5],[_l|_4],[_2L3]):-p(_5,_4,_3); X = p([b],[c,d],Cb,c,d]),
Y	= p(C_lL4],_2,[_lL3]):-p(_4,_2,_3); no
: ?- propose(p(a.[a,b]),X,Y). X = p(a,[b]).
Y	= p(_2,[_l|_3]):-p(_2,_3); no
: ?- propose(p([a],[a,b3),X,Y). X = p([],[b]),
Y	= p(Ll|_3],HL2]):-p(_3,_2); no
: ?- propose(p([b],[a,b]),X,Y). X = p([],[b]),
Y	= p([_2|_4],[_l|_3]):-p(_4._3); X = p([b],[b]),
Y	= p(_2,[_l|_3]):-p(_2,_3); no
: ?- propose(p([a,b,c],[b,c],[a,b,c]),X,Y) . X = p([b,c] ,[c].[b,c]),
Y	= p([_2|_5],[_l|_4],L2L3]):-p(_5,_4,_3); X = p(Cb,c],Cb,c],Cb,c]),
Y	= p(C_l|_4],_2,[_l|_3]):-p(_4,_2,_3); X = p([a,b,c],[c].[a,b,c]),
Y	= p(_3,[_lL4],_2):-p(_3,_4._2); no
In the last example, two atoms	[«1^2])
and p(Vi, [Ò, c], Vi) are the safe list generalizations of p([a, 6, c], [b, c], [a, b, c]) obtained by the algorithm PROPOSE. Hence, for a ground atom p{[a,b,c],[b,c],[a,b,c]), there are three hypotheses and rules. The first rule obtained by PROPOSE means that all arguments' terms are lists, and the first and the third arguments' terms begin with the same constant symbols. The second rule means that the second argument's term is the sublist of the first and the third arguments' terms. The third rule means that the first argument's term is equal to the third argument.
On the other hand, for a ground atom p{[a, b, c], [d, e], [a, b, c]), the predicate propose returns the following two rules and hypotheses:
: ?- propose(p([a,b,c],[d,e],[a,b,c]),X,Y). X = p([b.c].[e],[b,c]),
Y	= p([_2|_5].[_lt_4],[_2L3]):-p(_5,_4,_3); X = p([a,b,c],[e],[a,b,c]),
Y	= p(_3,[_l|_4]._2):-p(_3,_4,_2); no
From this surprising fact, the rule whose second argument's term is the sublist of the first and the third arguments' terms is not generated by PROPOSE.
7 Conclusion
In this paper, we have introduced single recursive programs on lists for rule-generating abduction. We have also discussed a safe generalization, which is a generalization of one atom whose common terms are replaced by common variables. We have given some properties of safe generalizations. In rule-generating abduction for single recursive programs on lists, we have shown that the number of hypotheses increases exponentially with respect to the length of a surprising fact. On the other hand, by using safe generalizations, we have designed an algorithm called PROPOSE to construct single recursive rules on lists. The number of hypotheses by PROPOSE is at most the length of a surprising fact. We have shown that the algorithm PROPOSE generates rules and proposes hypotheses in polynomial time with respect to the length of a surprising fact. Also we have shown that the selected common fist in some argument
of a surprising fact appears in the same argument of the hypothesis proposed by PROPOSE.
We have left several problems as future work.
Abduction is an inference to propose hypotheses to be used before deduction and induction. We have left as a future work to combine abduction and inductive logic programming. Ling (1989, Ling & Ungar 1989) has introduced the constructive inductive logic programming. There may exist a relationship between this work and this paper.
The results in this paper depend on the simplicity of the hypothesis space for rule-generating abduction. Hence, we need to extend the result of this paper to the wider class, for example, the programs consisting of multiple recursions, including local variables, and dealing with terms except lists. Furthermore, as to the inductive logic programming, we have left the problem of predicate invention. The number of rules and hypotheses becomes exponentially large. Hence, we need to introduce some heuristics such as on a distinction between a necessary and useful intermediate term, the number of local variables, invented predicate symbols and rules, and so on.
All researches on abduction in computer science are based on computational logic, and the abduction beyond the scope of computational logic such as abduction for formal language or numerical data has not yet been studied. On the other hand, abduction begins with a surprising fact. In machine learning, a surprising fact is considered as a good example. Hence, abduction is regarded as learning from good examples. We need to study the abduction for various frameworks together with machine learning. This should be one of the most important future tasks.
Acknowledgment
The author would like to thank Prof. Setsuo Ari-kawa and Dr. Eiju Hirowatari for many insightful comments and precious discussions. He also thanks anonymous An'94 and Informatica reviewers for valuable comments.
References
[1] Arimura, H., Shinohara, T. Sz Otsuki, S.
(1991)	A Polynomial Time Algorithm for Finite Unions of Tree Pattern Languages. Proceedings of ALT'91, pl05-114.
[2]	Dung, P. M. (1991) Negation as Hypothesis: An Abductive Foundation for Logic Programming. Proceedings of ICLP'91, p 3-17.
[3]	Duval, B. (1991) Abduction for Explanation-Based Learning. Proceedings of EWSL'91 LNAI 482, p 348-360.
[4]	Eshghi, K. & Kowalski, R. A. (1989) Abduction Compared with Negation by Failure. Proceedings of ICLP'89, p 234-254.
[5]	Genest, J., Matwin, S. k. Plante, B. (1990) Explanation-Based Learning with Incomplete Theories: A Three-Step Approach. Proceedings of ML'90, p 286-294.
[6]	Hirata, K. (1993) A Classification of Abduction: Abduction for Logic Programming. Machine Intelligence 14 (to appear).
[7]	Hirowatari, E. & Arikawa, S. (1994) Partially Isomorphic Generalization and Analogical Reasoning. Proceedings of ECML '94, LNAI 784, p 363-366.
[8]	Inoue, K. (1992) Principles of Abduction. Journal of Japanese Society for Artificial Intelligence 7, p 48-59 (in Japanese).
[9]	Kakas, A. C. & Mancarella, P. (1990) Gene-rahzed Stable Models: A Semantics for Abduction. Proceedings of ECAI'90, p 385-391.
[10]	Kakas, A. C., Kowalski, R. A. & Toni, F.
(1992)	Abductive logic programming. Journal of Logic and Computation 2, p 719-770.
[11]	Kunifuji, S. (1987) Hypothesis-Based Reasoning. Journal of Japanese Society for Artificial Intelligence 2, p 22-87 (in Japanese).
[12]	Ling, X. (1989) Learning and Invention of Horn Clause Theories - A Constructive Method. Methodologies for Intelligent Systems 4, p 323-331.
[13]	Ling, X. & Ungar, L. (1989) Inventing Theoretical Terms in Inductive Learning of Functions - Search and Constructive Methods. Methodologies for Intelligent Systems 4 p 332-341.
[14]	Muggleton, S. (ed.) (1992) Inductive Logic Programming. Academic Press.
[15]	Peirce, C. S. (1965) Collected papers of Charles Sanders Peirce (1839-1914). Hart-shone, C. S., Weiss, P.(eds.), The Belknap Press.
[16]	Plotkin, G. D. (1970) A Note on Inductive Generalization. Machine Intelligence 5, p 153163. .
[17]	Plotkin, G. D. (1971) A Further Note on Inductive Generalization. Machine Intelligence 6, p 101-124.
[18]	Poole, D. (1988) A Logical Framework for Default Reasoning. Artificial Intelligence 36, p
■ 27-47.
[19]	Shapiro, E. Y. (1981) Inductive Inference of Theories from Facts. Research Report 192, Yale University.
[20]	Sterling, L. k Shapiro, E. (1986) The Art of Prolog. The MIT Press.
[21]	Thagard, P. (1988) Computational philosophy of science. The MIT Press.
[22]	Yonemori, Y. (1982) Peirce's Semiotics. Ke-isou Syobou (in Japanese).
Appendix
Proof of Theorem 5
Suppose that the generated rule is of the form p{ui, - ■ ■ ,Un) p{vi, - ■ ■ ,Vn) and the generated rule is single recursive on lists. The number of combinations such that Ui has the form of [Wi|Xi] for 7 arguments in n arguments is at most nCj. In such j arguments, the number of combinations such that at least one Vi has the form of Xi is at most
j i=2
where 1C2 is assumed to be 1.
Then, the total number of hypotheses is at most
i=i
j
\i=2
/
Hence, we obtain the following formulas: / j \
j=l \i=2 / j=l
n
< 3".
Here, we have used the following formula:
j2nCiXx' = {l + Xr. i=0
Proof of Theorem 6
Let a be a ground atom p{ti, ■ ■ ■ ,tn) and Kn be the number of the maximal list generalizations of a. If Z = 1, then we can find the upper bound of Kn in the following way.
For simplicity, suppose that the common lists in a are ii, • • ' > ^n-i, and \ti\ > \t2\ > ■ ■ ■ > We denote the generalization a{tj^ :— ^hr ■ ■,tjf ■■= Xjj} by	Note that
is not a common list in ß(j^). Furthermore, for
ß{ji,ji-i-a,ji-i-a+b)
(a,b = 2 or 3), there exist substitutions dj^+a+b, öj.+a, and 6j. such that
ß{jiJi+o.) = ß{ji,ji+a,ji-\-a-\-b)ßji+a+b, ß{ji,ji+a+b) - ß(ji,ji-^a,ji+a+b)&ji+a, ßUi+a.,ji+a-^b) = ß(ji,ji+a,ji-^a-\-b)Sji-
Hence, /?(j, j-,+a), ß(ji,jHa+b), and	are
not maximal list generalizations.
By using the indices of ß, Kn is equal to the number of the sequences (ji, • • •, jy) which satisfy the following conditions:
1.	31 = 1 or 2,
2.	jf = n — 1 or n, and
3.	the adjacent number of ji is either ji -I- 2 or ji + 3.
For example, if n = 8, then the following seven sequences
(1,3,5,7), (1,3,6), (1,4,6), (1,4,7), (2,4,6), (2,4,7), (2,5,8)
satisfy the above conditions. For the sequence {ji,'''ijf) which satisfies the above conditions, the number of sequences such that ji = 1 is greater than ji = 2. Let An be the set of the sequences (ji, • • • ,jf) which satisfy the above conditions and ji = 1. Then, < 21A„|. Furthermore, for n > 6, we can construct the set A„ in the following way:
1- if {ji,---,jf) G An-2, then	€
2. if (il, e
then	1) 6
Hence, \An\ satisfies the following equations: |A3| = 1, |A4| = 1, |A5|=2,
|A„| = |A„_2| + |A„_3| (n>6).
By mathematical induction on n, we obtain the following formula:
^ < < (V2)" ' (n > 6).
Hence, the number Kn of the maximal list generalizations is characterized by the following formula:
Kn < 2	= (V2)" .
Note that this formula also holds for any n>l.
Let I be the number of classes by the sublist relation and Cj be such a class for I < j < I. For any Cj, the number of the sequences which satisfy
the above conditions is at most ( v2) . Then, the number Kn of the maximal generahzations is at most

\c,\
Hence, Kn is also characterized as the following formula:
/
Kn<
M
I
n I
Bottom-up Layout Generation
Walter Hower
Department of Computer Science, University College Cork, National University of Ireland, College Rd, Cork, Ireland E-mail: walterScs. ucc. ie
Keywords: layout design, constraint satisfaction Edited by: Rudi Murn
Received: June 20, 1995	Revised: August 29, 1995 Accepted: October 25, 1995
The constraint satisfaction problem (CSP) coniiprises a set ofn variables with associated fìnite domains, and some combinations of value assignments ("constraints") to the variables; then, in order to get the globally consistent solution, we need to compute the set of all n-tuples consistent with the given constraints. Here we consider a CSP modeling of a layout problem where we have to place rectangles ("objects") inside a target frame such that the objects do not overlap. However, we further constrain the problem to obey the following requirements: (a) Try to compute all layout possibilities. (Just the answer whether a layout exists at all or to compute only one possibility should not suffice.) (b) The user shall be able to interact with the (semi-) automatic layout system in a way such that s/he may pick an object to place it in a subarea, offered by the system, in an arbitrary manner without the need to think about the placement of the other objects. (After an algorithm's termination a globally consistent layout should be guaranteed.)
1 Preliminaries	CSP approach. [BD95] reports on a CSP solving
approach using a genetic algorithm component.) The constraint satisfaction problem (CSP) com- ^^ ^^^ j^^g^^	^^^	^^^
prises a set of n variables with associated finite	qualities of constraint solving techniques: local
domains, and some combinations of value assi-	propagation and global constraint sa-
gnments ("constraints") to the variables; then, m	Briefly speaking, the distinction is as
order to get the globally consistent solution, we	Local propagation just considers those
need to compute the set of all n-tuples consistent	^^^ p^^^lem which are closely related to
with the given constraints.	^^^^ ^^^ works only on a subset of the pro-
There are a lot of papers which exploit specific	gj^^al satisfaction however copes with the
CSP features, and others focus on local constraint	j^^j^t^er as a whole. Or from another point of view:
propagation. However, finally, the global solution	^ j^^al technique usually tries to rule out values
is wanted and in this regard, we would hke to	^j^ich are identified to be of no further use and
introduce here a general framework which really	thereby potentially reduces the domains of the va-
yields this globally consistent solution. Anyway,	riables;^ a global algorithm however keeps track
to start directly with a higher-level mechanism	^f the interrelation of the values and maintains
may not be a bad idea; see also [SF94] (p. 126)	complete combinations. The result of local propa-
in this regard. ([HJ94] illustrates a distributed	g^tj^n^ ^^e so-called "locally consistent solution",
—r----is an n-tuple of (domain) sets, whereas the result
Striving for all combinations is important m a lot of r 1 i 1 ■ r ■ 1	n 1 n i 1 ,1
real-world applications; for instance, modelling a set of g^O^al satisfaction, the SO-Called "globally con-
traffic lights requires all possible colour assignments; see	sistent Solution", is a Set of n-tuples enumerating
[How95]. Furthermore, in computer-aided layout design all
potential configurations should be offered to the user; this	_
feature is one of the reasons to employ a constraint-based	^also pointing to an assignment mode mentioned below
approach. (You may consult [Van95].)	^herewith we point to the label filtering mode
ali possible combinations.'' Although local propagation is a fast technique we would like to note the fact that local consistency does not imply gloljal consistency; cf. also [Kra92] (p. 62, footnote).^
Thus, the locally consistent solution just serves as a pre-processing state; the correct instantiations of the variables must still be tried. Besides, in the graphics domain, local propagation often works in an assignment mode to just inform other variables about some variable instan-tiation(s). Then, when k (< n) variables are given in a subnetwork, we need the instantiation of A; — 1 variables to determine the value of the remaining one.® (However, some systems even allow the propagation of algebraic expressions.) In the papers taken into consideration local propagation has been the method mostly preferred— probably due to the efficiency of the procedure. Unfortunately, it sometimes leaves to the reader to imagine how global consistency is achieved.''' (Backtracking-hke algorithms are discussed in [ZH96].)
In view of the necessity to compute the globally consistent solution we would like to discuss a general framework (working on constraints with arbitrary arities) incorporating the global constraint satisfaction view ([How95]^). The main point to employ such an exhaustive approach is the need of the user of a computer-aided design (CAD) layout system to choose among various or even all options one may have to place the objects. The procedure works in the domain of rectangles, and its employment even in the domain of triangles^ has just been completed. (In CAD and layout design it is enough to consider rectangles and triangles; we may refer to the triangulation in the area of computer graphics.)
Also [AB93] deals with rectangles. In [Bor81]^° the weakness of local propagation has already been recognized, and it refers to the need of a glo-
^Please note that "local" and "global" may both refer to the way how to relate constraints and to the solution quality of a CSP.
®Please consider [Dv95] regarding the way of the quality of a locally consistent solution to a globally consistent one. ®typically for equational constraints ^In some cases just simple assignment techniques seem to get employed.
®which I have already further improved in the meantime ®where at the moment two sides must still be in parallel to the co-ordinate axes
^"see also fSan94] w.r.t. further developments
bal analysis—see also [HHLM92] (p. 302). [FS93] states the fact that binary constraint networks are often not expressive enough and prefers user interaction when some modifications come up (pp. 1454/1456 there). ([Ros93] points to "un-do" operations in CAD.) [Szw94] has presented the use of constraint processing in the area of graphics. [HF94] deals with continuous domains. [Fal94] nicely motivated a constraint-based approach to CAD; see also [HHR94].
There are already special purpose routines for the examples introduced herein. The intent of this paper however is to briefly illustrate how a general global constraint solver is able to function well in CAD. ([HRB93]^i may be regarded as an initial precursor.) One of its main features is its ability to interact with the user in a way such that after the termination of the algorithm (employed for layout design) the user may arbitrarily place any object inside a subarea offered by the system while it is guaranteed that an arbitrary placement of a different object in another subarea (again offered by the system) in a following step is always possible; this guarantee is ensured by the global consistency of the solver.
In order to point to related work in a really broad sense we have compiled about 200 papers and collected this part in a separate work: [HG95]. Therefore, only an unusually brief list of appreciated articles is included here. (This fact hopefully does not matter because we do not compare the various appproaches; we would just like to illustrate a global constraint satisfaction view.)
2 Algorithmic framework
The procedure employs a global constraint satisfaction solver where all the various layout possi-bihties are generated by a bottom-up approach. The CSP algorithm in its use here just employs an upward phase starting from the variables' domains (the 1-ary constraints) and proceeds until the n-ary constraint (the globally consistent solution) is finally computed.
As already introduced n is the number of the variables of the CSP. Let the index set I associated with a constraint Cj denote the variables involved in C/; the cardinality of this index set is the arity of the constraint. The maximum arity
11
see also [How94]
of the constraints is the highest index level: h := maXfy^{\I\ I I C {!,...,n}}. (The "natural equi-join" operation ix (see also [Bib88] and [JCG95]) may get understood as in the database field.) We use the following abbreviations: C Si := entire set of constraints of arity i whose index sets are proper subsets of the index set of a specific constraint of arity i+ 1. (See the formal description of the algorithm. |C5j| = i + 1.)
ixi,
vcecs;
C := IX C/2 XI ... 1X1 C/,
CSil"
We do not have constraints with an arity larger than h] so, when it is possible to establish consistency among h variables arbitrarily chosen ("/i-consistency") then we may try to compute the globally consistent solution ("n-consistency"). The algorithm already realizes this idea.
The following lattice structure symbolizes a CSP with four variables:
{3,4}
1 2
3
4
5
6 7
{1}	^{3} ^{4}
algorithm globaLupwardskip
for i 'S— 1 to /i — 1 for each Cj :\J\ = i + l CSi ^ {Ci I \I\ = i,ICJ} Cj^Cjn i^^cecs, C) if for each Ch {\H\ = h) \Ch\ + 0 then skip else INCONSISTENCY
Algorithm 1: Global upward skip
1	procedure skip
2	C'{l,...,n}

3	if	^ {}
4	then CONSISTENCY else INCONSISTENCY
In order to reformulate a problem to get a discrete search space we group corresponding values to form an interval; the reasoning then takes place along the boundaries of the value intervals. (See also [EBH93] where interval domains repre-. sent ranges of values.)
3 Illustration
Let the topological^^ CSP solver TopCsp work on the following short example.^^ The problem is similar to the one once found in the newsgroups comp.ai and comp.graphics.algorithms: "configuration/encapsulation problem": given a larger rectangle and several smaller ones we would Hke to know all the possibilities of the smaller rectangles to fit into the larger one. To perform this task we introduce a two-dimensional variable for each rectangle indicating the range of the lower left corner point (P') and the range of the upper right corner point (P'"); each point itself has two dimensions (along both the x- and y-axis).
For instance, a given larger rectangle has the dimensions 9 x 11. This results in the following co-ordinates in a fixed co-ordinate frame:
P': (0,0), P' : (9,11).
Let further three other (smaller) rectangles I'll 1^2, Ts be given with the following information on width (w) and height (/i):
:= 5, hi := 2; W2 := 3, /i2 1; w^ ■= 4, /13 := 10.
Now we denote the various possibilities (initially given by the dimension of the frame rectangle along with the width and height information) for these corner points of the smaller rectangles by an interval notation which yields the following ranges:
Pi: ([0..4],[0..9]),
Pi:
n--
P[:
([5..9],[2..11]), ([3..9],[1..11]), ([0..5],[0..1]), PJ: ([4..9],[10..11]),
([0..6],[0..10]), PJ:
which corresponds to the computation of the 1-. ary constraints.^^
The procedure computes the 2-ary constraints by checking the different choices of the placement of two rectangles: whether it is possible to place a rectangle left (or right) of another one or below (or above) it. Therefore, we have the following possibihties connected among each other via "or" w.r.t. the rectangles ri and r^:
^^Winfried Graf has proposed this term to me; see also [Gra95].
^®Please note that such a simple example allows to focus the discussion on the main points; a more "real-world" example would veil the focal point and would be much harder to introduce.
'^In fact, we only maintain the lower left corner point; the upper right one is just a convenience to us.
60 Informatica 20 (1996) 57-63
W. Hower
Xpr < Xpl^
ypi < ypi yp; < ypi
(ri is left of r2) (r2 is left of ri) (ri is below r2) (r2 is below ri).
In the following we take the lower left corner point as the reference point.
To re-compute the boundaries of the intervals we introduce the following notation: mm^^ denotes the smallest possible x-value for the reference point of object i (the left bound of the x-interval), and max^^ denotes its largest possible x-value (the right bound of the a;-interval). (In the example above, miux^ '■= 0, maXxi '■— 4; minx2 0, maxx2 ■= 6.) Obviously, miny. denotes the smallest possible y-value for the reference point of object i (the left bound of the y-interval), and max^. denotes its largest possible ?/-value (the right bound of the y-interval). (In the example above, miuy^ := 0, maxy^ := 9; min.
'y2
:= 0, maxy^ := 10.)
The following two computation rules are needed in the first case ("ri is left of r2"):

if miUxi -f twi > min. minx2 := min^-^ -f wi\
if maxxi > maxx.2 — wi maxxi ■= maXx2 — w^i-
then
then
Now it becomes visible that in fact just the lower left corner points are maintained.
To deal with the second case ("r2 is left of ri") you just interchange the numbers "1" and "2" in both rules.
The following two computation rules are needed in the third case ("ri is below r2"):
if miny^ -[-hi > min. mm„2 := miny.^ + hi\
y2
if max.
yi
> maXy2 — hi
then then
maxy.^ :— maXy2 - hi..
Pi: ([0..1M0..9]), F{: ([5..6], [2..11]); Pj: ([5..6],[0..10]), F^: ([8..9], [1..11]). Checking yp^ < Vpi yields:
Pi-. ([0..4],[1..9]), Pf: ([5..9],[3..11]);
([3..9],[1..9]).
Pj: ([0..6],[0..8]),
PJ:
Concerning the rectangles ri and 7"3 the boundaries are computed as follows when testing xpr < xpr.
Pi-. ([0..0],[0..9]), P[: ([5..5],[2..11]); Pi-. ([5..5],[0..1]), PJ: ([9..9],[10..11]).
Relating r2 and when checking ypr < ypi yields the following:
Pi-. ([0..6],[0..0]), PJ: ([3..9],[1..1]); ([0..5],[1..1]), PJ: ([4..9],[11..11]).
pi.
The final step is to check consistency among all three rectangles. Thereby, we "intersect" the intervals and test consistency. In our example 44 "abstract" solutions are computed (and presented in an interval-like notation) where each abstract solution represents several single solutions.^^ For instance, the following solution space is presented corresponding to one of the 44 abstract solutions:
Pb
([0..0],[1..9]), Pf:
Pi-. ([0..6],[0..0]), PJ:
H--
(Here, x has been replaced by y, and w has been replaced by h, compared to the rules for the first case.)
To deal with the fourth case ("r2 is below ri") you again interchange the numbers "1" and "2".
As an example we show the re-computation of the interval boundaries concerning the topological relations mentioned first and last. Testing xpr < Xpi results in the following:
([5..5],[3..11]); ([3..9],[1..1]); ([5..5],[1..1]), PI-. ([9..9],[ll..ll])i and Xpr < Xpi (ri is left of ra) and ypr ^ yp^ {f'2 is below ra) and ypr < ypi (r2 is beloiu ri).
The complete computation of all solutions of the example discussed here has been performed in a few milli-seconds.
Working just on the boundaries of the intervals enables a reasonable processing. This fact that the domain sizes do no longer be a crucial parameter, due to a kind of qualitative reasoning ([Her94]) on an interval-hke representation, is one of the advantages of this approach; hence, it actually does not matter whether we consider a discrete or a continuous domain.
An initial implementation also positions pieces of furniture in a room. ([Mac92] mentions to employ CSP techniques in the domain of furniture
^®Those solutions which are obtained by interchanging height and width of the "working" rectangles are included.
layout, too.) Even a deployment for VLSI design is conceivable—we have already prepared the algorithm to consider some space ("channels") between objects (for the wiring).
Furthermore, an early detection of global inconsistency is possible; if in the example a proper constraint is already present forcing, for instance, ri to be below rs the lower bound of the y-interval of the lower left corner point of rz would have to get incremented to the value 2 producing an empty interval of values for y: Pj : ([0..5], [2..1]). We may notice that this is an important efficiency feature. (It also contributes to the completeness of the approach—in contrast to the lack of such a feature in general evolutionary approaches.)
4 Final remarks
The actual implementation is already based on a more flexible version of the algorithm.
In the future, intelligent heuristics should be considered to select an appropriate order of the constraints during the computation process and to exploit problem-dependent specialties.
Just recently we have experimented with some evolutionary computing ideas^® (thereby, however, abandoning the exhaustive completeness feature) to gain in efficiency in the case when only a few placements sufBce; you may consult [HKR95].
Acknowledgement
Many thanks to Derrick Köstner and Manfred Rosendahl for their kind assistance as well as to the referees for their helpful comments.
References
[AB93] Abderrahmane Aggoun and Nicolas Bel-diceanu. Extending CHIP in order to solve complex scheduling and placement problems. Mathematical and Computational Modelling, 17(7):57-73, 1993. Pergamon Press Ltd.
[BD95] James Bowen and Gerry Dozier. Solving Constraint Satisfaction Problems Using A Genetic/Systematic Search Hybrid That
®see [KTN94], for instance
Reahzes When to Quit. In The 6th International Conference on Genetic Algorithms {ICGA-95), University of Pittsburgh, Pennsylvania, USA, July 15-19, 1995.
[BH93] Roland BerUng and Walter Hower. Solving	Geometrical	Constraint
Systems. AI Communications, 6(3/4):232-234, Sept./Dec. 1993. lOS Press, Amsterdam, The Netherlands. Book review of [Kra92].
[Bib88] Wolfgang Bibel. Constraint Satisfaction from a Deductive Viewpoint. Artificial Intelligence, 35(3):401-413, 1988. Elsevier Science Publishers B.V., Amsterdam, The Netherlands.
[Bor81] Alan Borning. The Programming Language Aspects of ThingLab, a Constraint-Oriented Simulation Laboratory. ACM Transactions on Programming Languages and Systems, 3(4):353-387, October 1981.
[Bor94] Alan Borning, editor. Principles and Practice of Constraint Programming, volume 874 of Lecture Notes in Computer Science. Proceedings, Springer-Verlag, Berlin/Heidelberg, May 2-4, 1994. Second International Workshop, PPCP '94, Rosario, Orcas Island, WA, U.S.A.
[Dv95] Rina Dechter and Peter van Beek. Local and Global Relational Consistency — Summary of Recent Results. In [MR95], pages 240-257, 1995.
[EBH93] D.G. Elliman, E.K. Burke, and M.I. He: ard. A Constraint Based Approach to Engineering Design. AISB Quarterly, 83/84: 3538, Spring/Summer 1993. Newsletter of the Society for the Study of Artificial Intelligence and Simulation of Behaviour, Brighton, UK.
[Fal94] Boi Faltings. Constraint Satisfaction Problems in CAD Systems, August 14, 1994. Invited talk. International Workshop on Constraint Processing in Computer-Aided Design (CoPiCAD-94), Swiss Federal Institute of Technology, Lausanne, Switzerland.
[FS93] Boi Faltings and Kun Sun. Computer-aided Creative Mechanism Design. In Ru-zena Bajcsy, editor, IJCAI-93, Proceedings
of the Thirteenth International Joint Conference on Artificial Intelligence, pages 14511457, Chambéry, Savoie, France, August 28 - September 3, 1993. IJCAII. Volume 2; distributed by Morgan Kaufmann Publishers, San Mateo, California, USA.
[Gra95] W. H. Graf. Constraintbasiertes automatisches Layout multimedialer Präsentationen: Semantikorientierte Plazierung, Typographie, Visualisierung und Design. PhD thesis, Fachbereich Informatik, Technische Fakultät, Universität des Saarlandes, Saarbrücken, 1995. Dissertation.
[Her94] Daniel Hernandez. Qualitative Representation of Spatial Knowledge, volume 804 of Lecture Notes in Artificial Intelligence, Sub-series of Lecture Notes in Computer Science. Springer-Verlag, Berlin/Heidelberg, 1994.
[HF94] Djamila Haroud and Boi Faltings. Global Consistency for Continuous Constraints. In [Bor94], pages 40-50, 1994.
[HG95] Walter Hower and Winfried H. Graf. Research in Constraint-Based Layout, Visualization, CAD, and Related Topics: A Bibliographical Survey. In International Workshop on Constraints for Graphics and Visualization {CGV '95), Cassis, Prance, September 18, 1995. DFKI Research Report RR-95-12, Deutsches Forschungszentrum für Künstliche Intelligenz GmbH, Saarbrücken.
[HHLM92] Richard Helm, Tien Huynh, Catherine Lassez, and Kim Marriott. A Linear Constraint Technology for Interactive Graphic Systems. In Graphics Interface '92, pages 301-309, Canada, 1992. Proceedings, Morgan Kaufmann Publishers, USA.
[HHR94] Walter Hower, Djamila Haroud, and Zsófia Ruttkay, editors. Constraint Processing in Computer-Aided Design {CoPiCAD-94), August 14, 1994. Workshop notes. Third International Conference on Artificial Intelligence in Design (AID'94), Swiss Federal Institute of Technology, Lausanne, Switzerland, 15-18 August 1994.
[HJ94] Walter Hower and Stephan Jacobi. A distributed realization for constraint satisfaction. In H. Kitano et al., editors. Parallel
Processing for Artificial Intelligence 2, volume 15 of Machine Intelligence and Pattern Recognition series, pages 107-116. Elsevier Science Publishers B.V., Amsterdam, The Netherlands, 1994.
[HKR95] Walter Hower, Derrick Köstner, and Manfred Rosendahl. Computer-aided layout by evolutionary computing. In Remco C. Veltkamp and Edwin H. Blake, editors, Proceedings of the fifth Eurographics workshop on Programming Paradigms in Graphics, EUROGRAPHICS '95, pages 251-269, Maastricht, September 2-3, 1995. CWI, Amsterdam, The Netherlands.
[How94] Walter Hower. 5th International Conference on Human-Computer Interaction jointly with 9th Symposium on Human Interface (Japan), 8-13 August 1993, Orlando, Florida, USA. AISB Quarterly, 86:46-47, 1994. Winter 1993/94, Newsletter of the Society for the Study of Artificial Intelligence and Simulation of Behaviour, Brighton, UK; conference report.
[How95] Walter Hower. Constraint satisfaction — Algorithms and complexity analysis. Information Processing Letters, 55(3):171-178, 11 August 1995. Elsevier Science Pubhshers B.V.
[HRB93] Walter Hower, Manfred Rosendahl, and Roland Berling. Constraint processing in human-computer interaction with an emphasis on intelligent CAD. In Michael J. Smith and Gavriel Salvendy, editors. Human-Computer Interaction: Applications and Case Studies, volume 19A of Advances in Human Factors / Ergonomics, pages 243-248. Elsevier Science Publishers B.V., Amsterdam, The Netherlands, August 8-13, 1993. Proceedings of the Fifth International Conference on Human-Computer Interaction (HCl International '93, Orlando, Florida, USA), Volume 1.
[JCG95] Peter Jeavons, David Cohen, and Marc Gyssens. A Unifying Framework for Tractable Constraints. In [MR95], pages 276-291, 1995.
[Kra92] Glenn A. Kramer. Solving Geometric Constraint Systems: A Case Study in Kinematics. Artificial Intelligence series. The MIT Press, Cambridge, Massachusetts, USA / London, England, UK, 1992. Reviewed in [BH93].
[KTN94] Takashi Kido, Kazuo Takagi, and Ma-sakazu Nakanishi. Analysis and Comparisons of Genetic Algorithm, Simulated Annealing, TABU Search, and Evolutionary Combination Algorithm. Informatica, 18(4):399-410, December 1994. An International Journal of Computing and Informatics. Special Issue: Artificial Life.
[Mac92] Alan K. Mackworth. Constraint Satisfaction. In Stuart C. Shapiro, editor. Encyclopedia of Artificial Intelligence, volume 1, pages 285-293. John Wiley &; Sons, 1992. Second edition.
[MR95] Ugo Montanari and Francesca Rossi, editors. Principles and Practice of Constraint Programming — CP '95, volume 976 of Lecture Notes in Computer Science. Proceedings, Springer-Verlag, Berhn/Heidelberg, September 19-22, 1995. First International Conference, Cassis, France.
[Ros93] Manfred Rosendahl. UNDO in CAD Systems. In W. Cellary, K. Vidyasankar, and G. Vossen, editors, Versioning in Database Management Systems, page 12, DagstuhlSeminar-Report 55, IBFI GmbH, Internationales Begegnungs- und Forschungszentrum für Informatik, Schloß Dagstuhl, Wadern, February 1-5, 1993.
[San94] Michael Sannella. Analyzing and Debugging Hierarchies of Multi-Way Local Propagation Constraints. In [Bor94], pages 63-77, 1994.
[SF94] Daniel Sabin and Eugene C. Freuder. Contradicting Conventional Wisdom in Constraint Satisfaction. In A.G. Cohn, editor, ECAI 94, 11th European Conference on Artificial Intelligence, pages 125-129, Amsterdam, The Netherlands, August 8-12, 1994. Proceedings, John Wiley Sons, Ltd., Chichester, England.
[Szw94] Gerd Szwillus. Graphical Constraints. CHr94 Tutorial, Conference on Human Factors in Computing Systems, April 25, 1994. Boston, MA, U.S.A.
[Van95] Pascal Van. Hentenryck. Constraint Solving for Combinatorial Search Problems: A Tutorial. In [MR95], pages 564-587, 1995.
[ZH96] Martin Zahn and Walter Hower. Backtracking along with constraint processing and their time complexities. Journal of Experimental & Theoretical Artificial Intelligence, 8(1), 1996. Taylor & Francis, London, UK.
Informational Frames and Gestalts
Anton P. Železnikar.
An Active Member of the New York Academy of Sciences Volaričeva ulica 8
SI 61111 Ljubljana, Slovenia (anton.p.zeleznikar@ijs.si)
Keywords: informational frame—arrangement, concatenation, demarked, disharmonious, edge syntax, harmonious, operator composition, parenthesized, possibility, syntax; informational framing; informational gestalt—axiomatic, causal, circular, inference, informational machine, intelligent, me-taphysicalistic, parallel, phenomenalistic, reverse, serial, star; informational gestaltism; number of formulas in a gestalt
Edited by: Jifi Slechta Received: September 28, 1995
Revised: November 6, 1995 Accepted: November 20, 1995
This article deals with two characteristic and widely useful notions, called the informational frame (of representation) and the informational gestalt. A preliminary discussion of both notions was already presented in [17]. Informational framing is nothing else than a part of informational gestaltism by which various causal possibilities of formulas come into existence. Although a frame is everything which can be put in a frame within a well-formed informational formula, the concatenation of frames must preserve the so-called possibility of a frame to be a part of a well-formed formula. On the other hand, the gestalt structure represents a parallel array of informational formulas, that is, an informational system of causally different formulas proceeding from a given formula. To this, circular gestalting can induce the reversely circular properties, so different forms of gestalts become possible. The most complex and free gestalt called star gestalt is a consequence of an initial circular formula and its graph, where operator transitions from one to the next operand are possible in an arbitrary manner of repeated looping.
1 Introduction
Both the notion of informational frame and informational gestalt^ have been introduced in a superficial form, in connection with the informational Being-of [17] as a phenomenon of informational functionalism. In this article, both informational frame and informational gestalt will be tackled in a more fundamental manner using some particular means of informational formalism to make them informationally accessible and formally effective.
An informational frame is everything which can be framed within an informational formula, from
a single parenthesis ) , operand or operator
'This paper is a private author's work and no part of it may be used, reproduced or translated in any manner whatsoever without written permission except in the case of brief quotations embodied in critical articles.
\= to an arbitrarily complex part of the formula, In a similar way, the principle of fra-

say
ming can be applied to elements of a demarked formula [12], for example, [7] or to any complex part of the demarked formula, e.g. . [= a . A gestalt of any informational formula is everything formally, especially causally, hidden behind the structure of the formula which represents an arbitrarily complex informational entity. We could say that the gestaltistic nature of a serial formula comes in the foreground when this formula is roughly sketched by the corresponding informational graph (a graphical scheme of operand circles and operator arrows) in which no parenthesis pairs or demarcation points of a formula are considered. Therefore, the gestalt of an informational entity remains the hidden other, formally, the adequately transformed formula system concerning the original operands and operators in an unchanged
order within a formula, describing the entity in question. In general, an informational entity can appear (be understood, interpreted, grasped) as a gestalt of different possible entities (various formulas), strictly corresponding to the initial order of the operand-operator structure of the entity in question leaving parenthesizing or demarcation completely open.
As the reader will see, the informational frame (of representation [4]) can be any part of an informational formula (system) and, in this respect, the frame does not follow a structurally limited and traditionally organized syntactic formalism. The concept of the informational frame shows also how traditional syntactic schemes can be managed in a natural frame-appearing form leaving open the concatenation of frames in, certainly, a regular form. Any part of the formula means that the formula can be broken off. at any place and that a part of it does not necessarily represent an operand or operator structure.
On the other hand, an informational gestalt will represent the possibility of the variety or, the whole variety or the whole variety of an operand, formula or formula system in a structural sense, where different structures (possible formulas to a given formula) can be forecasted automatically to some determined extent.
2 Two Cases of Informational Enframing
The usefulness of framing was demonstrated for a verbal case concerning the predicate of existing of something in [18]. Framing of the informational entities can become an effective and transparent aid in transforming linguistic forms (especially, sentences) into informational ones (e.g., cogniti-vely relevant internal states, representations) and vice versa.
2.1 Enframing of a Sentence and Its Translation
Let us take an example of framing a sentence ([5] p. 161) in German, its formal transcription, and retranslation into English. The enframed original German sentence is
The dot at the end of the sentence will be replaced by the semicolon at the end of a formula, when the formula appears in a parallel informational system. In the last frame the main operator composition (verbally, ist existenzial gleichursprünglich mit) is split within the two frames marked consecutively by </>i and, to this, still verbally transposed. In this sense, the substitutional enframed German sentence, also suitable for formal translation, becomes
Let us introduce the following marks for the operands and operators in the given two sentences: r for 'Rede' (discourse 5), b for 'Befindlichkeit' (state-of-mind a) and t) for 'Verstehen' (understanding v) as operands, and |=ex for 'ist existenzial' ('is existential' or 'informs existentially', operatór |=exist), |=gi-ur for 'ist gleichursprünglich' ('is equiprimordial' or 'informs equiprimordially', operator |=eq-prim) and |=mit for 'ist mit' ('is with' or 'informs with', operator f=with) ^ operators. An interpretation of the first enframed form is
(thmit E'.t') h ex ^ I—gl-i
where
{tt=mit b.ti) =
Fmit o; t |=mit t)
when the German 'und' ('and') was interpreted by comma (a short form) and by semicolon (a long form) within a parallel system of two formulas. Both kinds of expression are equivalent (informational operator =).
The second form of the sentence enframing (frame-informational interpretation) delivers, formally,
t (|=exOt=gl-ur)o|=mit ^^
being a parallel system (as shown in the preceding case) in regard to Befindlichkeit b and Verstehen t).
The first formula is externalistically open via the operator composition [=ex o t=gi-ur while in the second formula this composition is brought into a formula interior operator composition with |=mit) that is, (|=exO|=gi.ur)o|=mit-
Let us take the second form of the enframing of the German sentence (the last formula) with the English-adequate operand and operator markers, that is,
P (Nexist O |=eq-prim)o |=with <7, V
Considering this formula, the English translation of the German sentence becomes (in an adequately enframed form)
Discourse
is existentially equiprimordial with
state-of-mind
4>i
and understanding
Another scheme of framing for the discussed sentence, depending upon an intuitive understanding, could take the enframing form
Discourse
is existentially equiprimordial with	
state-of-mind	and understanding
't'l
In the parallel approach, as formally expressed, the and disappears and so two parallel sentences are coming to the surface, that is.
This sentence appears in the English translation of [5], that is in [6] (p. 203). A 'weaker' (so-called literal) translation would follow the first form of the German sentence enframing, that is.
Now, a dot appears at the end of each of the two enframed sentences. In the formal case, we have got the possibility with the comma between a and V, or the semicolon between the two formulas. It is to stress that substantial differences can exist between the operators subscripted by (j)\ and (^2 because the first one concerns internally the state-of-mind while the second one pertains internally to understanding. Here, the informational character of an equally marked operator in different contexts comes to the worth. Namely, in an informational transition of the form a ß, where a is the informer and ß is the observer, operator ^ is to be understood as operator composition where has to be decomposed according to the a's informingness and according to the /?'s informedness.
The demonstrated translation of the German sentence into informational formula was only that which could be called the first informational approximation. Introducing the concept of the so-called informational graph it is possible to give several kinds of interpretation of an informational formula. In Fig. 1 an informational graph of the widening of this approximation is presented
Nc
Nc
Nc
a
Figure 1: A graphical interpretation of the bidirec-tionally and circularly (5-loop) structured parallel decomposed system r concerning the input entity a.
where the semantic scope of the word 'gleichursprünglich' (equiprimordial) is taken into consideration. Let us mark the operator composition
68
Informatica 20 (1996) 65-94
A.P. Železnikar
t (a)
|=gUur)o Nmit corresponding to 'ist existen-zial gleichursprünglich mit' by operator j=c. By this operator the arrows between entities t, b, ti in Fig. 1 are marked. The vertical input and output arrows are unmarked and represent the general operator [=.
Which is the parallel informational system describing the graph in Fig. 1? If discourse (die Rede) r concerns an entity a, that is t(o;), the so-called r-solution of the system graphically presented is
t, b,t) t=;
x\=cb; b he o; 0 t=c r;
\t t=: o; t) t=c b; b l=c t J
In the first row of the parallel array the input paths are located and in the second row the output paths. The third and the fourth row represent the main two loops (the left and the right one) of the graph. These two loops, joined, hide three more elementary loops existing between operands r and b, b and ti, and t) and t. Thus, one of the so-called serial type solutions (a specific one) for i;(o;) can be expressed as
/a\=r, h b.ti; t, b,t) [=;
rNc(b K (f Net)); t K (f K (b Net));
t Ne (öNet); bNe (oNe b); V t) Ne (t Ne o) This solution formula system considers the input and output structure of the graph in Fig. 1 in the first and the second row, two long-size (bidirectional) loops in the third and fourth row, and three short loops in the last two rows, respectively.
The discussed examples of the natural text translation into informational formulas, and vice versa, demonstrate the hidden (intuitive) importance of the informational framing. As a consequence, framing concerns the so-called gestal-ting, where as the gestalt of a serially structured formula the parallel system of all possible formulas is meant, emerged from an initial formula by all possible displacements of the parenthesis pairs.
2.2 Sentential Paradigm and Possible Frames and a Gestalt
The ideas for the subsequent example of using informational frames and a gestalt go back to Fo-

dor [2, 3] being sketched in a condensed form in Churchland ([1], pp. 385-388). We will study this particular case on the basis of the graph presented in Fig. 2. This graph is an exact representation of the parallel formula system (as a free-standing system without a specific marker) consisting of primitive transitions (from an operand to another), which is.
/a\=i-
L N O^M N io	N (^M N 7M
h N 'Sm	N p^',	pi N Cm
c, N^;	c, nq^m	C, N^Sm
V i. N	7. N	p. N /
This array includes three types of transitions: the input, forward, and backward ones. In the first row, the only input transition informs the internal state about the sentence. In the second and third row seven forward transitions are listed. In the last two rows, seven backward (feedback) transitions appear, enabling a complex cycling of the system when the input sentence is identified in a processing way by the arising representation p^ and content c^.
On the basis of this graph we will present a solution for content c^ and some possible, for the discussion relevant frames. The gestalt and possible interpretations of content c j will be presented (and discussed later on).
The concept of sentential attitudes a^ belongs to the autonomy of human being; on the other side, the logical inference defends co-evolution and interdependence, that is, informing of entities, in informational terms. Beliefs, desires, thoughts, intentions, etc. inform between a being's internalism and externalism and play a crucial role in causation of living behavior. Sentential attitudes approach the nature of the so-called representations and of the rules that govern the transitions between representations. All this sounds in-formationally familiar where sentential attitudes and representations may function as distinguished informational entities.
There exists an interplay of the sentential attitudes and representations, a parallel and serial informing among entities. The theory postulates that sentential attitudes can also play a role in nonconscious processes and, in this way, can be involved in cognitive processes, e.g. in the form
internal identification identification representation-content state	informing counterinforming	embedding
Figure 2: A sententially paradigmatic, metaphysicalistically (7-tuple-loop) structured internal state system l giving the input sentence(s) a the sentence(s) representation Pi{(t) and sentence(s) internal content (understanding with meaning) Ci,{a).
of inductive and deductive logic and decision making (the theory of internal states).
Let operand a denote a sentence or a set of sentences, and operand p a representation or a set of representations. Transitions of sentences Ui 1= aj and representations pi pj have the standard informational form, where
{ai 1= CTj)	{pi ^ Pj)
Other entities are the sentential attitude,

a # (cTi,.. .,a„); i ^ (ti,..., /-„); i,(c(0 W cr); Pi ^ (Pi, • ■■,Pn)]
(a\=r,

( (ö:. h t.) N « J K ( p. h (c. h ö:.) )


and the internal state t. Cognitively relevant internal states can be comprehended as i # il,..	An internal state t has, in general, a
content c (<•) (or in a short form, cj, thus, c^ Ct(ti), ..., Ct(i„), where content components can be identified via sentences, that is,
Ct(il) (7i; . . . ; Ct(im) ^ o-m
In this case, operator can be replaced by operator |=identically which reads 'informs identically' (that is, circularly between the ^-involved operands). Such an operator is bidirectional (constituting a loop, graphically), that is,
'^i-i'-i) ^identically (^i'l l=identically Ct(<-i)
The identification, ii(ct(ij) <—> ai), is possible by virtue of the isomorphism (operator U—)•') between Li and ai.
Let us discuss the solution for content Ci,{a) on the basis of the graph in Fig. 2, where
Vs. t= (p. N (SJ
The same solution is, in the demarked form, Ciia).^. a t= i;
.hQ^Jf.NQi. NQÖ:. hÖT. h or. [= .i, \=.<i, ^.7, h^M

h-pc N -c. h

In the third and fourth line, there is the circular formula describing a causal situation of the longest loop. In the remaining rows, systematically all loops are formalized, so that the entire graph in Fig. 2 is systematically covered. The reader can compare the parenthesized, and equivalent to the demarked formula systems, for one particular situation of the graph. In the next section,
the demarked formula system will be discussed in detail.
The gestalt of a circular formula is determined by Definition 19. How many formulas does the content gestalt r(Ct((T)) include? The first row delivers only one formula, <7 |= i. The length of the formula in the second and the third row is £2,3 = 8 (the number of the formula's binary operators). Thus, N2,3 = ^g®) = 1430. For the next lines there is: N4 = 1(4) = 14, N5 = = 42, Ne = = 5, Nr = = 4, and Ns = ^(2) = 2. The sum is 1498 formulas in the gestalt of the discussed formula system.
3 The Role of the Demarcation Point Replacing a Pair of Parentheses in an Informational Formula
Let us introduce the demarcation (delimiting, separating) point [12] in the context of an informational formula replacing a pair of parentheses. Let us list some characteristic examples of parenthesized formulas to get the feeling how the demarking points are uniquely set instead of parentheses pairs in such a way that the parenthesized formula can be reconstructed, uniquely. Let have the following examples for parenthesized and equivalently demarked formulas, respectively:
(«);
(«) N (/?); (« N ß) N; ((«) h iß)) N;
N ((«) N (/?));
(«N^)N(T);
«1 1= («2 N («3 N 04));
((«1 N «2) N "3) N <^4;
a;
a\=ß-«h/?.h;
«h-/9 h7; «N/9-NT;
«1 N -«2 N -«3 K«4; 011= 0^2 • 1= "3 • 1= 04;
(ai 1= «2) 1= (0:3 1= «4) ; ai 1= «2 ■ N ■ «3 1= «4
The question is whether simple rules can be set for the mastering of a unique reading of a demarcated formula. Must it be read from the left to the right or can the reading be performed uniquely also in the opposite direction? In demarked formulas, the direction of formula reading is essential (from the left to the right or vice versa) and
must remain the same as the direction in which demarcation points have been set when replacing particular parenthesis pairs. One must keep the same direction in which demarcation points have been set.
The rules of a unique replacement of parenthesis pairs by demarcation points (one point replacing the pair) in an informational formula are the following:
1.	Before conversion, all superfluous parenthesis pairs in an informational formula have to be eliminated (crossed out).
2.	Any left parentheses, ((■ • ■ (, at the beginning, as well as any right parentheses, ))---)i the end of an informational formula (if any), are ignored and crossed out (within the final result).
3.	Each left parenthesis, '('—-and to it corresponding right part ')'—staying directly after an informational operator, [= (or a particularized operator), is replaced by the demarcation point, '.'. E.g., 1= (_) is replaced
by 1= ._(with one point only).
4.	Each right parenthesis, ')'—and to it corresponding left part '('—staying directly before an informational operator, |= (or a particularized operator), is replaced by the demarcation point, '.'. E.g., (_) 1= is replaced
by_. 1= (with one point only).
5.	All remaining parentheses, if any, with exception of those used in functional forms (e.g., V'CO))	informational formula, irrespective of the the left or the right type, are ignored and crossed out.
6.	By the procedure following the previous rules, all proper parenthesis pairs in a formula are removed and uniquely replaced by demarcation points.
7.	Semicolon ';' and comma ',' are in regard to demarking a formula exceptional operators which remain as they are. By semicolons separated formulas are in parallel informing formulas of a system and the formula parallelism remains preserved in both parenthesized and demarked formulas.
On the other hand, the rules of a unique replacement of demarcation points by parenthesis pairs (one parenthesis pair replacing one demarcation point) in an informational formula are the following:
1.	The first point on the left side of operator in a formula is replaced by the parenthesis pair at the beginning of formula (left parenthesis), and at the place of the point (right parenthesis).
2.	The first point on the right side of operator is replaced by the parenthesis pair with the left parenthesis '('at the place of the point and with the right parenthesis ') ' at the place of the first point at the left side of operator (lying right of the discussed operator).
3.	After parenthesizing by rule 1 or rule 2, the next point is already within a parenthesis pair. Satisfying rule 1, the'left parenthesis is set behind the existing left parentheses and the right one at the place of the point. Satisfying rule 2, the right parenthesis is set before the existing right parenthesis and the left one at the place of the point.
4.	In the way of rules 1, 2, and 3, all the points can be uniquely replaced by parentheses pairs.
4 The Basic Notion of the Informational Frame within an Informational Entity
Informational frame is a very beisic notion which, as a special case, functions between two informational operands irrespective of the parentheses pairs, which are disposed within an informational formula. Informational frame is everything which can be enframed in an informational formula. Enframing means simply to put a frame around certain consequent elements in an informational formula.
In this section we must answer the question what does such a regular, or irregular, structure, named informational frame represent. We shall see how the introduced notion of frame deviates from any other, traditionally fortified constituents of mathematical formulas. This strange notion
originates from a fully legal style of formula writing in metamathematics—using the so-called demarcation point (quadratic dot '.')—introduced in Principia mathematica by Whitehead and Rüssel [12].
An informational frame is a formula or anything which can be put into the frame within a formula. Thus, one has to investigate which characteristic frames can occur within a formula or even a formula system, where the system is comprehended as a complex formula structure.
4.1 Possibilities of Informational Frames
To get an intuitive insight of the possibilities of occurring informational frames we have to look into the formula structure and detach all possible typical frames. This insight can begin with looking into a complex formula and mark or list the characteristic frames. As we shall learn, some frames will be senseful, being informational entities by themselves, the other will be artificial and corresponding to specific views and analyses. In this context harmonious and disharmonious frames will be treated and classified down to necessary details.
In principle, any part of a formula can constitute the so-called informational frame. The problem is how to choose a sufficiently complex formula in which all possible forms of informational frames do occur.
We can begin to write formulas from the left to the right and observe how structurally different frames come into existence. So, let us make a trial beginning with an operand:
a
a
a\=
« N (/3f=(
« N (/? N (7
ah
a\=.ß
In the right column, formulas with demarcation points instead of left parentheses are listed. .
In a similar manner, we can write formulas from the right to the left and observe how structurally different frames come into existence. So, let us make a trial ending with an operand:
a
h« )\=a
l=/3)l=« 7) N /3) h «
a
N« ■ h« ß.^a
N/3-N« ■ Niö.N«
j.]=ß.\=a
Within the listed parenthesized or demarked frames arbitrary parts can be chosen as frames. Some of them are operand frames, that is, in-formationally regular formulas or subformulas. Other frames are the so-called operator frames, which can be put between two operands, or, in case of parenthesized formulas, at the beginning and the end of formulas or inside of them, representing the so-called parenthesis frames [17].
4.2	Informational Operand Frames
Definition 1 [Operand Frame] An informational operand frame is nothing else than an enframed informational formula which is either a marker, an autonomous formula, in a formula occurring subformula, or a formula system. An informational operand is, by definition, an informationally well-structured formula of formula system. □
By inspection, it can be clearly recognized which syntactic structures in a formula perform as operands.. And not only this: the semantic character of operands can be recognized to the extent of their decomposition which takes place in a serial extension or in one or more parallel, that is, additional interpretations. Operand frames need not to be treated separately because the notion of the informational formula is fundamental and firmly informationally determined.
4.3	Informational Operator Frames
Informational operator frames are twofold. The most primitive operator frames are informational operators as they are (or appear) by themselves. For a complex and, to some extent unusual, concept of operator frame we introduce a new definition. The sense of the introduction of the notion of the operator frame is to identify a direct informational connection between two operands in an informational formula.
Definition 2 [Operator Frame of a Parenthesized Formula] Operator frame in a parenthesized formula is the part between arbitrary two operands, and to this operand-intermediate part belonging left-parenthesis and right-parentheses frames. □
Example 1 [Some Operator Frames of a Parenthesized Formula] Let the metaphysicalistic parenthesized informational formula
((((«(=!„) h	K (^^ah
be given. Let us have examples of the following three parenthesized frames
((((
((((
a
a
hi,) h g NO Nt«) K
a
\=Ia)\=la)ì=Q
N " ))
( ((( a\=Ia
(ea h «))
between operands a at the beginning and a at the end, a at the beginning and Sa, and and 7«, respectively. In the first two cases, the operator frame is an entity of three parts, namely, the left, middle and the right one, the third case hćis a two-part operator frame. □
The last example shows how parenthesized frames can become badly transparent and mutually dependent.
Definition 3 [Operator Frame in a Demarked Formula] Operator frame in a demarked formula is the part between arbitrary tzoo operands. □
Example 2 [Some Operator Frames of a Demarked Formula] Let the formula in Example 1 be expressed in the demarked form, that is,
a |=I„.|=	7a- h-^a 1= -^a 1= "
The enframed examples, discussed in Example 1, become evidently.
Each case, irrespective of the operator frame structure (and position), possesses only one de-marked frame. □
4.4 Harmonious Informational Frames
We have to determine harmonious and disharmonious parenthesized and demarked informational frames. Which are the various possible frame forms and in which manner do they appear as parenthesized and demarked frames? Which are the advantages of frames in one or the other form?
At the first glance, it seems that harmonious frames always appear in one piece, that is, they are not split within a formula. We shall see how this principle can have different consequences comparing the parenthesized and the demarked frames.
4.4.1 Harmonious Parenthesized Frames
Parenthesized frames in formulas of Example 1 are all harmonious. A frame, although split in two or three parts, is harmonious if it is functionally closed.
Definition 4 [Harmonious Parenthesized Frame]
A split, or unspUt, parenthesized frame is harmonious if it is functionally closed. The functional closeness of a parenthesized frame means that after concatenation of its parts the resulting frame presents a well-formed formula, that is, an operand frame. In the procedure of frame concatenation the empty parenthesized parts of the form 0 are replaced by an empty part (informational nothing), X. At such places, the rule () A is applied. Also, the outmost parenthesis pair can he omitted. □
The concatenation of frame parts in Example 1 gives
which results in three well-formed formulas, that is,
((((h la) \= ^a) \= Q \= 7a) N* N t=)); '
One can see how the concatenated frame-harmonious structures are reduced in respect of the 'goal' entities, so that they preserve the necessary formula well-formedness.
Example 3 [Split Parenthesized Frames] Split parenthesized frames can be harmonious and disharmonious. They are divided in a,t least two parts and each part of a split harmonious frame is disharmonious. But the parts of a harmonious frame can be concatenated into a unique (well-formed) frame formula. For instance, the split frames in parenthesized formulas
(( « \=ß)\=7)
((« N /3 ) N 7) N
((«N/3) h 7 ) h^
are all harmonious. On the other hand, the frame parts are all disharmonious," that is.
((	>			) y	■((«N	)	)N7)
			1.				
The reader can recognize the disharmonious
structures of the listed frames by himself/herself. □
4.4.2 Harmonious Demarked Frames
What is the difference between a harmonious parenthesized and demarked frame in concern to a frame splitting?
74
Informatica 20 (1996) 65-94
A.P. Železnikar
Definition 5 [Harmonious Demarked Frame] An
unsplit demarked frame is harmonious if it is functionally closed. The functional closeness of a demarked frame means that the frame itself presents a demarked well-formed formula. □
Because the demarcation point replaces the parenthesis pair, the split harmonious frames in case of parenthesized formulas appear as unique frames in cases of demarked formulas. This is quite true for Example 2, where the demarked harmonious frames are
1= Jc-N^a-NC-N
7a-	\=-£a N


as the operator of alternativeness; and '(' and ')' as parenthesis pair. A preliminary context-free grammar is a construct G = {N,T, R,o), where N = {o, |=} is the alphabet of nonterminals, T = {'(', ')', o, ';', V, o, N} denotes the terminal alphabet, R is a set of context-free rules (see below) and o marks the initial symbol. The set of rules is determined by two syntax rules, which
The rightmost operator combination . C . in the second frame means . C (), so, it can be reduced into . C for the sake of simplicity. Similarly, the leftmost operator combination . C in the third frame can be replaced by operator C to keep the frame in a common form.
4.4.3 A Syntax Comparison between Harmonious Parenthesized and Harmonious Demarked Frames
How can harmonious parenthesized frames be recognized at once? The answer is: by stating that they represent well-formed formulas. The correct form of a formula can be proved by the usual syntax analysis, taking into account the general, context-free grammar for well-formed informational formulas. It is instructive to determine such grammars for parenthesized and demarked informational formulas.
Designing a syntax, one can consider that a formula or formula system is nothing other than an operand. It means that the initial (starting) grammatical variable in a formula development (generation) is the operand, symbolized grammatically (and as a terminal) by o. A general context-free grammar for the parenthesized formula systems can be constructed by the following items (syntax categories and terminals): o as operand; |= as operator; o as a separator in an operator composition; semicolon ';' ^^ the operator of formula parallelism; comma
are
o f- o\= \ \= o\o\= o\[o)\o\o\o^o
N ^ NoN l(H
In case of formula systems using the demarcation points instead of the parenthesis pairs, the demarked grammar is G' = [N, T' ,R* ,o), the alphabet of terminals is T' = {'.'o, ';', o, |=} and the rules of R' are
o	o\= j |=o|ot=o|.o|o.|o;o|o,o
N NoN l-N I N.
Rules with demarcation point in the second line (operator composition case) can be used only in such a way that the point is inside of an operator composition. For example, in an operator generation process, there is
1= 1=0^ |=.o|= -» |=o|=.o[= t=o|= .o. t= ^ |=o|= .o. t=o|=
where the end result would correspond to the operator composition ([= o |=) o ([= o |=). Symbol represents the derivation (generation) step.
4.5 Disharmonious Informational Frames
The concept of the disharmonious informational frame (dif) enables the treatment of arbitrary formula parts which do not fit harmonious informational frames. Difs, in this way, contribute to the possibility to treat arbitrary parts of formulas as entities which may, in special cases, be of essential interest.
Definition 6 [Disharmonious Frame] An informational frame is disharmonious if it is a part of a well-formed formula or formula system but it does not represent a well-formed formula by itself, a
Disharmonious frames are parenthesized incompletely within a parenthesized formula and de-marked within a demarked formula.
4.5.1 Disharmonious Parenthesized Frames
In a parenthesized formula, disharmonious parenthesized frames are the most arbitrary (enframed) entities. They can be parts of harmonious frames on one side, on the other side they can embrace any imaginable sequence of adequately serried operands, operators and parentheses, which do not constitute a harmonious frame. So, both types of frames exclude each other.
Definition 7 [Disharmonious Parenthesized Frame] An informational frame is a disharmonious parenthesized frame if it is a part of a well-formed parenthesized formula or formula system but it does not represent a well-formed parenthesized formula by itself. □
By this definition, a disharmonious parenthesized frame is not an arbitrary sequence of operands, operators and parentheses, but is an arbitrary sequence of the mentioned entities which constitute a part of a well-formed parenthesized or formula system. For example, frames
etc. are disharmonious pa-
constitute a part of a well-formed demarked formula or formula system. E.g., frames [!], .
brmula		
1	)	3'
(N
N)
f=(N
N-
mar
1= . 1= , . 1= . , etc. are disharmonious de-
ed frames because they can be completed to
renthesized frames because they can be completed to the harmonious parenthesized frames.
4.5.2 Disharmonious Demarked Frames
In a demarked formula, disharmonious demarked frames are enframed entities being arbitrary parts of the formula. They can represent parts of de-marked harmonious frames and, in this way, can include any imaginable sequence of adequately composed operands, operators and demarcation points, which in this sequence appear in a harmonious frame. Demarked harmonious, and demarked disharmonious frames, exclude each other.
Definition 8 [Disharmonious Demarked Frame]
An informational frame is a disharmonious de-marked frame if it is a part of a well-formed de-marked formula or formula system but it does not represent a well-formed demarked formula by itself □
By this definition, a disharmonious demarked frame is not an arbitrary sequence of operands, operators and demarcation points, but is an arbitrary sequence of the mentioned entities which
the harmonious demarked frames.
4.5.3 A Comparison between
Disharmonious Parenthesized and Disharmonious Demarked Frames
The comparison between disharmonious parenthesized and 'disharmonious demarked frames concerns the so-called of the frame's left and right edge development (see Subsubsection 5.6.1 and Table 1). In designing a frame (harmonious as well as disharmonious one), the designer (designing entity) proceeds from that part of the frame which already exists, developing the frame at its edges in such a way that the emerging disharmonious frame will become a part of a possible harmonious frame or of well-formed formula. Syntax rules for generation of disharmonious parenthesized and disharmonious demarked frames differ, of course, essentially between the parenthesized and the demarked case.
4.6 Functional Frames
Functional frames are characteristic in such a way that they can be clearly recognized in comparison with other formula frames. Informational function as an informational operand has the form where the left parenthesis appears between two operands (the only possible case for such an appearance), and the right parenthesis appears at the end of argument a. There are not demarked functional frames, but within a function and its argument formulas can be expressed in the de-marked form.
4.6.1 Harmonious Functional Frames
Functional frames are parenthesized structures which occur as such (roughly parenthesized) also in cases using demarcation points.
Definition 9 [Harmonious Functional Frame]
Harmonious functional frames have the general Jorms
A.P. Železnikar
,	(a) ,	(V') (a) ,
	(a), ( if )(a),	i<P)	(1«)
where, in principle, both ^p and a are complex informational formulas, so, they must be parenthesized as {(p) and (a), respectively. A common form of informational functionalism is ^{a), where ip stands for a marker (a single operand name). Thus,
ip{a) , (/5(0) ,and

{a)
where a is an argument formula and (p is a function representative informing upon the argument [17]. □
The following comment could be useful: frames fa] and (a) are harmonious and can appear as such also outside a functional context. On the other side, the possibility of framing both the function entity and the function argument a, makes them visible for further informational investigation.
4.6.2 Disharmonious Functional Frames
A disharmonious functional frame offers an especially characteristically visible case which explicitly concerns the syntactic structure pertaining solely to the concept of informational function.
Definition 10 [Disharmonious Functional Frame]
Characteristic disharmonious functional frames take the general forms as


etc. □
As stressed, combination ')(' is the most significant mark of the presence of an informational function which can be searched in the left and the right direction of the mark.
4.7 Frames Concerning the Operator Composition
Operator composition belongs to distinguished operator structure. It enables to join several operators systematically into one resulting operator. Similarly, as a function, operator composition can be treated as an autonomous framing problem. Harmonious and disharmonious framings of operator compositions can be studied and clarified.
4.7.1 Harmonious Frames for Operator Compositions
An operator composition is a structure consisting of operators |= (differently particularized), operator separators 'o' (uniquely determined) and parenthesis pairs.
Definition 11 [Harmonious Parenthesized Frame of Operator Composition] Harmonious parenthesized frames of an operator composition can take the general forms
hoh , (Noh) , (NoNoN , l=o(NoN)
etc. They must satisfy the operator syntax described in Subsection	□
In a similar way, frames for the demarked operator compositions can be determined.
Definition 12 [Harmonious Demarked Frame of Operator Composition] Harmonious demarked frames of an operator composition can take the general forms
NoN		
hoN-o-Noh		
etc. They must satisfy the operator syntax described in Subsection 4-4-3- Q
In a syntactically regular way, harmonious operator composition frames can be arbitrarily adequately nested.
4.7.2 Disharmonious Frames for Operator Compositions
Which are the main forms of disharmonious parenthesized and demarked operator composition
frames? At least, a harmonious operator composition frame must begin and end by an operator |= (which, obviously, follows from the previous definition). A general answer is given in the form of the following two definitions for a parenthesized and demarked case, respectively.
Definition 13 [Disharmonious Parenthesized Frame of Operator Composition] Disharmonious parenthesized frames of an operator composition can take the general forms
)o , o( , )o( , Ho , o(N

etc. □
On the other hand, frames of disharmonious de-marked operator compositions are determined in the following manner.
Definition 14 [Disharmonious Demarked Frame of Operator Composition] Disharmonious demarked frames of an operator composition can take the general forms
.0
o .
.o. ,	o 1= .0	)	0 . 1= 0
h ° h -o- h °			5
o [= .o. 1= o 1= .
(a)
(b)
Left operand
Operator
Right -operand
Left	Left	Sepa rator	Right	Right
oper-	oper-		oper-	oper-
and	ator		ator	and
Figure 3: Frame sequences for transition of type {&) a\=ß and (b) a\=o\=ß.
comprehended as a the most essential informational frame—the operator frame—which has to be studied carefully, and exhaustively.
Informational transition, of the form (a) a \= ß, or, operator-compositionally, in the form (b) a \=o\=: ß can be comprehended by the general schemes in Fig. 3, respectively. How can informational frames be consistently (well-formedly) joined together?
Let us start with the following frame structures concerning the transition a ß. There are, evidently, some possible frame configurations:
a
a
N /3;
a
\=aO\=ß ß-,
\=a	h/3
a\=a O ß
Ö Na 0 \=ß ß
; "0]	h.	o	h/J	ß
etc. □
5 A Frame-analytical Comprehension of Informational Transition and Possible Frame Concatenation
5.1 Operator Frame
Informational operator is that entity-which appears between two informational operands, forming the so-called basic informational transition, irrespective of the complexity of operands. On the other hand, the operator can be understood as an arbitrarily complex entity, composed of many other entities, that is, operators, operands, and parenthesis pairs, which do not constitute a well-formed formula. In such a sense, operator can be
Let us study particular framings of informational transition.
5.2 Frame Concatenation and Frame Parallelism
Under certain circumstances, informational frames can be concatenated into new frames and can be stacked in a parallel manner within frames and, then, the stacked frames concatenated, etc.
Definition 15 [Frame Concatenation] Two frames (paj and (f)Q, can be concatenated building up a frame 4>, that is, (f) # (f>a4'ß! if both 4>a, and (pß,
are parts of a welUformed formula, and such is 4>. □
Definition 16 [Frame Parallelism and Concatenation] Frames (pa and (f)ß, being arrays of frames
78 Informatica 20 (1996) 65-94
A.P. Železnikar
(t>a,2, 4>c,,k and cßßß, • • •, <j)ß^k, respectively, can be concatenated into frame array (j) with frame components 4>i,4>2ì ■ ■ • i<Pk in such a way, that 4> ^ <f>a4'ß! where
that is,
4>a,l	't>ß,l
	(f>ß,2
\	
4>a,k	(t>ß,k
(and (j)i # (f>a,i4>ß,i fori^l,2,---,k). □ 5.3 Framing the Transition a \= ß
In the framed transition a \= ß, the question what could an operator |= represent, and which is the degree of its complexity in a serial and parallel sense, comes to the surface. Initially, |= is an arbitrarily structured operator and its structure has now to be clarified.
Evidently, the seria tion of a transition a
parenthesized decomposi-
1= ß can have the form
(•••((
a


The complex operator frame is split into three parts and one can understand how a symbol |= between operands can become as complex as possible. Operator |=* is the main operator and signals the transition decomposition process is never ended. Simultaneously, it marks, how the apart of length £a = m of the operator belongs to a and how the /J-part of length £ß = n belongs to ß. Thus, the length of decomposed transition is ia\=ß = m-|-n-|-l. The transition gestalt includes (see Subsection 6.3)

m + n + 2
-I- 2n -Im -f n -H 1
a
1= «1 . t= «2 • 1= • • • «m . h •
ß
where the care of parenthesis pairs is left over the the mechanism of the demarcation point.
The next enframing shows a clear separation between the left, and the right, part of transition a\= ß m regard of the main operator [=*. There is
where <pa is a harmonious left frame and (t)ß a harmonious right frame. Within these frames, frame pairs and {(l>a,\, <j>a,r) and (<^/3,1, (f>ß,i) are disharmonious. Additionally, the main operator does not belong explicitly either to operand a or to operand ß. It stays between both of them and can be decomposed in a further way in the left, and the right, direction.
The demarked case of the discussed example brings a clear evidence how the informer, and the observer, part in a transition can be separated. There is
a
1= «1 . 1= «2 . 1= •••«m
■N.
ßi h ■/?2 h ■
ß
Certainly, there can exist several different, that is parallel, decompositions of transition a \= ß. In this case, instead of a single operator enframed formula, there are, say, k parallel formulas of the form
(•••((

a
a
possible decompositions of transition a |= /? of length £a+ £ß +I-
The demarked form of the discussed operator decomposition of transition has a compact shape, with only one (unsplit) frame, that is.
N au) N "12) N	• • - aim)
	
{ßn N ißu N • ■	• {ßln N
1= «21) N «22) N	• • ■0:2m)
• N*	
(^21 N (/?22 t= • •	■{ß2n\=
ß )••■))
ß )•■•))
(•••((
a
t= ttfci) 1= Oiki) N • ••O^km) [ßkl N {ßk2 h • • • {ßkn N
ß )•••))
which results into a compact interpretation of the transition decomposition by
a
h au) N "12) N •••
«Im) h* (ßn
1= «21) 1= «22) h • • • «2m) N* (/Ö21
h(ß22h---(ß2ni=
1= aki) t= ak2) 1= • • • akm) N* (ßki
h(ßk2h---(ßknh
ß

)■■■))

The reader can comprehend in which sense the complexity of a transition operator |= can develop. In the last enframing example the split parts of a parallel decomposed transition operator are separately enframed, so that operands a and ß appear only once, like in an informational graph. In the last complex parallel case, the transition gestalt includes (see Subsection 6.3)
N.

1 f 2mi + 2ni + 2 ^ rui + n,- + 2 mi + Ui + 1
= E
a
ß
In a compact, however virtually artificial way, the separation into the left and the right part frames can be expressed in the form
1= an) 1= «12)
1= • • -«Im)
1= «21) 1= «22) h • •■0:2m)
1= • ■■akm)



H
0a,r
{ßxx N (/?12 N	
	
	^jS,!
(/321 h [ß22 N	
■■■{ß2nh	
:	
{ßkl h {ßk2 N	
•■•(ÄnN	éì.
	
ß
possible decompositions of transition a ß of parallel lengths la,i +	1 for i = 1,2,..., A;.
As already ascertained, the demarked form does not need an explanation concerning the parallel frames within a frame. The demarked formula system for multiple operator (e.g. interpretative) decomposed transition is (without serially split frames)
The reader can find the demarked form of the last formula system in Section 11 and Subsections 5.4 and 5.5 where the philosophy around transition a 1= o (= /3 is debated. A kind of the system simplification will become evident.
5.4 Framing the Transition a \=o\= ß
At the first look, there is a minimal difference between the presentation of transition decomposition a \= ß and a \= o\= ß. However, as it is pointed out in Subsection 5.5, the difference is essential, because in the second case the informer part {a) and the observer part (/3) can be separated up to the operator composition separator 'o'.
In general, for the parenthesized case, the split, composed operator (|=o|=) frame example is
(•••((
a
h aiTN «2) t= •	•■am) 1=
Q	■■{ßn^
ß )•■•))
The demarked form of the same formula becomes
1= ai. 1= ^2 . [= • • • «TO . 1=
a. o

ß
The next two examples of possible enframing come near to the goal of separation between the informer and the observer. In case of parenthesized formula, one obtains the enframing
and in case of demarked formula the enframing
comes into the separation foreground.
Further examples, discussed in Subsection 5.3, can easily be constructed for the a\=o\= ß case.
5.5 Interpreting the Transition Framing a \=a o \=ß ß
Framing of the form a\=a o \=ß ß
IS essen-
tia] for a proper understanding and informational regularity of the separator 'o', functioning as a regular operator. Evidently,
(aK o /3 ) = ((a Na) o (N/J/?))
In this formula, subformulas (a Na) and (N^ ß) are well-formed formulas, and between them an informational operator, that is, also, operator 'o', can appear. In this way, separator 'o' is a regular informational operator. On the other hand, in a
Na°N/3 entity 'o' is a member of the opera-
tor composition No o N/?i ^hat is, the separator between operators Na and N/3- In both cases, the meaning of 'o' is a sort of informational concatenation between the informing operand a and by it informed operand ß.
5.6 A Consistent Concatenation of Frames in Complex Transition Formulas
How can arbitrary frames be linked together to keep the possibility that a final frame concatenation will represent a well-formed formula? A
particular question concerns the frame concatenation which would lead to an operator frame between arbitrary two operands in a serial (or serially circular) formula. What are the characteristics of such an disharmonious (irregular, non-well-formed) operator frame?
5.6.1	A Conditional Frame Edge Syntax
How can frames be composed beginning from an initial frame H? Let the initial frame be replaced by a single basic alternative frame which is nothing else than a general symbol appearing in a well-formed formula. Thus, we introduce the initial replacement rule in the form
H ^ a|N I o I ( I ) I, I;
where a represents all possible operands, N all possible operators, o composition operator for operators. We can also introduce operand (j) as the current occurrence of a frame which is still arising or is already arisen by the use of syntax rules. These rules can be used only at the left and the right edge of arising 4> in such a way, that the final result of different frame concatenation delivers, at the end, a well-formed informational formula, after an arbitrary initial rule ii <r~ (j) was chosen. A complete collection of frame edge syntax rules is listed in Table 1.
In this table, the aesthetical (obligatory) space symbol, is introduced, which explicitly marks the usual space between formula components. As one can see, there are seven syntactic types of symbols as given by the H-rule. The use of rules from Table 1 is conditional. The condition which must be satisfied at the generation of frame ^ is that ^ is a syntactically correct part of the arising informational formula. Thus, additional conditions in the form of context dependent rules can be constructed in the following way:
oN^NoN I N)oN
We must not forget that the rules can be applied only at the edge of the current frame (p.
5.6.2	Explanations Concerning the Use of Rules in Table 1
Explanations concerning the use of concrete rules in Table 1 is necessary. As said at the very beginning of this section, the application of concrete
#	0	1	2	3	4	5	6	7	8	9	10	11	12
0	K	a	N	o	(	)	)	Ì					
1	a		-h«	(a	a)	a,	,ua	a;	;ucv		«u(.	)a	)uOi
2	N	o; 1=		(N	N)	h,	,uh	h;	;uh	h(	■ ■) "h ■	ho	oh
3	o	^ oh	ho	oN	(ho	o(		a |=o	o\= a	«□(ho	oh) u«	)oh	
4	(	{a		(h	h(	)(	,u(	;uh	((	o(	au(		
5	)	a)	)«	N)	)h	)(	),	);	))		)uO;		
6	5		tt.		h,		),						
7	?		a;	;uN	h;	;u(	);						
Table 1: A conditional frame-edge context-free syntax in which some meanings are the following: reads is_replacedrby; for example, (1:0^ 1) marks a cv or (4 : 0 4) marks "{ \= {"; the visible space symbol marks an aesthetical (also obligatory) space between some elements of a formula.
rules is conditional: the arising frame ^ must be a part of a well-formed formula. We will refer to particular rules by (x, z) markers, for example, (2,12) [= o|=. The most interesting cases are those in which for a given edge symbol several rules for this symbol could be applied, but only one (or several) can generate a well-formed formula. The use of a certain rule depends on the context on the right or the left side of the edge symbol.
Additional explanations to the use of some nonevident rules (replacements) for the edge symbols are given in Table 2. The application of rules is conditional in the sense that the result of a symbol replacement must stay within the well-formed formula. The preceding (already existing) context and the intention of a formula development determine the choice of a concrete rule.
5.6.3 Examples: the Application of
Conditional Frame Edge Syntax
Let us show several examples -of the discussed frame syntax. This syntax enables a straightforward generation of frames from the left to the right and vice versa, but also from wherever in the middle of an arising formula and then proceeding on its left and its right side. Within such a frame generation the way to the well-formed formula as the final result must be considered, that is, the design of an adequate (syntactically correct) frame
We can distinguish two characteristic cases of the design by the conditional frame syntax. In the first case, we are confronted with the formation of a complex operator composition, for which certain conditions of generation have to be satis-
fied. In the second case we discuss a general case and point out the conditions where syntax rules could violate the emerging of a well-formed formula.
Example 4 [Generation of a Complex Operator Composition] An operator composition can be begun by several rules of Table 1. The beginning symbol is H and rules H f- [= and H o are both adequate. The used rules can be marked by (a; : y z) where x is the line number, and y and z are the column numbers in Table 1. The shortened marker is simply (a;, z) and marks uniquely the rule of the table. Thus, we will use the marked deduction arrow of the form i—In this way, the final form of a frame (j) can be generated in the following way:
((hoh^ ((NoN ^ ((hoNo ^ ((hoHoM^ ((NoN)oN ^ ((hoNoNo ^ ((hoh)oHoh^
with the current frame 4> at the end of the deduction chain. □
Example 5 [General Formula Generation Violating the Formula Syntax] How, by using the rules, the well-formedness of the emerging formula can be violated? Somebody being acquainted or having the feeling of formula well-formedness can immediately sense the mentioned violation. According to the Table 1, the following illegal (syntactically incorrect) frames (derivation results or their parts) can be generated:
A.P. Železnikar
{x, z) Rule
Explanation
(1>3) (1,4)
(1.9)
(1.10) (1,11) (1,12) (2,11) (2,12)
(3.3)
(3.4)
(3.5)
(3.6)
(3.9)
(3.10)
(3.11)
(4.1)
(4.2) (4,5) (4,10) (5,5) (5,10)
a (a	a begins a function argument formula or a regular serial subformula
a <r- a)	a ends a function argument formula or a regular serial subformula
a a(	a is a function of that which will follow after '(' to the corresponding ')'
a au{	a with 'u' before '(' marks the beginning of an operator composition
a )a;	a is a simple argument (marker) of a function arising before ')'
a <r- )uö;	a marks the beginning of a subformula after an operator composition
1= |=o	1= is the left of an operator composition
1= j=o	1= is the right of an operator composition
o ■f- o[=)	')' marks the end of a complex operator composition
o <r- ([=o	'('marks the beginning of a complex operator composition
o o(	after '('an operator composition will follow
o )o	before ')' an operator composition will appear
o f- a;u(t=o	after a, a more complex operator composition can follow by 'o o('
o o|=)ua'	before a, a more complex operator composition can follow by 'o ^ )o'
o ^ )o[=	before ')', an arbitrarily complex operator composition can appear
(a	a begins a subformula or a function argument in the context )(«
a{	a is a single marker of a function after '(', ')', or |=
)(	)( marks the concatenation of a function formula and function argument
f- «□(	Oi with 'u' before '(' marks the beginning of an operator composition
<— )(	)( marks the concatenation of a function formula and function argument
)□«	) with 'u' before 'a' marks the end of an operator composition
Table 2: Explanation of some contextually nonevident (critical) replacement formulas from Table 1.
.. 0,1 1,4 . 5,1 I—> a I—> a) I—y
0,1 2,1 >1; I—> a I—>
aa

aa
.. 0,2 I 2,5 I
vv 0.1 i>i I 1.2 n-r~
I—a I—a 1=1—\= a
H 0,1	I	I	t
a 1—> \= a I—^ t= F
1,2
1,2
.. 0,1 1,3 , 1,3 ,, 4,3 ,, , 4,9
I—a I—> (a I—> ((a i—> (|= [a i—>
o(|=( a)
etc. Through these examples one can understand the conditionality of the edge syntax. In this way, the intermediate results of formula generation must be proved on syntactic correctness. However, this mode of correct frame generation enables a spontaneous approach in emerging of formulas, connected with semantic (interpretive) concepts of the spontaneously arising formulas. □
6 The Basic Notion of the Gestalt Belonging to an Informational Entity
6.1	Introduction
Gestalts are a kind of interpretative possibilities to a given formula. As one will learn, gestalts can be classified in various directions, the formal and the applied ones. In some respect, the concept of informational gestalt approaches the concept of an informational graph, but in a formal, especially causal and circular sense.
6.2	Informationally Phenomenalistic Gestalts
Informational entity on the formalistic level is nothing else then an informational formula. The simplest form of a formula is a marking operand which marks an entity. Thus, in the very beginning of our discourse we have to put the question concerning the gestalt of a formula (or formula system) on an intuitive level. Later, we will answer the question in an informationally formalistic way,
that is, by an adequate formula expression for informational gestalts.
Gestalt of a formula describes the entire possible structure of causality in the framework of the informational logical consistency, that is, the well-formedness of formulas which follow (can be derived) from the original formula by all possible displacements of the parenthesis pairs.
For example, a sentence in a natural language is a grammatically correct sequence of words and each word in the sentence performs (more exactly, informs) as an autonomous and with other words informationally connected entity, that is, formally, as an informational operand or informational operator (the property, quality of operands which it concerns). Such a sentence hides all possible causal choices (cases, example) of the sentence and this sentence presentation potentiality is called the gestalt of the sentence.
In this way, a sentence can also be understood as an informational graph, which is an ordered structure (sequence, loop) of operands and operators without any parenthesis pairs (grouped informational connections) between words (operands and operators). In a practical case of a sentence understanding, only few of the possible cases of the setting of parentheses pairs are realized (constructed) by the observer of the sentence, that is, only those which fit in the best possible manner the given discourse of involved observers.
6.3 Definition of the Gestalt of a
Formula and of a Formula System
In this subsection, we have to define gestalts concerning a serial and circular formula, and gestalts of parallel formula systems.
Definition 17 [Length of a Serial Formula] The
length £ of a serial formula is an integer being equal to the number of binary informational operators (of type \=) in the formula. □
For the length i, unary operators in a formula do not count. They represent the so-called interna-lism (input) or externalism (output) of operands occurring in the formula. For instance, the length of formula
"N ((«1 l=;h"i) l="2)
is, evidently, £ = 2, where operand oti disposes of its own input and output. Evidently, the last
formula can be expressed by a parallel formula system in the form
a[={ai\= «2); «1 N; h «1
where from the original serial formula the unary parts are removed.
Definition 18 [Gestalt of a Serial Formula] Let
a N (o^l h («2 N • • • («n-l h Ön) • • •))
be a serial formula (p^ of length £ — n. Then, the parallel formula system
'a 1= («1 1= («2 1= ■ ■ (a„_i [=«„)•• O);^ [a 1= ai) 1= (02 ^ • • •	|= a„) • • •);
\(---((a 1=^ ai) 1= «2) [=	\= a^ )
consisting of exactly
1
iV,

n-f- 1

formulas obtained from formula ^p^ by all possible replacements of the parenthesis pairs, including (p-i., is called the gestalt of serial formula (p^. □
A circular serial formula (p^ differs from a serial formula <p^ only in its last (the rightmost) operand which is equal to its first (the leftmost) one and, in this way, performs the cycle concerning the distinguished operand or its circularity (its circular closeness).
Definition 19 [Gestalt of a Circular Formula] Let
be a circular serial formula cp^ of the length £ = n-\-l. Then, the parallel formula system
/a h («1 N («2 h • • • («n-1 N («n h «)) • •
[a ^ ai) [= («2 h • • • («n-i N ("n 1= a)) • • •);
((..■((«hai)h«2)N
consisting of exactly
■ttn-i) \=an)\=a J
A.P. Železnikar
Nr.



Vii
ai 1= «2; \an-l t= Olnj
formulas obtained from formula (p% by all the possible replacements of the parenthesis pairs, including ip^, is called the gestalt of serial formula
^O. □
In the next two definitions the adjective basic will concern the serial formula in the form of transition ^ 1= So, the named parallel formula systems will consist of basic transitions. On the other hand, we have learned the gestalts of complex serial formulas are not basic in the described sense, and they will be marked in other ways.
Definition 20 [Length of a Parallel Basic Serial Formula System] The length of a parallel serial formula system is an integer being equal to the number of serially connected parallel basic transitions (of type rj) in the formula system
including n serially operand-coupled basic transitions). Thus, evidently, = n. □
Definition 21 [Gestalt of a Parallel Basic Serial Formula System] Let be a parallel basic serial formula system of the length = n (Definition 20). The gestalt F of formula system (p^^ is given by
The formula system ^^ means the gestalt of itself. □
This definition says a basic parallel system of the form v'li represents the final achievement with respect to other possibilities of informational presentation (interpretation). In already all its informational possibilities are included.
Definition 22 [Gestalt of a Parallel Circular Basic Serial Formula System] Let ip^ he a parallel circular basic serial formula system of length = n-(-l where
/a\=ai-, ^ «1 1= «2;
Oin-l 1= Oln\
Vo;n|=a ) The gestalt T of formula system (p^^ is given by
Circular formula system means the gestalt of itself □
This definition is in accordance with Definition 21 for the serial noncircular case.
6.4 Parallelization of Serial Formulas and Serialization of Parallel Formula Systems
A serial formula (p^ (each single formula is in a way serial) demonstrates possibilities of its parallelization by comprehending it into detail, where the transition from one operand to another occurs from one to the next operand in the formula sequence. For instance, in a j= (/3 |= 7), as a serial formula specimen, the detailed analysis concerns the transition from a to ß and, then, from ß to 7, that is, formally, 01 ^ ß and /3 |= 7, respectively. Within this view.

/3 1=7
But, the consequence (a /3;/3 |= 7) of this implication delivers all the possible cases (serial interpretations) of the original formula a |= (/3 |= 7). Thus, a further implication is
a [=/3;^
(("N/3) Nt)
and, implication transitively,
(a'N(/3N7))^((«N/3)N7)
6.4.1 Parallelization of a Serial Formula
In which way a serial formula	o>i,..«„) of
the length n could be possibly parallelized (when looking into its details on the operand level, and
ignoring the set parenthesis pairs)? Such a view, ignoring the parenthesis pairs, searches for all possible causal cases of a serial formula presentation when operands and operator keep their places and meanings of the original formula, but the informational relations concerning the parenthesis pairs are changed in all possible manners.
Definition 23 [Parallelization of a Basic Serial Formula] Let Pi mark parallelization and let tti,..., On) be a serial formula. Then,

«11= «2;
V^n-l 1= Ol.
n/
is called the parallelization of the serial formula.
□
On the other hand, the discussed serial formula (f^{a,ai,...,an) has the standardized gestalt «1,..., a«)), given by Definition 18.
Definition 24 [Graphical Equivalence between Parallelization and Gestalt of a Basic Serial Formula]
Two formula or formula systems are graphically equivalent (operator =is) if they have one and the same informational graph <5, where informational graph is presented by circles (or ovals) for operands and by arrows for operators, ignoring the positions of parenthesis pairs in both formulas and formula systems.
Under such circumstances, parallelization of a serial formula is graphically equivalent to gestalt
of the serial formula, that is, □
This definition shows that a gestalt of a serial formula represents, according to the causal possibilities (causal structure), nothing more than the parallelization of the formula. As a consequence, the last definition and the previous ones deliver, evidently, for basic serial formulas the following graphical equivalences:
=0 r{cp.

r(¥'ii) =6
For circular serial formulas, there is, analogously,
r(c^a) =0 V'?;	)
=0	^o
This completes the discussion concerning parallelization and gestalts of serial and circular formulas.
6.4.2 Serialization of a Parallel System of Basic Transition Formulas
Particularized causation can be extracted from parallel formula systems by their serialization. If the parallelization of serial formulas is a sort of causal universalization, the serialization of a parallel formula systems has the meaning in a causal specialization, proceeding from a universal situation to a very particular one.
Definition 25 [Graphical Implication between Serialization of a Parallel Formula and Serial Formula Belonging to the Gestalt] If ^^^ and (p-^ are parallel and serial formulas according to Definition 20 and Definition 18, respectively. Then,
'0	Ssys
Operator	reads 'implies graphically' and
operator £sys means 'is a component of the system'. □
Similar definition can be set in concern to circular formulas.
Definition 26 [Graphical Implication between Serialization of a Parallel formula and Circular Serial Formula belonging to the Gestalt] If (p^ and
are parallel and serial formulas according to Definition 22 and Definition 19, respectively. Then,

,0
e V'«; esys r(^^i)
,0
Operators	and £sys means have the same
meaning as in the preceding, definition. □
This completes the discussion on the.possibilities of drawing the serial consequences (possible interpretations) from parallel systems. In this respect, a serialization on the basis of parallelism has the role to investigate the details, possible particularities and the like.
A.P. Železnikar
6.5 Axiomatic Gestalts
The axiomatic gestalts include all the formulas obtained by the possible replacements of parenthesis pairs in given axioms. In mathematics, axioms function as the given initial and hypothesized theorems and as given rules of inference (method, deduction, induction). It would be extremely interesting to make a look into the background of the axiomatic gestalts and to observe all the possible 'axiomatic' formulas, that is those, proceeding from the given axioms by their 'gestalting'.
Mathematical axioms [7, 8] can be rewritten in an informational form. Let us take the implication axiom set ([7], p. 66) in the form
1)	a^{ß =
2)	{a (a
3)	(«=^/?):

7) (a 7))
For seeing the possibilities in the sense of a gestalt, informational graphs in Fig. 4 can be used. These graphs are uniquely described by the cor-

1)
1) (« 2) (« 3) («
a :
a; a ■ß\ß-7)
ß\ß
t; 7

number of the possible formulas N = where i is the number of =^>-operators in a formula. Thus,
i
r(v2)
4 = 2; iVx = i(^) = 2;
>ß))\
'' a =» (a =» (/? ==» (a = {a a) {ß (a
(((a a) /3) a) /3 e, = 4; iV2 = = 14;
(a ^ /3) (/3 (7 (a 7)));
((((« ^/3) ^/3) 7) «)
4 = 5; iV3 = i(i,°)=42
7
Novikov ([8], p. 75) replaces axioms 2) and 3) by a single axiom of the form
4)
Ì4 = 6; iV4 = i Ce') = 132
The graph of this axiom is presented in Fig. 5 and, as one can see, differs from graphs 2) and 3) in Fig. 4, substantially. The basic parallel formula
Figure 4: A graphical interpretation of the implication axioms where graphs give an insight to the corresponding gestalts r((^i), r((/?2)) and r(((^3).
responding parallel systems of basic transitions, that is,
Figure 5: A graphical interpretation of the implication axiom 4) where the graph gives an insight to the corresponding gestalt F (<^4).
The last system of formulas represents the so-called parallelization of axiom formulas by the most elementary informational-implication transitions of the form ^ ==> Tj.
In the axiomatic gestalts corresponding to the axiom formulas ipi,ip2 and (/53 in 1), 2), and 3), the
a-ß
a-
system for this graph is
4) (a => /3; /3 7; 7 a 7)
The gestalt r(i/?4) includes 132 formulas of length ^4 = 6. Another presentation of the graph in Fig. 5 uses a system of all circular formulas (graph loops), so, system
a a
^	{ß ^ a)
>1
«2
ai
«i+i
that is, numerically, n 11 2 3 4
6
8 9 10
L 0 1 3 6 10 15 21 28 36 45
A parallel formula system for the graph in Fig. 6 is, certainly,
«1 t= «2; «2 h «i;
ai [= «i+i; ai+i \= ai]
1= an, an h "n-l
describes the situation in a particular case.
6.6 Serial and Reversely Serial Gestalts
A serial decomposition of an informational entity roots in the causal nature of entities and its informational components. For instance, the analysis of an entity progresses into the direction of discovering its informational details, stepping deeper and deeper into the structure. But, commonsen-sically, when reaching a deep informational detail, the process can be reversed, so that the analysis proceeds in the opposite direction, for example, verifying the obtained analytical (decompositio-nal) results and accomplishing them informatio-nally. On the level of conscious thought, such a forward and backward informational processing is thoroughly possible.
Let us discuss in short the case of a serial and, simultaneously, reversely serial decomposition of an entity, marked by ai, as shown in Fig. 6. At the first look, this structure is a multiloop
Figure 6: A graphical interpretation of the serially and reversely serially [\n{n — 1)-tuple-loop] structured decomposed system for entity ai.
one. There is, for example, the longest cycle [^max = 2(n — 1)], in which operands appear in the sequence ai, «2, ... , «i, ai+i,	o;„_i, ... ,
a;,... , «2, «i, and there are all the possible other shorter cycles [£ = 1,2,..., 2n - 3]. A short analysis shows that the decomposition structure in Fig. 6 has a number of loops (L)
L =	1)
Let us define, precisely, the kinds of different "loops" and their numbers according to the graph, formula, and causal situation.
Definition 27 [A Concept of Graphical, Formula, and Causal Loop for Simultaneously Serial and Reversely Serial Case] Let & be a graph in Fig. 6. Let us distinguish three kinds of loops:
1.	A graphical loop is a loop visible through the circumspection of the graph which, regardless of its circular structure, is considered as graphically different from all the other possible loops in the graph.
2.	A formula loop is a loop which, in any appropriate form (arbitrary displacements of parenthesis pairs and arbitrary choice of the leftmost operand of the loop) corresponds to the graph loop.
3.	A causal loop follows from a formula loop by an arbitrary displacement of the parenthesis pairs.
The are three different numbers of loops for the graph in Fig. 6:
1.	The number L^ of all graphical loops is
Zg = ^n{n - 1)
2.	The number L^p of all formula loops amounts to
L^ = 2n^
3. The number L^q of all causal loops attains
■žn?

(2ii\ J
where ti marks the length of the corresponding formula loop.
Evidently, Lj^q corresponds to the number of all the formulas included in all the gestalts corresponding to the graph in Fig. 6. □
A.P. Železnikar
6.7 Circular and Reversely Circular Gestalts
Circularly serial decomposition of an informational entity roots in the causal and metaphysica-listic nature of informational entities and its informational components. The reversal circularity brings something new into the discourse of the possible circular structures and its practical implications.
Let us see in short the case of a circularly serial and, simultaneously, circularly reversely serial decomposition of an entity, marked by «i, as shown in Fig. 7. A parallel formula system for the graph
ö-l tZIÖ'2
öj cz!] Oii+I
in Fig. 7 is
/«1 1= «2; «2 1= «3;
(X2\=0!i; ^ Ö3 1= «2;
(Xn-l \= a-n; af„|=Q'n-i; ^On 1= «i; ai\= an )
The n longest serial loops and their counterloops are determined as
«i 1= («i+l N • • • («n-l 1= («n N
K N («2	«i)•••))))•••);
ai t= (a;_i [= • • • (^2 |= («i h
(an h (a„_i 1= • • • (tti+i 1= «i) • • •)))) • ■ •); i = 1,2, ...,n
respectively. The length of each of 2n circular formulas is n and, thus, the gestalts for these formulas only include, altogether,


E
èt A- + 1
(2ei]
Figure 7: A graphical interpretation of the circularly serially and circularly reversely serially -j- 1)-tuple-loop] structured decomposed system for entity ai.
formulas (with all the possible displacements of the parenthesis pairs).
Evidently, according to Definition 27, for the circular case, there is
6.8 Metaphysicalistic and Reversely Metaphysicalistic Gestalts
Metaphysicalistic formulas concern the interior (internal states) of an informing entity and, in this way, replace the reductionistic and rigidly (algori-thmically) determined propositions and predicates. In such a context, metaphysicalistic formulas can behave informationally, that is, as circularly and intentionally spontaneous entities. One can imagine how a metaphysicalistic formula—on the level of the conscious informational phenomenalism in the living brain (mind)—models the nervous processes constituting the essential conscious entity. In neural systems, the one-way (direction) propagation of information—from synapses, dendrites, neuronal somata, axons to the synapses, and so forth—is a commonly recognized phenomenon (scientific philosophy). However, on the conscious or artificial (constructionist) level, the direction of information propagation can be reversed in the form of the thought flow or a machine processing^. Such a reverse metaphysicalistic process can represent an essentially different interpretation of the original process, particularly in the sense of specifically changed causalism where, in a cycle, causes and effects interchange their roles. One has to remind that the original and the reversed process take place in a circularly interweaved environment where a strict distinction of causes and their effects (consequences) is no longer possible.
A metaphysicalistic direct and reversal structure is graphically presented in Fig. 8. The bidirectional metaphysicalistic system is much more cycled than a standard metaphysicalistic system of an entity, represented in Fig. 10. While the original system has only 6 loops, the additional reversing causes a 30-loop structure, according to the formula ^n[n-\-l) and Fig. 8, +1) -t-4 = 32 for n = 7. The reader can calculate the entire complexity of the circularly structured me-
^See, for instance, the thermodynamic theory of thought processes [11],


ß

entity's metaphysicalistic metaphysicalistic metaphysicalistic metaphysicalism informing counterinforming embedding
Figure 8: A graphical interpretation of the circularly (32-tuple-loop) structured basic metaphysicalistic system of informational entity a being informationally impacted by the exterior entity ß.
taphysicalism and its reverse in the form of particular informational gestalts by himself/herself.
6.9 Parallel Gestalts
Parallel gestalts are nothing other than gestalts of gestalts. One can imagine in which way such a situation appears when examining parallel systems of arbitrarily structured (e.g., circularly circular) serial formulas.
Parallel gestalts concern systems of serial formulas where each formula has its gestalt and the possibilities of different serial formulas included in different gestalts has to be considered. Evidently, in such a case, the sum of the gestalt possibilities can be taken into account.
For a gestalt of a gestalt as a parallel system and a gestalt of parallel systems of basic transitions, there is evidently,
7 Inference Gestalts
Besides axioms, inference rules build up the skeleton of logical inferetialism, consisting of processes of deriving, deducing, inducing, abducing, etc. in the framework of logical reasoning and theories constructing.
Modi informationis (modes of informational inference) can be used in different informational deduction processes. The basic rules (principles) of inference in the propositional and predicate logic are, for instance, substitution, modus ponens and modus tollens, which perform in the framework of truth and falseness (the principle of tertium non datur). The reader can imagine how these rules
can be transferred in the realm of informational entities (a kind of informational logic). Beside these principles, other inference rules can be introduced, known in the common speech as modus agendi, essendi, rectus, operandi, obliquus, vivendi,' possibilitatis, etc., as already constituted in the Latin language. Informational modus
(1)

decomposition loop

(2)
Figure 9: Informational graphs for informational modus agendi (1), modus ponens (2), and modus tollens (3).
agendi (informational phenomenalism) of an entity, marked by operand a, is shown by graph (1) in Fig. 9 and represents the mode in which an informational entity (operand, phenomenon, thing, process) acts or operates (informs and is informed), that is,
't^a; \	input (internalism)
a^ «1=;	output (externalism)
^a j= aj interior (metaphysicalism)
The 'feedback' arrow (vector) hides the potentiality of a's decomposition (the a's metaphysicalism a\= (x), the input arrow presents the a's internalism 1= a, and the output arrow the o;'s externalism a \=.	•■
Informational modus ponens represents the interpretation of the logical (philosophical) modus
90 Informatica 20 (1996) 65-94
A.P. Železnikar
ponens into informational realm. Similarly, the informational modus tollens represents the interpretation of the logical (philosophical) modus tollens into informational realm. The detachment (modus ponendo ponens and modus tollendo tollens) formulas are, respectively,
basic transition formulas determined by the paths between two entities. So, let us construct the parallel system (Fig. 10) in the form
(mp) ——-—— and (mt) —



i t= J,; ■ ^ i,;	i, ^
(t. N c,;	c, ^	(S, ^ e,;
U t=	c, 1=	e, f=
Vq 1= J,;	e, 1= C,;	e, |= t ,
Their graphs (2) and (3) are drawn in Fig. 9. The arrows are marked by ';' (informational parallelism), (informational implication), ^ (informational detachment), and (particular no-ninforming). It is to stress that (^ It^ (^ It^; It^ expresses the modus existendi of a certain noninforming of the entity presented by the informational operand
Evidently, the gestalts for MP (rule marker /5j^p) and MT (rule marker p^^) are, respectively, equal to the rules themselves, that is,
r(pMp) = PMP and	= p^^
because the parallel structured premises a; a ß and a => ß-,ß\^ß have to be treated as indivisible entities.
Several other forms of modi informationis can be discussed. The one concerning the intention of an informational entity is the so-called modus rectus, by which the detachment of the intention of an informing entity would be possible.
8 Intelligent Gestalts
Intelligent gestalts are a consequence of intelligent informational formulas. Which kind of a formula could be comprehended as informing in an intelligent way? What is intelligent informing?
Definition 28 [Intelligent Informational Formula]
An intelligent informational formula possesses its specific intelligent metaphysicalism [13]. The rough structure of an intelligent formula, not being decomposed to the necessary details yet, is given by Fig. 10 and expressed as intelligence i concerning a (functionally) by the parallel system of basic transitions. □
How can the graph in Fig. 10 be described in a form embracing all possible formalistic phenomenalism? The answer is: by a parallel system of
Let us write only one formula for each of the six loops in Fig. 10, considering the input operand a. For the shortest loops to the longest one, there is,
J, ^(i, hCfJ; N (c. hö^O; N (e. N (S.);
N (c. N (c. N (e. N V. N N (Ù N Ö:. N (c. N h (e. h 01/
This system consisting of the input formula /3 t and the six circular formulas of lengths £ — 2,4, and 7, describes only à very small part of the informational graph in Fig. ,10. The length £ of a formula is determined by the number of binary operators occurring in it. The subgraph corresponding to the system of circular formulas in parallel cannot be drawn in a usual form because the informational graph (without parenthesis pairs) expresses the entire parallelism and, in this respect, informational gestaltism. But, this reduced or specifically particularized system already determines the discussed parallel system which can be constructed out of the reduced system of circular formulas and formula ß \= i.
8.1	Interpretive Gestalts
Interpretive gestalts are a consequence of interpretive formulas by which existing formulas are "interpreted" by the introduction of additional parallel formulas or by further serial decomposition of the existing formulas. The process of interpretation is not limited in advance because at each system situation a new interpretive detail concerning the system constituents can be added.
8.2	Understanding Gestalts
The understanding system v in Fig. 11, understanding and producing the meaning of something
o
intelligent entity
intelligent informing
intelligent counterinforming
intelligent embedding
Figure 10: A graphical interpretation of the circularly (sextuple-loop) structured parallel metaphysica-listically intelligent system of informational entity i.
ß, is a paragon of a sufficiently complex understanding device, concerning various phenomenalisms in philosophy, psychology, cognitive science, artificial intelligence and, first of all, the general theory of the informational.
Let us introduce the following markers (entities) of understanding (subscribed by v) and their initial meanings:
V understanding; intending; sensing; observing; being conscious; iijj being unconscious; <ty conceiving; signifying; making sense; perceiving; 3v concluding; fi-y meaning
ß to be understood; intention; sensibility; Oy observation; by consciousness; u„ unconsciousness; Cy conception; significance; reasonableness; perception; conclusion;
The longest loop in Fig. 11 has 22 binary operators, that is, §(22) ~ 201 261 630 000 possible interpretations alone within its informational gestalt.
9 Gestalts of an Informational Machine
A formal informational machine [19] is a formula system which can handle informational formulas in an informational manner. To set a formal machine conceptually means to make possible the design of a physical informational machine by which informational formulas can be processed informa-tionally. This means that the faculty of informational arising (emerging and appearing of infor-
mational formulas) is offered to any informational formula representing an informational entity within the informational machine.
The question which arises is what is the basic formal structure of the informational machine. Which entities must enter into the machine concept to enable the informing of the processed formulas by the machine? Answers to this question are given on some other place [10, 19]. But, for our discussion concerning informational gestalt, it is important to stress that among other features, informational machine must be able to produce and observe the gestalts of appearing and processed informational formulas.
On the other hand, complex circular formulas, being a part of the informing machine itself, are, as any informational entity, observed by the machine also through the gestalt possibilities and being adapted consequently in dependence of the arising circumstances.
10 star Gestalts
The idea of the star gestalt roots in the formula circularity, although it can be defined for straightforwardly serial formula too.
Definition 29 [A Concept of Star Gestalt of a Serial and Circular Formula] For the star gestalt, F*, of a serial formula, (p-^, that is	there is
where	is a system of all gestalts for for-
mulas (p^ (subformulas) obtained from formula (p^ as possible, in respect to shorter formulas. Formula is any formula which follows (is constructed) from the graph of formula (p^, starting from an arbitrary operand in the direction of
signifying
making sense
perceiving
concluding meaning
understanding embedding
Figure 11: A graphical interpretation of the circularly (16-tuple-loop) structured (metaphysicalistic) parallel understanding system v of entity ß.
arrows (operators) to an arbitrary point (operand or operator).
The star gestalt of a circular formula, includes an infinite number of formulas obtained in the following manner:
(a)	Let ©(v^^) denote the graph of circular formula	that is a circular informational graph.
(b)	Let the construction of a formula begin at the place of any graph operand (circle, oval).
(c)	A formula ip, serial or circular, of the star gestalt	is any formula obtained by moving from a starting circle (operand) along the graph arrows (operators) and ending at the arbitrary circle or arrow.
□
The example which follows shows how formulas can come to existence within a star gestalt,
using an informational graph. Let us show how, according to the given informational graph, an informational formula can arise. In [14], p. 63, Formula (63), the following formal situation of the understanding il and metaphysicalism /t^ within an intelligent l entity is discussed:
il-loop
/il—loop
(. — loop
This formula is a step towards the decomposition of entity l. The underlying graph for this formula is shown in Fig. 12.
The il-, Ur, and i-loop are not meant to be informationally (causally) isolated loops. Markers it, /Xt, and L can appear in other formulas of a system. In this way, they can stay open in respect to any actual environment, being changed by the exterior influences.
The reader can see how different informational formulas can arise from the informational graph

Figure 12: A graphical interpretation of the circularly (double-loop) structured parallel transitional system {l \= il;il |=	[= f, fx, ^11).
in Fig. 12. To obtain regular informational formulas, arbitrary transitions, for example, from a beginning entity in the graph, say i, to an entity are necessary. The following formula system (a part of the star gestalt) shows the emerging of formulas using the graph in Fig. 12:

In such a transition through the graph, the choosing of parenthesis pairs is spontaneous and, in this respect, different informational formulas can come into existence. Looking causally, each of the listed formula represents a special case and can deliver, for example, different semantics concerning the involved entities of a formula.
	1= «11 ■	1= «12■		■«lm ■	N	■^o ,1
						
a	1= «21 .	' 1= "22 •		■ •"2m ■	>	
						
	N (Xkl .	■ N ■	■h-	■■(Xkm	•N	
						
Ö1 02
Ok
4>a
h-Al		■.■ßln	N	^ ß,\	
					
h-/321		■■ß2n	N	<t>l3,2	ß
					
		■•ßkn	N	'l>ß,k	
					
11 Conclusion
The introduced informational frame and informational gestalt view have brought new insight into the possibilities of a general informational theory [13, 14, 15, 16, 17, 18, 19, 20].
The informer-observer problem can be exhaustively analyzed through the study of framing of the transition a j=o|= for instance, in the form
What is a's informing and what is /?'s observing? How does phenomenon a inform its reality and how does phenomenon ß observe the a's reality? Is the informing of a nevertheless observingly pre-understood by ß and, in this way, is a's reality comprehended in advance by ß? The last enframed and formalized formula system offers not only various possibilities for such interpretations, but also much more than it can be said by words.
The reader can observe that both (pa and <f)ß are harmonious frames, that is well-formed formulas. In this respect, a is an independent informer and ß is an independent observer. Phenomenon a informs its reality by k different phenomenalisms, so phenomenon ß can observe these k phenomenalisms of a by the k properly chosen observational phenomenalisms, when frames (pa and (j)ß are concatenated by the separator frame 4>o (performing as a concatenation operator) into the resulting transition of the form a\=o\= ß.
Let us suppose that a informs its reality by k different phenomenalities and that ß has chosen k adequate phenomenalities for observing of a. The number of different phenomenalities is not limited and, in some way, it depends on the observing capabilities of /3, so, in principle, k ^ oo.
What are the observing phenomenalisms of ßl Let a represent a physical phenomenon in the sense of the contemporary physical sciences. The observing entity (observer, apparatus) ß can use different theoretical and experimental concepts, methodologies, and methods
for observation of a, for example, mathematical (formalistic, recursive), Euclidean (geometrical), Newtonian (gravitational), Hamilto-nian (mechanical), Maxwellian (electromagne-tical), Lorenzian (particle-motional), Einstei-nian (relativistic), quantum, Hilbertian (axiomatic, spatial), Schröderian, cosmological, Weylian (quantum-gravitational), computational, mind-informational [9], Slechtanian (informationally thermodynamical [10, 11]), etc. All these theories and methods belong to scientifically accepted forms of informationalism (artificialisms, metho-dologisms, scientisms, cybernetism) and depend on human consciousness, intuition, and 'reality-adequateness'. By development of science, theories and methods will improve, change, and their number will increase. But, the observer entity will observe only the phenomenalisms for which it is theoretically, methodologically, and experimentally capable [4]. In this sense, the preunderstan-ding of ct's phenomenalism will be, in a way, pre-understood by the observer.
References
[1]	Patricia Smith Churchland: Neurophììoso-pby. Towards a Unified Science of the Mind-Brain. A Bradford Book. The MIT Press, Cambridge, MA, 1986.
[2]	J.A. Fodor: The Language of Thought. Harvard University Press, Cambridge, MA, 1975.
[3]	J.A. Fodor: Representations. MIT Press, Cambridge, MA, 1981.
[4]	D. Gernert: What Can We Learn from Internal Observers? In H. Atmanspacher and G.J. Dalenoort (Eds.): Inside versus Outside, Springer Series in Synergetics 63 (1994), 121-133.
[5]	M. Heidegger: Sein und Zeit. Sechzehnte Auflage. Max Niemeyer Verlag, Tübingen, 1986.
[6]	M. Heidegger: Being and Time. Translated by J. Macquarrie & E. Robinson. Harper k Row, Publishers, New York, 1962.
[7]	D. Hilbert und P. Bernays: Grundlagen der Mathematik. Erster Band. Die Grundlagen
der mathematischen Wissenschaften in Einzeldarstellungen, Band XL. Verlag von Julius Springer, Berlin, 1934.
[8]	n.C. Hobhkob: dACMetmu MameMamunecKOu AOzuKu. FocyaapcTBeHHoe HsaaTejibcTBo ^h-3hko-MaTeMaTOTeCKOH JIHTepaTypBI ( $IDMaT-THs), MocKBa, 1959.
[9]	R, Penrose: The Emperor's New Mind. Oxford University Press-Vintage, New York, 1990.
[10]	J. Slechta: The Brain as the 'Hot' Cellular Automaton. Proceedings of the 12th Int. Congress of Cybernetics, pp. 862-869, Na-mur, Belgium, 1989.
[11]	J. Slechta: On Thermodynamics of the Thought Processes within a Living Body in a Changing Environment. Proceedings of the 12th Int. Congress of Cybernetics, pp. 878885, Namur, Belgium, 1989.
[12]	A.N. Whitehead and B. Russel: Principia Mathematica. Cambridge University Press, Cambridge, 1950 (in three volumes).
[13]	A.P. Železnikar: Metaphysicalism of Informing. Informatica 17 (1993) No. 1, 65-80.
[14]	A.P. Železnikar: Formal Informational Principles. Cybernetica 36 (1993) No. 1, 43-64.
[15]	A.P. Železnikar: Verbal and Formal Deduction of Informational Axioms. Cybernetica 37 (1994) No. 1, 5-32.
[16]	A.P. Železnikar: Informational Being-in. Informatica 18 (1994) No. 2, 149-173.
[17]	A.P. Železnikar: Informational Being-of. Informatica 18 (1994) No. 3, 277-298.
[18]	A.P. Železnikar: Principles of a Formal Axiomatic Structure of the Informational. Informatica 19 (1995) No. 1, 133-158.
[19]	A.P. Železnikar: A Concept of Informational Machine. Cybernetica 38 (1995) No. 1, 7-36.
[20]	A.P. Železnikar: Organization of Informational Metaphysicalism. Cybernetica 39 (1996) (to be published).
A Distributed Client/Server Benchmark for Network Performance Measurement
Jozo Dujmović,
Department of Computer Science, San Francisco State University 1600 Holloway Avenue, San Francisco, CA 94132, USA jozoOcs.sfsu.edu AND
Robert Kinicki, and Sanj ay Subbanna
Department of Computer Science, Worcester Polytechnic Institute 100 Institute Road, Worcester, MA 01609, USA rek@cs.wpi.edu, sanju@cs.wpi.edu
Keywords: network benchmarking, network traffic generator, LAN performance measurement, WAN performance measurement, client/server pairs
Edited by: Rudi Murn
Received: May 12, 1995	Revised: September 9, 1995 Accepted: January 15, 1996
We have developed a scalable network traffic generator and a general computer network benchmark for Unix platforms. This benchmark can be used to evaluate performance of user-level applications which interface directly with the transport layer of TCP/IP running on all types of computer networks. The network workload consists of distributed client/server process pairs (DCSP) and is called the DCSP benchmark. It can include any number and any distribution of communicating client/server pairs, yielding a very high level of flexibility and scalability of network trafBc. We propose a standard clas-sifìcation of network workloads, defìne network performance indicators, and introduce performance measurement methods based on various versions of the DCSP benchmark. We also present experimental results generated using DCSP workloads to compare LAN configurations, study network saturation phenomena, and test WAN communications.
1 Introduction and Background There are two extreme standpoints: (1) benchmarking is meaningful only when performed with
Computer performance evaluation is used for sol-	^^^^^^^ ^^ ^PP^^^^"
ving a variety of problems related to system de-	Programs) (e.g. [8], p.48), and (2) benchmar-
sign, system comparison and selection, and sy-	^^^^ meaningful results with synthe-
stem tuning/optimization. Traditional perfor-	^^^ workloads including industry standard ben-
mance evaluation techniques include analytic mo-	^^^^^^ ^^ ^^^^^ ^^^^^^ ^^^ P^^"
deling, simulation, and benchmarking (perfor-	Pose of benchmarking is to provide inputs for the
mance measurement using benchmark programs).	P™^'^®® ^^ evaluation and selection of computer
In this paper we are interested in network bench-	systems, the disadvantages of natural workloads
marking as a technique for solving system compa- ^The term "synthetic workload" denotes here all gene-
rison and selection problems.	workloads that differ from specific application programs
of a given user. For example, such synthetic programs
Benchmarking has been an established tech- include small kernel programs (e.g. Eratosthenes' sieve),
nique for evaluating computer system perfor- specifically designed synthetic workloads (e.g. Dhrystone),
mance since the mid-1960's, but controversies re- benchmark programs based on frequently used subroutines
1 -4- J i. u U 1 •	i-ii c	il	(e-g- UNPACK), and all other benchmarks based on se-
lated to benchmarkmg are still frequently encoun- , , ,	,	, . ,
°	-1 J	lected more complex program products, such as compilers
tered [10, 8]. The most frequent source of different (e.g. gcc), interpreters, electronic circuits design programs, opinions is related to the selection of workload, etc.
are well known and include high cost, low portability, and reduced credibility of natural workloads related to changes of workload caused by changes of equipment [5, 4], On the other hand, synthetic workloads and standard benchmark suites are suitable because they usually provide low cost and high portability of measurement, the computer industry strongly supports this approach [3, 6, 7,13, 18], and theoretical analyses show that errors caused by differences between natural and synthetic workloads are automatically reduced in all cases where individual performance indicators of a computer system are positively correlated [4]. Therefore, there are good reasons to believe that synthetic benchmarks can generate valuable performance indicators, useful as inputs for the decision process of evaluation and selection of competitive computer networks.
However, utilizing benchmarks to analyze computer network performance and extending benchmarking techniques into the domain of distributed systems has proven to be a complex problem. The plethora of possible choices for topology, communication configuration, network technology, protocols, file servers, computer platforms and operating systems yield an extensive problem domain.
There have been a large variety of research studies attempting to answer questions related to some aspect of computer network performance. This paper presents results from a new. general purpose computer network benchmark based on distributed client-server pairs (DCSP). Before discussing the philosophy of the DCSP benchmark, it is useful to review a few of the efforts from the Hterature which analyze computer network behavior and identify performance bottlenecks.
One major focus in computer network performance has been to evaluate and analyze techniques for improving the response time of communications mechanisms such as IPC^ [2, 15] and
^Following is the meaning of standard abbreviations: IPC = Inter Process Communication, RPC = Remote Procedure Call, FDDI = Fiber Distributed Data Interface, ATM = Asynchronous Transfer Mode, IP = Internet protocol, TCP = Transmission Control Protocol, UDP = User Datagram Protocol, LAN = Local Area Network, WAN = Wide Area Network, ARP = Address Resolution Protocol, PING = Packet InterNet Groper, ICMP = Internet Control Message Protocol, NFS = Network File System (network file access protocol developed by SUN Microsystems), or (depending on context) Network File Server, FTP = File Transfer Protocol.
RPC [16, 17, 1]. For example, Thekkath and Levy [17] explore avenues for achieving low-latency communication by evaluating the low-level performance of a proposed RPC mechanism running on new-generation networks (FDDI and ATM). Using a very specialized testbed consisting of two DECstations communicating over an Ethernet, an FDDI, and a simplistic ATM LAN, they are able to determine that network controllers will play an increasing role in affecting communication latency in high-throughput LANs. Although a very thorough study, their results rely heavily on higher level assumptions about distributed application behavior, and their measurement procedures are too custom-tailored to be broadly useful.
- Cabrera et al. [2] present a study of the impact of IPC mechanisms in Berkeley Unix 4.2BSD. Using artificial workloads, they present a detailed timing analysis of the dynamic behavior of TCP/IP and UDP/IP communication protocols. Using user space processes to send messages, network throughput and latency are obtained for Ethernet implementations. They assess the effects of different processors, network hardware interfaces, and host loads on network interprocess communication. A significant result of their study is identifying checksumming and data copying as expensive components of an IPC sent over TCP/IR
By heavily instrumenting Sun Sparcstations connected over an Ethernet, Papadopoulos and Parulkar [15] studied the performance of SUNOS IPC over TCP/IP. They analyzed the effects of socket buffer size, window size, checksum costs and interactions with the operating system. They conclude that IPC requires a major portion of the CPU cycles to sustain high Ethernet utihzation.
Kay and Pasquale [11] instrumented the TCP/IP and UDP/IP protocol stacks in the DEC Ultrix4.2a kernel and connected an HP Logic Analyzer to a Hnk connecting DECstation 5000/200 workstations. They show that non-data touching components such as device driver code, m&w/management, and ARP processing can contribute significantly to the TCP overhead in network communications.
These studies are valuable for identifying specific pieces of the network communications mechanism which significantly contribute to overall performance. However, these research efforts are not
intended to provide general tools for evaluating and comparing the performance of specific network configurations. Very .little published work is available on generic network benchmarks which utilize distributed applications; this is excluding transaction processing benchmarks (e.g. [13]), and various proprietary benchmarks. We briefly comment on three of the known efforts.
At HICSS-26, Kinicki and Finkel [12] report on measurements made using the WPI Benchmark Suite to compare the performance of Mach 2.5 versus Mach 3.0 supporting client-server interactions between two HP 486 PCs communicating via TCP/IP. The benchmarks allow for multiple clients to interact over an Ethernet with a single server. This study does provide insight into comparing operating system support for chent-server activities, but the benchmarks are hmited in that the chents are not distributed over all the machines on the network.
LADDIS [18] is a distributed apphcation benchmark which specifically focuses on evaluating NFS performance on a network. Accepted as a SPEC benchmark, LADDIS is designed to be scalable and managed from a single prime client. Using a "standard mix" of NFS operations, LADDIS is a benchmark to be used to compare NFS implementations. Multiple clients per server are supported and a number of tunable control parameters are available. Some LADDIS disadvantages are: it only evaluates NFS behavior, reliance on UDP, and only balanced round-robin resource allocation. The abihty to disable UDP checksums is necessary for vendor compatibihty, but causes concern when comparing competing vendor platforms.
Netperf [9] is a public domain network benchmark developed by the Networking Performance Team at the Information Network Division of Hewlett-Packard Company. It performs TCP and UDP tests using bulk data transfer from client memory to server memory (i.e. no disk file transfer is involved). The measured performance indicators include elapsed times, throughput, etc. Netperf is documented as a tool, but the directions for its usage and the interpretation of its results are not available.
The DCSP benchmark presented in Section 2 fits in with these three network benchmarks in that it is an appHcations level network bench-
mark. It is intended as a general purpose network benchmark to be used to solve two classic benchmarking problems: (1) performance evaluation, comparison and selection of competitive networked computer systems and (2) performance monitoring, tuning, and optimization of existing networks.
2 The Organization of Communicating Client/Server Pairs
The nature of workload in computer networks can be abstracted to a combination of local processing performed by individual nodes, and unidirectional or bidirectional communication between pairs of nodes. Bidirectional communication is the most general model of network workload, because it can include a scalable amount of local processing, as well as file transmit and receive operations. Additional reasons to organize network benchmarks using a cyclic data transfer (i.e. the transfer of data from a client to a server, and then back to the client) are: (1) scalability, (2) flexibihty, and (3) elimination of clock synchronization problems. The cyclic data transfer operation can be repeatedly performed for any specific number of pairs of nodes in the network. The total network workload can be scaled and adjusted by selecting both the number and the distribution of communicating client/server pairs per network. This yields high flexibihty in workload design and is applicable for all types of networks. An important advantage of this organization is that performance measurements can be accomplished independently on both the client and the server side of each communicating client/server pair, eHminating the problem of clock synchronization in the network (performance indicators are measured independently for each node). Since all processes running on each node collect the performance data, it is possible to obtain a detailed insight into the activity and performance of both the individual nodes and the whole network.
There are a number of other requirements that network benchmarks should satisfy. First, in all cases it is highly desirable that all network performance measurements can be accomplished by a single performance analyst. The workload must
be suitable for all sizes of networks (LAN, MAN, WAN), and all major network topologies. The benchmark should use the same hardware and system software resources that are used by actual network users. In addition, if benchmarking is performed in a Unix environment, then benchmarks should be portable across most Unix systems.
The basic structure of our client/server program is presented in Fig. 1. The benchmark program has two entry points corresponding to the client and the server processes. For each data transfer operation there are four combinations for the location of the source and destination of data, and data transfers can be organized (1) from memory to memory, (2) from memory to disk, (3) from disk to memory, and (4) from disk to disk. In this way it is possible to include or exclude the activity of the file management part of the operating system. As in all black-box measurement methods it is possible to subtract elapsed times for measurements performed with and without a selected feature in order to isolate the net effect of the analyzed feature (in this case the Unix file management system).
The main loop consists of three operations: SEND, RECEIVE, and WAIT. The client process starts by requesting connection to the server node. After establishing the connection and performing other initialization operations the chent process repeats a number of SEND-RECEIVE-WAIT cycles. In each cycle it sends a data file to the server, receives the same data file back from the server, and then waits a random time interval before starting a new SEND-RECEIVE-WAIT cycle. The transmitted data file consists of M data quanta whose size is Q. The transmitted data file size is MQ, and it can be adjusted by selecting the values of M and Q. The maximum control over the data transmission process is obtained if the quantum size is selected to be the default TCP socket buffer size, and in our experiments we used Q = 4 KB (4096 bytes). In cases where the random wait time is uniformly distributed it can be adjusted by selecting its average value Tw (of course, the highest traffic intensity can be achieved by selecting T^ — 0). When the measurement is completed the client process sends an end-of-measurement message to the server.
The server process starts by accepting the
connection from the client and then repeats a RECEIVE-WAIT-SEND cycle until it receives the chent's message that the measurement is to be terminated. The cHent and the server activities are integrated in the same program, but they differ both in the start of measurement and the end of measurement process. The server process should be started before the client processes.
The performance measurement consists of a selected number of cycles and at the end of each cycle we log a record consisting of the following data:
•	u[i] = accumulated user's CPU time at the beginning of the send operation,
•	s[i] = accumulated system's CPU time at the beginning of the send operation,
•	= begin time of the send operation,
•	t2[i] — end time of the send operation,
•	t3[i] = end time of the i"' receive operation.
The recording process starts at the predefined time t start and ends at the predefined time tstop, but the client and server processes will remain active until the time tend- In other words, if the real time t satisfies the condition tstart < t < tstop < iend , then all chent and server processes generate log files containing the following records: u[i],s[i],ti[i],t2[i],t3\i] , i = 1,... ,m . Here m denotes the number of cycles performed during the recording interval [tstart, istop]- The evaluator must start (manually or automatically) all chent and server processes sufficiently before the time tstart- In such a way the network traffic attains a steady state before the beginning of the measurement process. The process of recording measured values in the log file terminates at the time tstop when the traffic is still in steady state. Thus the clocks on various nodes need not be synchronized, because the data transfer will continue until the time tend- At the time tend all client processes send the end-of-measurement message to their corresponding server processes, and both the client and the server processes terminate. Therefore, if the maximum difference between the un-synchronized clocks of nodes in the network is less than tend — tstop then the whole measurement will
(^^^^ent entry^	(^^^rver ent^^ ^
t< t
start
\l
t< t
start
Establish(request) connection
Initialize(open flies,
allocate resources, etc.)
£stablish(accept) connection Initialize(open files, allocate resources, etc.)
= current time
start ~ recording measured values
t = stop recording stop measured values
t = experiment
termination time
Memory
a
Data
	•
Disk	
Send
Measure t2
t< t
end
Memory
Receive
Data
Measure u,s, t 1
t>= t
end
Received quit message
Send quit message Disconnect Free resources
I

Disconnect Free resources
(^^^Client exit	(^^erver exit~^)
Main loop
Wait
n
Disk
Measure ' t.
Log
IF(t start <t< t stop)
THEN Save u, s, t j ,t 2 ,t 3
Fig. 1: Basic Structure of the Client/Server Program
- Client Cycle Time -^
0 ti
Client Round Trip Time t2
T (time)
Client
Client
Send
Receive
Server
Receive
blocked —^
Wait
Wait
Send
Send
I
Receive
J- blocked
I
I
Server Round Trip Time
Wait
Server Cycle Time
T (time)
Server
Fig. 2: Timing Diagram for a Client/Server Pair
be performed during the steady state of the network traffic. By selecting sufficiently large values of tend any difference between unsynchronized clocks can be easily compensated. This is important in all cases where we have no control over some (usually remote) node clocks.
Using the recorded times , s[i] , , ) t^li] , i — l,...,m we can define the following performance indicators of the cycle (i = l,...,m-l):
•	Send Time:	= t2[i\ - ti[i]
•	Receive Time: Tr[i] = tsli] - t2[i\
•	Wait Time:	Tu,[i] =ti[i + l]- ^3(1]
•	Round Trip Time: = T,[i] + Tr[i]
= h[i\-ti\i]
•	Cycle Time:	Tc\i] -h 1] - ti[{\
•	User Time:	T„ser[«] = u[i + 1] -
•	System Time:	I'sysli] = s[i + 1] -
•	CPU Time:	Tp\i\ = TuseAi] +
•	Processor Utilization per process:
U[i\ = TMlTc\i\
These performance indicators are random variables that depend on the workload distribution and parameters (5,M, and as well as on the performance of the network hardware and software. Competitive networks can be compared using the parameters of distributions of these random variables, primarily the average values T«, Tr, Tw,
Rj Tqj ^usevi ^sys-i ^p'i	JJ.
The changes of states and relationships of round-trip cycles for chent and server processes are shown in Fig. 2. In cases where all nodes are identical and the network workload is balanced (i.e. each node has the same number of server and client processes) the mean values of client and server cycle times and client and server round-trip times should be the same. In all other cases (unbalanced workload, different routing, etc.) that is not the case.
Network measurements based on a specific form of data transfer can also be realized using any of network-oriented Unix commands such as ping, ftp, mail or finger. The simplest Unix-based cyclic
data transfer can be realized using the ping command. The ping mechanism is based on ICMP and limited to repeating round-trips between two nodes with packets having machine dependent size of at least 64 and no more than 4096 bytes. It reports only the packet loss statistics, and the round-trip times. It is primarily designed for manual fault isolation, and its use in system evaluation and selection studies is severely limited. However, an expanded version of ping, called windowed ping, can generate a controlled volume of network workload and can be used for testing network performance. This approach was successfully implemented by Mathis [14] to test and evaluate the IP performance of specific network products and technologies. In the case of system evaluation and selection studies this method has a limited value, since it is primarily Internet oriented, and has some of limitations of the original ping approach.
The proposed cyclic data transfer measurements offer more flexibility than the Unix command approach. The DCSP workload is designed particularly for network evaluation and selection analyses: it has more adjustable parameters than Unix commands, it can be organized to collect more performance indicators, and it is suitable for all types of networks. In addition, the DCSP benchmark is useful for the performance analysis of various file location strategies (memory, local disks, or NFS disks), and for measuring the operating system overhead.
3 The DCSP Network Workload
The communicating client/server pair is the basic building block of our synthetic network workload. The structure and the traffic intensity of the network workload can be determined by defining a suitable distribution of client/server pairs over the nodes of the analyzed network. We assume the general network configuration shown in Fig. 3, where workstations (or more complex computer systems) share a common transport medium with or without network file servers. The communicating client/server pairs can be organized in two ways as shown in Fig. 4. We use either asymmetrical (simplex) or symmetrical (duplex) communication between two nodes. In the case
of asymmetrical communication one node has a client process and the other node has the server process. In the symmetrical case the communication is bidirectional: each node includes both client and server processes. If a higher level of network load is needed, then each node will have more than one client and server process.
The DCSP network workload consists of any number, and any distribution of symmetrical and/or asymmetrical client/server pairs. It is obvious that the number of different network workloads that can be built in this way is practically unlimited. It is also clear that any workload can be obtained by using multiple copies of the same basic client/server benchmark program. Therefore, this family of network workloads is extremely flexible, scalable, controllable, and also reliable, because it consists of well tested uniform basic elements. All these properties certainly contribute to our ability to create proper network workloads; they are necessary, but not sufficient to claim that selected DCSP workloads properly represent specific real network workloads and automatically generate meaningful network performance indicators. At this point we can proceed in two directions: (1) to develop basic standard network workloads that use the most common network topologies and generate meaningful general network performance indicators, and (2) to develop a methodology for modehng arbitrary real network workloads using optimum distributions of communicating chent server/pairs. Of course, both directions are feasible, but only the first direction will be covered in the rest of this paper.
The basic network and workload parameters of the network benchmarking process using communicating client/server pairs are the following:
•	the number of nodes in the network: n ,
•	total number of client and server processes per network: N ,
•	the average number of processes per node (node load): L := N/n ,
•	the individual node load: Lj, j = 1,... ,n .
A specific organization of network workload can be achieved by adopting a given distribution of client and server processes over the nodes of a network. In the case of solving the problem of
comparison and selection of competitive networks we suggest the use ot balanced workload where Li = L2 — ■■■ = Ln = L = const. Of course, the balanced workload is an idealized case, but it is the most important special case of network workload, because load balancing efforts are usually aimed at achieving this specific case.
In cases where the purpose of network performance measurement is to study the sensitivity of the network to nonuniform load distributions, the workload must be unbalanced, and the number of processes per node will vary from node to node. Such workloads can be easily organized by nonuniformly distributing the communicating client/server pairs performing the cyclic data transfer. The unbalanced network workloads can be organized in more different ways than the balanced workloads, and an important special case that models a network file server (NFS) and a number of diskless workstations is exemplified in Fig. 5. This characteristic structure of workload will be called the star structure. The clients take data from local memory, and server processes (all located on the network file server node) save data on the server disk(s). This type of the DCSP synthetic workload simulates the high intensity of NFS disk traffic and provides an adjustable level of data communication with clients.
Balanced network workloads should be organized with varying levels of node load. According to the increasing level of node load we suggest the following classification of balanced network workloads:
•	Pairs Structure:
-	asymmetrical: N = n , L = 1
-	symmetrical : N = 2n , L = 2
•	Ring Structure
-	unidirectional: N = 2n, L = 2, (n > 2)
-	bidirectional : N — 4?i, L — i, (n > 2)
•	Combinatorial Structure:
N = 2n{n - 1) , L = 2(n - 1) , (n > 3).
Assuming an even number of nodes the asymmetrical pairs structure of network workload for the case of a typical Ethernet-based LAN is exemplified in Fig. 6. This is the case with the lowest node load, L = 1. In the symmetrical case
J. Dujmović et al.
WS	2	WS	.... n	WS
WS: Workstation
Fig. 3: General Network Configuration
Network	....	Network	V o y			
File Server		File Server	1	Host A		Host B
			1 1		------->»	
Client
Host A
Cyclic Data Transfer
■\Host J B
Server
Host A
Host B
Symbolic notation

Host A
(
Cyclic Data Transfer
Host ) B
Symbolic notation
Fig. 4: Symbolic notation of Simplex & Duplex Communication
©00
WS
WS
-T"
I
C: Client
S: Server	\	,
WS: Workstation "A i ^
WS	
f / f	
	
	Disk
	
Fig. 5: Unbalanced DCSP workload to model workstations with a file server
Network File Server WS: Workstation
WS
WS
WS
WS
WS
WS
Fig. 6: Pairs Structure of the DCSP Workload
Network File Server WS: Workstation
WS
T
WS
TT
WS
WS —1— 1
X
WS -T—
X
WS
Network File Server WS: Workstation
WS
T
1\
I/ \ V
' '.'s-
WS


WS
-I- -y- - r

X
WS
A V		V
WS	<r------e^l	WS
Fig. 7: Ring Structure of the DCSP Workload
Fig. 8: Combinatorial DCSP Workload
the communicating nodes remain the same but the bidirectional communication is used, yielding L = 2. In the pairs structure each node communicates with only one node in the network.
An increased level of communication and node load can be achieved by using the ring structure of network workload where each node always communicates with two adjacent nodes. The unidirectional ring structure is exemplified as a LAN shown in Fig. 7. In such a case each node has a client and a server process, yielding the node load L = 2. However, the node load can be easily doubled by using the bidirectional communication. In such a case we have two clients and two servers per each node, and consequently L — A.
In cases where a very high level of load is needed it is convenient to use the combinatorial structure of network workload exemphfied in Fig. 8. This workload always uses bidirectional communication. Each node acts as both the client and the server for the remaining n — 1 nodes. Consequently, each node must support L = 2(n - 1) processes.
For all presented structures of network workload the network traffic intensity can be adjusted by selecting (1) the number of nodes, (2) the unidirectional/bidirectional communication, (3) the location of data, (4) the average wait time, and (5) the size of transmitted data. These adjustable parameters provide for high flexibility in selecting a suitable network workload. In cases where the node load is low it is possible to use a low or zero wait time. However, in cases of high (and especially combinatorial) node load, the nonzero wait time is the primary factor for adjusting the level of network workload.
The DCSP workload enables the measurement of the file access overhead in the way illustrated in Fig. 9. This figure shows the main hardware and software components of each node and the way the DCSP benchmark controls the location of data used in the cychc data transfer. If the data are located in the application buffer (which is in the main memory), then the file management part of Unix will not be active, and the system part of the CPU time will mainly reflect the activities of TCP/IP. However, if we store data on local disks, then the corresponding system part of the CPU time increase reflects the file management overhead. In addition, the total round-trip time
also increases, due to disk activities performed by both the client node and the server node.
4 A Standard Classification of Network Measurements
Let us consider a network having n identical nodes and optional network file servers connected by a transport medium, as shown in Fig. 3. We assume that individual nodes can be diskless workstations, or workstations or servers having local disk units.
Similar to the classification of queuing models using letter codes separated by slashes, we propose a standard classification of network measurements in the following form:
W/D/C/n : M/Q/T^/L
where W denotes the organization of workload, D denotes the location of data, C is the type of communication between nodes, and n denotes the number of active nodes in the network (with or without the NFS nodes). In the case of the DCSP benchmark the part of the description following the colon {M/Q/Tw/L) defines more detailed workload parameters (number of quanta, quantum size in kilobytes, the average wait time in seconds, and the average node load), and in some cases (where it has an assumed value) it can be omitted. Following is the list of main options:
1.	Workload structure (W) options:
P = client/server pairs R = ring structure C = combinatorial structure S = star structure
2.	Data location options {D):
L = local disk (workstation) E = external disk (network file server) M = data generated and stored in the local main memory
3.	Communication between pairs of nodes (C):
A = asymmetrical (unidirectional communication)
S = symmetrical (bidirectional communication)
In a trivial case all communicating client and server processes can be located on the same
node; however, we normally assume n > 1. If n = 2 then the only interesting workload is W = P. Generally, the case W = P assumes that n is an even number, and that the workload is balanced. If n = 3 then there is no difference between the cases W = R and W = C\ these cases differ only if n > 3. Therefore, the difference between the basic workloads (P,R,C,S) becomes visible only if n > 4. For example, if n = 4 then the structure of four basic workloads is
K
W = P
W = R
W = C
W = S
The proposed workload classification technique yields a spectrum of well defined network workloads: if n > 3 then for any number of active (communicating) nodes there are 4 • 3 • 2 = 24 different W/D/C types of workload. Users of the DCS? benchmark can easily select and specify a desired type of workload. For example, the notation R/L/S/Q denotes that the corresponding network performance measurement is reahzed using the ring workload structure, data located on local disks of each workstation, and symmetrical communication where each of the six workstations includes two client and two server processes. So, the R/L/S/6 case creates a network workload of ni = 6 • 4 = 24 processes.
5 Network Performance Measurements Using the DCSP Benchmark
To verify our approach to network workload design and network benchmarking, we performed three groups of experiments. The first group of experiments was performed to verify the use of the DCSP benchmark in performance comparison of three different LAN configurations. These experiments show user-level measurements of selected LAN performance indicators. The measured indicators can be used as inputs in the process of comparing competitive systems and selection of the most convenient option. The second group of experiments investigate the behavior of LAN's when the DCSP workload increases, causing network saturation phenomena. The third group of
experiments illustrate the use of the DCSP benchmark in WAN performance measurement.
5.1 LAN performance compćirison
In this experiment we used three local area network configurations: (1) DEC network based on DECstation 5000/20, (2) HP network based on HP 9000/834, and (3) SUN network based on SUN SPARCstation IPC workstations. In our experiments the workstations were connected by Ethernet, and the DCSP benchmark was the only activity in the network. The network performance indicators for two elementary pairs and ring structures with zero wait time are presented in Table 1, where all times are expressed in seconds. To create a high intensity of network activity we applied a zero wait time, which causes the equahty of the round-trip time and the cycle time. The purpose of the experiment is to exemplify the expected values of performance indicators and to show how the proposed network benchmarking process can contribute to the comparison of various networks.
The most important results shown in Table 1 are the average round-trip time {R) and the average processor utiHzation per process (?7i). The workload is balanced and the obtained value of R is approximately the same for all nodes and for all client and server processes in the network. Therefore, we can define the global network throughput, X, as the total number of round-trips that the analyzed network performs in a time unit (usually a second):
nL
Similarly, we can define the average processor utilization at each node, U, as follows:
V = LTJI .
For example, in the presented R/L/A/3 case we have L = 2, n = 3, and the resulting performance indicators for DEC, HP, and SUN networks are
X =1.75 sec-i {DEC) = 0.77 sec-^ {HP) = 1.20 sec-i {SUN) .
These throughputs show differences in performance that can be used for a global comparison of the presented networks. For all three networks the processor utiHzation at each node is approxi-
Table 1: Performance Indicators for DEC, HP, and SUN Networks
NETWORK WORKLOAD P/L/A/2 : 10/4/0/1		Ts	Tr	T^	R	Tc	T user	Tsys	Tp	Üi[%]
DECstation 5000/20	Client	0.22	0.90	0	1.12	1.12	0.48	0.10	0.58	51.60
	Server	0.23	0.90	0	1.12	1.12	0.48	0.10	0.58	51.81
HP 9000/834	Client	L07	1.50	0	2.56	2.57	1.18	0.10	1.28	49.80
	Server	1.06	1.51	0	2.57	2.57	1.19	0.09	1.28	49.70
SUN Sparc IPC	Client	0.68	1.02	0	1.70	1.70	0.80	0.06	0.86	50.27
	Server	0.76	0.94	0	1.70	1.70	0.74	0.06	0.80	47.13
NETWORK WORKLOAD P/L/S/2 : 10/4/0/2		T.		T^	R	Tc	T J- user	Tsys	Tp	Ü1 %
DECstation 5000/20	Client	0.34	1.81	0	2.15	2.16	0.48	0.11	0.58	27.07
	Server	0.35	1.81	0	2.15	2.16	0.48	0.10	0.58	26.93
HP 9000/834	Client	1.94	2.99	0	4.93	4.94	1.18	0.11	1.29	26.15
	Server	1.86	3.07	0	4.93	4.94	1.18	0.10	1.28	25.87
SUN Sparc IPC	Client	1.21	1.89	0	3.09	3.10	0.79	0.07	0.86	27.85
	Server	1.36	1.73	0	3.09	3.09	0.75	0.06	0.81	26.24
NETWORK WORKLOAD P/M/S/2 : 10/4/0/2		Ts	Tr	T^	R	Tc	Tuser	Tsys	Tp	Č7l[%
DECstation 5000/20	Client	0.07	0.12	0	0.18	0.19	0.00	0.07	0.07	39.78
	Server	0.07	0.12	0	0.19	0.19	0.00	0.07	0.07	37.15
HP 9000/834	Client	0.20	0.26	0	0.46	0.46	0.00	0.08	0.08	17.82
	Server	0.19	0.27	0	0.46	0.46	0.00	0.08	0.08	17.89
SUN Sparc IPC	Client	0.07	0.12	0	0.19	0.20	0.00	0.05	0.05	29.31
	Server	0.07	0.12	0	0.19	0.20	0.00	0.05	0.05	28.88
NETWORK WORKLOAD R/L/A/3 : 10/4/0/2		Ts	Tr	T^	R	Tc	Tuser	Tsys	Tp	Üi[%]
DECstation 5000/20	Client	0.24	1.48	0	1.72	1.72	0.48	0.11	0.59	34.17
	Server	0.26	1.45	0	1.71	1.72	0.48	0.10	0.58	34.36
HP 9000/834	Client	1.47	2.44	0	3.91	3.92	1.18	0.12	1.30	33.24
	Server	1.26	2.65	0	3.91	3.91	1.18	0.12	1.30	33.28
SUN Sparc IPC	Client	0.95	1.56	0	2.50	2.50	0.80	0.07	0.86	34.53
	Server	1.04	1.46	0	2.50	2.51	0.75	0.07	0.81	32.63
NETWORK WORKLOAD R/L/S/3 : 10/4/0/4		Ts	Tr	Tu,	R	Tc	Tuser	T sys	Tp	Üi %
DECstation 5000/20	Client	0.35	2.82	0	3.16	3.16	0.48	0.11	0.59	18.78
	Server	0.43	2.73	0	3.16	3.16	0.48	0.11	0.59	19.10
HP 9000/834	Client	2.89	4.76	0	7.65	7.65	1.19	0.13	1.32	17.22
	Server	2.47	5.18	0	7.65	7.65	1.19	0.14	1.33	17.37
SUN Sparc IPC	Client	1.84	2.41	0	4.25	4.25	0.80	0.07	0.86	20.47
	Server	1.29	2.81	0	4.10	4.11	0.75	0.07	0.82	20.65
mately 67%. Other experimental results presented in Table 1 can be summarized as follows:
•	Client and server processes behave in a consistent way and generate identical results within the range of inevitable measurement errors.
•	In all cases the recorded send time is less than the receive time, reflecting the way in which send and receive calls are implemented in Unix.
•	In all cases where the data are located on disks, the measured user's CPU time is substantially larger than the system's CPU time, regardless of the fact that the majority of processing consists of executing system's programs. This reflects the way in which Unix accounts for user and system CPU times.
•	In cases where the data is located in memory, the user part of the CPU time becomes negligible, and the round-trip times substantially reduce with respect to the cases which include disk accesses. These results show that the majority of the CPU time is spent supporting disk file access operations. In addition, our other measurements indicate that disk operations are frequently responsible for creating a bottleneck and causing saturation phenomena in multinode networks.
•	In the case of paired workload (P/L/A/2 and P/L/S/2), the average processor utihza-tion per single process approximately satisfies the relation Ui = l/nL. The total processor utihzation for a node in the network is TJ = LUi = 1/n.
•	In the case of the ring workload (R/L/A/3 and R/L/S/3), the average processor utihzation per single process approximately satisfies the relation Ui = 2/nL. This kind of result can appear if the communicating client and server processes make the corresponding processors busy during the data transfer. Then, at any time, 2 out of n processors are busy, and n — 2 processors are idle waiting for a chance to communicate. In such cases the average processor utilization for each workstation is U = LUi = 2/n.
•	Symmetrical workloads in the case of the zero wait time approximately satisfy
^ l{sym) ~ ^l(as2/m)/2 j U gym — U asym ■
•	The comparison of network performance can be based on average round-trip times. The resulting ratios (the ratio to the best system whose performance is denoted 1) are:
DEC : SUN : - 1 : 1.5 : 2.3 (P/L/A/2) DEC : SUN : iTP = 1 : 1.4 : 2.3 (P/L/Sß) DEC : SUN : HP = 1 : 1.5 : 2.3 (R/L/A/S) DEC : SUN : HP = 1 : 1.3 : 2.4 {R/L/S/3)
These results are consistent and illustrate the use of the DCSP benchmark for system evaluation and comparison. Of course, in this case we are primarily interested in workloads which include disk activities. The role of disk performance is also visible from the fact that the corresponding ratios in the P/M/S/2 case are DEC : SUN : HP = 1 : 1 : 2.5 .
The goal of the presented analysis was to exem-phfy the applicability of the proposed DCSP workload, and not to compare competitive products. The availability of DEC, HP, and SUN networks was the primary reason for their comparison. It is easy to note that these networks are not sized as competitive products.
5.2 LAN Performance for Various Numbers of Nodes
This experiment was performed to investigate the use of the DCSP benchmark in the study of LAN saturation phenomena. LAN performance measurements were performed for two local networks of DECstations: the first with 2100's and the second with 5000/20's. We used three workloads differing only in data location: R/E/S/n:10/4/0/4, R/L/S/n:10/4/0/4, and R/M/S/n:10/4/0/4. Since L=4 the global network throughput (expressed in round-trips per second) is X — 2n/R.
In the case of DECstation 2100 LAN the average round-trip times R{n) and the corresponding throughputs X(n) for 2 < n < 7 are shown in Figures 10 and 11. The cases which use data on local disks and data on external (NFS) disks differ
Round-trip Times
Fig. 9: Measurement of File Access Overhead
t/5
T3 C
O
IZ)
O
Data on Local Disks
		
		
		
		
		
	Data in Main Memory	
1 2 3 4 5 6 7 8 Number of nodes in the ring
Fig. 10: Average round-trip times for DEC 2100 LAN running DCSP R/E/S/n, R/L/S/n, and R/M/S/n workloads
Network Throughput
12
^ .0
00 Ui
B.
a S
•a a
3
o OS
	1 1			1	1 1 -Ó	
			Dal	a in Mai	n Memory	
						
						
						
						
	Datao 1	n NFS Disks__^—— 1 1 1			1	1
Round-trip Time and Throughput
3.5 -1-1-1-1-
3 O.
M 3 2 S
T3
s
H ■a
S
2.5
1.5
ž
0.5
		
Noi	"malized Time/	
		
		
		put
	Normalized Throug	
		
1 2 3 4 5 6 7
Number of nodes in the ring
2	3	4	5	6
Number of nodes in the ring
Fig. 11: Global network throughput for DEC 2100 LAN running DCSP R/E/S/n, and R/M/S/n workloads
Fig. 12: Normalized round-trip times and throughputs for DEC 5000/20 LAN and DCSP R/E/S/n workload
insignificantly. This may support the contention that NFS caches disk files at local disks, supporting disk file I/O from local disk storage rather than from NFS via the network. In addition, this ehminates disk queuing effects on the NFS disk. Then the relatively slow local disks of these workstations become the bottleneck of the network. Without disk queuing, and far from other possible bottlenecks (e.g. those related to the transport medium), the round-trip times are almost constant, there is a moderate overlap in workstation activities and the global network throughput is an increasing function of n that shows no saturation.
The effects of slow disks become apparent if we disconnect disks and store data in the workstation memory (see Fig. 9). The resulting effect is a dramatic reduction of the round-trip time (more than 10 times for a small number of nodes). Now the network has a different bottleneck and the saturation effects are clearly visible in Fig. 11. The network with 5 nodes is already fully saturated; its maximum global throughput (X = 12.25 round-trips per second) can be used as a valuable global performance indicator.
In the case of the LAN with DEC 5000/20 workstations the results of the DCSP benchmark are summarized in Fig. 12. The presented results are normaHzed so that the unit values correspond to the initial case where n = 2. The round-trip time linearly increases, and an increasing overhead causes the global network throughput to be a decreasing function of n. The processor speed of DEC 5000/20 is greater than in the previous case of DEC 2100. Therefore, this type of saturation for R/EyS/n workload could be caused by the NFS disk queuing.
All presented measurements were performed using T^ = 0. Using T^; > 0 we can create additional opportunities for overlapped (simultaneous) usage of network resources yielding an increase in the number of workstations working without saturation.
5.3 WAN Performance Measurements
The DCSP workload can be used to get a userlevel insight into some global aspects of WAN performance. Our experiment was based on the P/E/S/2:10/4/0/2 workload installed on two HP 9000 workstations communicating using the Internet. One workstation was located in Worcester,
MA, and the other was in San Francisco, CA. The organization of WAN measurement is similar to LAN measurement based on the DCSP workload. Of course, the measured performance indicators are now substantially affected by transport delays between remote locations. Two typical distributions of the round-trip time, similar to the Poisson distribution, are shown in Fig. 13.
The round-trip central processor times (both the user part and the total time) are distributed similarly to the normal distribution with a very small standard deviation, and with practically constant system CPU time, as shown in Fig. 14. The shape of all presented distributions is similar to the LAN case. However, in the case of the round-trip time both the average value and the standard deviation are greater than in the LAN case.
As expected, the WAN traffic load affects the round-trip time distribution. The distributions presented in Fig. 13 show the difference between two three-hour experiments performed during the day (2-5 p.m. EST) and night (0-3 a.m. EST). The day experiment included 1122 round-trips yielding Rday = 8.4 seconds and standard deviation (Tday = 3.7 seconds. The night experiment included 1817 round-trips yielding shorter round-trip times, Rnight — 5.9 seconds, and substantially smaller standard deviation dnight = 1-95 seconds. The same workstations communicating in isolated LAN environment attain the maximum performance limit R = 4.9 seconds and negligible standard deviation a = 0.08 seconds. Therefore, this measurement shows a rather good performance of Internet communications, including low transport delays, and relatively small performance variations between the periods of high and low load. Of course, it is possible to find examples with different WAN performance results; our goal was not to provide a detailed analysis of WAN performance, but to show that relevant WAN performance indicators can be obtained from user-level measurements based on the DCSP workload.
The presented results can be compared to the ping round-trip times for the same pair of workstations. The ping measurement results (for day time conditions) are shown in Fig. 15. Not surprisingly, the ping times are substantially shorter than the corresponding DCSP times (ping results are hmited to the transfer of 4 KB, but even 10
Round-trip Time Distribution
	0.4
	0.35
	0.3
	
	
C u	0.25
Q	
^	0.2
1	0.15
XI	
ž	0.1
	0.05
	0
1 1		1
	: \ Nighl (0-3 a.m. EST)	
		
		
		
	/ ; V D^iy p n	.EST)
m	i_1_1 ^^ 1 ^	—n 0 ° òH -
5	10	15
Round-trip Time in Seconds
20
Fig. 13: Typical WAN round-trip time distributions for day and night conditions (DCSP P/E/S/2 workload)
CPU Time Distribution
	0.9
	0.8
	
o C	0.7
u	
3 CT	0.6
a	
u-	0.5
T3	
• ü	0.4
."rt	0.3
O	0.2
z	
	0.1
	i 1 ' n' ' ' /t ' '	
	User CPU Time i V : ^ : 1 User + System Time	
	j	
	1	
	t	
	■	
	1	
	/ ^	
		
1 InT 1 1		1_riM 1_1_1_1_
1.05 I.I 1.15 1.2 1.25 1.3 1.35 1.4
Round-trip CPU Time in Seconds
1.45 1.5
Fig. 14: Typical distributions of WAN round-trip processor time (DCSP P/E/S/2 workload)
Ping Round-trip Times n-1-1-1-1-1-r
Fig. 15: Ping round-trip times between San Franscisco, CA, and Worcester, MA
1000 1500 2000 2500 3000
Packet size in bytes
such transfers can be completed in less than 2 seconds). This shows that the ping usage of network resources does not reflect and cannot represent the type of user-level workload which is characterized and generated by the DCSP benchmark.
6 Conclusions, Current Work, and Future Work
We have presented a simple black-box approach to benchmarking computer networks with a scalable network workload. Our goal was to provide a methodology for the user-level network performance measurement suitable for evaluation, comparison, and selection of computer networks. In addition, our workload is suitable for measurements of saturation phenomena in local networks, and for userlevel WAN measurements.
Using a suitable number and various characteristic distributions of communicating cHent/server pairs we can organize a number of specific balanced and unbalanced scalable network workloads. These workloads are suitable for measuring, at each node, the following nine performance indicators: (1) send time, (2) receive time, (3) wait time, (4) round-trip time, (5) cycle time, (6) user CPU time, (7) system CPU time, (8) total CPU time, and (9) processor utilization. All indicators are measured using basic facilities available in C and Unix, without speciaHzed performance monitoring tools. All client and server processes run at the user level and all measurements can be performed without support and/or consent of system programmers and administrators. Depending on the objectives of performance analysis, the DCSP workload can be executed either alone, or in addition to a real network workload.
We have also proposed a simple technique for standardized classification of network performance measurements based on specifying the type of network workload, the location of data for cyclic data transfer, the type of communication between nodes, and the number of nodes in the analyzed network. This classification can be modified and/or extended, and it was found very practical for organizing and controlling a large number of results generated by DCSP network benchmarks.
We verified the validity of our approach with measurements on local area network configurati-
ons consisting of HP, SUN, and DEC workstations. Our measurements show that the DCSP workload can provide user-oriented and user-level insight into the overall network performance and can be used as a means for comparing alternative network solutions, and making justifiable capacity planning and equipment procurement decisions. In addition, the presented experiments show that the DCSP workload can be used for identifying a variety of different dynamic behaviors of existing networks. In particular, by combining results obtained for various DCSP workloads which use data located in memory and on local and NFS disks, it is possible to identify and control both the overhead generated by Unix, and various LAN saturation phenomena.
In cases of networks with relatively large number of nodes it is necessary to have tools for automatic generation of the DCSP workload. In addition, we sometimes need to.measure more detailed and/or more precise performance indicators than those that can be generated at the simple C/Unix user level. These requirements define the basic direction for the future work. This work is directed towards a network performance measurement environment which integrates three major components: (1) a DCSP network workload generator, (2) a network performance monitor, and (3) models for performance analysis, performance prediction, and capacity planning.
Our current work is focused on the development of a control module for the DCSP workload. This module generates communicating client/server pairs and distributes them on active nodes to create a desired structure of the network workload. In addition, it specifies all parameters of performance measurement, starts the measurement, and supervises the collection of measured data. At the end of measurement, it transfers and integrates all files containing measured results and creates a data base that is structured according to requirements of performance analysis modules.
The network performance monitor is a module speciaHzed for generating specific measured data both for the DCSP workload and for the real network workload that can run concurrently with the DCSP workload. It performs traditional on-hne monitoring of network performance indicators, enabhng the analyst to trace dynamic phenomena, to experiment, and to adjust the para-
meters of the network hardware, software, and workload. In addition, the monitor also generates control data necessary for the control module. For example, the collecting of DCSP data can be conditioned in a variety of ways; in such cases, it should be temporarily suspended if the conditions are not satisfied, and automatically restarted when the conditions are satisfied.
Acknowledgment
The authors would like to thank Mr. Rufus J. Freddie for his participation in LAN measurements.
References
[1]	Bershad, B.N., T.E. Anderson, E.D. La-zowska, and H.M. Levy, Lightweight Remote Procedure Call. ACM TOCS, 8(1), 1990, pp. 37-55.
[2]	Cabrera, L., E. Hunter, M.J. Karels, and D.A. Mosher, User-Process Communication Performance in Networks of Computers. IEEE TSE, Vol 14, No. 1 (1988) pp. 38-53.
[3]	Dixit, K. and J. Reilly, SPEC Developing New Component Benchmark Suites. SPEC Newsletter, Vol. 3, No. 4, pp. 14-17, Dec. 1991.
[4]	Dujmović, J.J., Workload Characterization, Benchmarking, and the Concept of Total Resources Consumption. Proceedings of the Second International Conference on Computer Capacity Management (H.L. Börding and D.J. Schumacher, Ed.), San Francisco, April 8-10, 1980, pp. 151-163.
[5]	Ferrari, D., G. Serazzi, and A. Zeigner, Measurement and Tuning of Computer Systems. Prentice-Hall, 1983.
[6]	Funk, B.K., SPEC - The Changing Computer Benchmark Consortium. SPEC Newsletter Vol. 3, Issue 3, pp. 3-5, September 1991.
[7]	Graphics Performance Characterization Committee, The Effort to Standardize Performance Measurement. GPC Quarterly Report, Vol. 2, No. 4, 1992, pp. 10-12.
[8]	Hennessy, J.L. and D.A. Patterson, Computer Architecture A Quantitative Approach. Morgan Kaufmann, 1990.
[9]	Hewlett-Packard Company, Netperf: A Network Performance Benchmark. Revision 1.7. Information Networks Division, Hewlett-Packard Company, March 5, 1993, pp. 1-19.
[10]	Jain, R, The Art of Computer Systems Performance Analysis. John Wiley, 1991.
[11]	Kay, J., and J. Pasquale, The Importance of Non-Data Touching Processing Overheads in TCP/IP. Computer Communication Review Vol. 23, No. 4 (1993) pp. 295-268.
[12]	Kinicki, R.E. and D. Finkel, Comparison of Mach Distributed Performance Using the WPI Benchmark Suite. Proceedings of the 26th Hawaii International Conference on Systems Sciences, Vol. 2, 1993, pp. 40-49.
[13]	Kohler, W., How to Improve OLTP Performance and Price/Performance. TPC Quarterly Report, Oct. 15, 1992, pp. 1-9.
[14]	Mathis, M, Windowed ping: an IP layer performance diagnostic. Computer Networks and ISDN Systems 27 (1994) 449-495.
[15]	Papadopoulos, C., and G. M. Parulkar, Experimental Evaluation of SUNOS IPC and TCP/IP Protocol Implementation. IEEE/ACM Transactions on Networking, Vol. 1, No. 2 (1993) pp. 199-216.
[16]	Van Renesse, R., H. Van Staveren, and A.S. Tanenbaum, The performance of the Amoeba Distributed Operating System. Software -Practice and Experience 19(3), 1989, pp. 223234.
[17]	Thekkath, C.A. and H.M. Levy, Limits to Low-Latency Communication High-Speed Networks. ACM TOCS, 11(2), 1993, pp. 178-203.
[18]	Watson, A. and B. Nelson, LADDIS: A Multi- Vendor and Vendor-Neutral SPEC NFS Benchmark. 1992 LISA VI - October 19-23, 1992, Long Beach, CA, pp. 17-32.
Petri Nets and Self-Stabilization of Communication Protocols
Wuxu Peng^
Department of Computer Science Southwest Texas State University San Marcos, TX 78666, U.S.A. AND
Kia Makki
Department of Computer Science University of Nevada Las Vegas Las Vegas, Nevada 89154-4019 U.S.A.
Keywords: Petri nets, self stabilization, communication protocols Edited by: Xindong Wu
Received: June 18, 1995	Revised: September 19, 1995 Accepted: September 29, 1995
Using Petri nets as the concurrent model, we discuss the issue of self-stabilizing communication protocols in this paper. We argue that (1) self-stabilizing extensions of communication protocols should not interfere with their normal operations by giving a new deßnition of self-stabilizing extension and a ring net that meets the new definition, and (2) communication protocols are usually delay sensitive and time-Petri nets is a more suitable formal model (than ordinary Petri nets) to express self-stabilizing property. We also explore the possibility and suitability of using Petri nets with a new type of transitions - the max transitions - to express self-stabilizing communication protocols. Finally, to facilitate practical implementations, two types of interferences - external and internal interferences - are formally identified and discussed.
1 Introduction	ditional approach has severe problems, however.
The distributed nature of distributed systems and
Communication protocols are an integrated and	computer networks makes such systems vulnera-
critical part of distributed systems and computer	ble to events absent in centraHzed systems. For
networks. Concisely specifying and efficiently ve-	instance, the communication channels linking ma-
rifying communication protocols are always major	chines may break or suffer from outside interfe-
challenges to network designers.	rence. Such a problem may cause either loss or
The traditional approach to verifying communi-	corruption of messages. A corrupted message that
cation protocols is similar to the well-known we-	passes the normal error detection mechanism may
akest pre-condition and strongest post-condition	cause the system to enter an illegal state. A power
technique employed in verifying sequential pro-	surge in one machine may cause this machine to
grams. For example, to verify that a protocol is	mal-function transiently and hence send conflict
free of deadlocks, we may show that if the pro-	messages to different machines in the system. We
tocol starts from a legal state then it can never	observe that in both these two cases, a little per-
be in an illegal deadlock state, i.e. a state in	turbation can cause a system to stray from its
which several machines are waiting for messages	legal states and remain there thereafter. To bring
from one another in order to proceed. This tra-	the system back to its legal states, drastic measures such as shutting down and rebooting the
^The first author is supported in part by a faculty rese- System have tO be taken, arch enhancement grant from Southwest Texas State University.	The concept of self-stabihzation of computer
systems was first proposed by E. W. Dijkstra in his seminal paper [5]. Informally, we say that a computer system is s elf-stabilizing if and only if the following two conditions are satisfied:
(1)	if the system is in a legal state, it will remain in the set of legal states thereafter provided there is no perturbation;
(2)	if some perturbation causes the system to enter an illegal state, then the system can return to a legal state within a finite amount of time without outside interference.
From the definition, it is clear that a self-stabilizing system is much more robust than a system that is merely logically correct. Self-stabilizing systems are faults-resilient in the sense that they can recover from systern faults and failures by themselves. For example, when a computer system is rebooted, it is very important to ensure that all system parameters are properly initialized. For a self-stabilizing system, however, we can just turn on the power without concern about its initial state. The self-stabilization property guarantees that the system will eventually enter a legal state within a finite amount of time and remain there thereafter.
Although the concept of self-stabihzation was proposed nearly twenty years ago, it did not receive its deserved attention until the middle of 1980s. After Dijkstra's paper [5], Kruijer in [14] considered the issue of self-stabiHzation in tree-structured systems with distributed control. Bastani, Yen and Chen considered the self-stabilization problem of a class of distributed programs for industrial process-control systems in [1]. Brown, Gouda, and Wu considered the self-stabilization problem for token ring systems in [2]. The behavior of each machine in the ring is characterized by a regular expression and three non-interfering and delay-insensitive self-stabihzing protocols were presented. Gouda and Multari examined the issue of designing self-stabihzing communication protocols in [9]. The most important conclusion from that paper is the observation that the existence of both timeout and infinite number of states is a necessary condition for a communication protocol to be self-stabihzing. Burns and Pachi showed in [3] a uniform self-stabilizing solution (i.e. each machine in the ring
executes the same code) for unidirectional rings where the number of machines is prime.
Research on the issue of self-stabilization of Petri nets only started in the late of 1980s. Cherka-sova, Howell, and Rosier considered the issue of self-stabilizing bounded Petri nets in [4]. Their definition of self-stabilization is based on reachability sets. Ghosh discussed in [8] the issue of self-stabihzing a special class of Petri net models - the marked graphs. His definition is based on sequences of firing transitions and self-stabilization is achieved by using inhibitor arcs.
In this paper we consider the issue of self-stabihzation of communication protocols expressed in Petri nets. We first argue that self-stabihzing extension of any communication protocol should not interfere with the normal operation of the protocol. This claim is supported by a new definition of self-stabilizing extension and illustrated by a new self-stabihzing extension of a ring net from [8].
Timeout is a fundamental technique used in communication protocols. For instance, in a token ring, timeout is used to detect loss of tokens. In shding window protocols, timeout is used to retransmit previously sent but not yet acknowledged messages. As pointed out in [9], timeout is a necessary condition for communication protocols to achieve self-stabilization. The need of timeout in communication protocols can be attributed to the distributed nature of distributed systems and computer networks. Unreliable communication channels may lose or corrupt messages. Individual machines can crash due to various hardware and/or software failures. The need of timeout indicates that communication protocols are delay sensitive, not delay insensitive. For example, long delays in communications are normally treated as errors because most users usually cannot tolerate an excessively long delay. Therefore delay insensitive protocols, such as the three elegant protocols in [2], may not be suitable for many practical applications.
One elegant model for delay sensitive communication protocols is Time-Petri nets (TPNs), introduced in [15]. Informally, a TPN is just an ordinary Petri net with additional restrictions on the timing of firing of transitions. We argue that TPNs are more suitable (than Petri nets) to express self-stabihzing property of communi-
cation protocols. We show that a time-sensitive token ring can be extended to be self-stabilizing using a TPN that is operating at half of the speed of the original net. The convergence rate (the rate at which a disturbed system returns to a legal state) of the extended net can be accurately bounded.
We also explore the possibility of using max transitions to express self-stabilizing communication protocols. The notion of max transitions was originally proposed by Ghodsi and Kant in [7] to model the abort semantics in performance measurement. We show that, with a modified definition, max transitions can often give a more compact representation of self-stabiUzing protocols. Two examples, one is the ring net and another is the sliding window protocol, are given to illustrate the point.
. We strongly believe that practical implementation consideration should be a critical factor in evaluating possible self-stabilizing extensions of communication protocols. A self-stabilizing extension, no matter how elegant theoretically, has little value if it could not be practically implemented. Out of this consideration, we formally identify two types of interferences - external and internal interferences - which should be avoided in self-stabilizing protocols.
The remainder of this paper is organized as follows. Section 2 provides necessary definitions and concepts. In particular, we define the concept of self-stabilizing extension of a Petri net. We believe that our definition is innovative. Section 3 Illustrates our self-stabilizing definition through examples. Section 4 discusses, out of practical considerations, how to construct self-stabihzing extension of communication protocols through decentralized control. In Section 5 we-discuss, through two examples, how to use max transitions to express self-stabilization. The modeling power of Petri nets with max transitions is briefly discussed. Section 6 formally defines and discusses two types of interferences - external and internal interferences. Section 7 provides concluding remarks and future research topics.
2 Basic definitions
In this section we define most of the concepts used in this research, including Petri nets, extended
Petri nets, max transitions and generalized Petri nets, self-stabilizing extensions of Petri nets. The definition of TPNs is delayed until Section 4.
Definition 2.1 (Petri nets) A Petri net is a four-tuple (P,T,/,0), where P = {pi, • • • is a finite set of places; T = {ti, - ■ ■ , tm} is a finite set of transitions satisfying	I : T ^
is the input function; and O : T	is the
output function, where P°° is the multinet over P.
Let N denote the set of nonnegative integers. A marking (x is an assignment of tokens to places of a Petri net, i.e. is a function from P to N:
II : P^N.
A marked Petri net is a five-tuple {P,T,I,0,tJ.), where C = {P,T J, O) is a Petri net and /i is a marking of C. A transition t in a marked Petri net with marking ß is enabled if Vpi G I{t) ß[pi) > 0. A transition t enabled in a marking ß may fire and result in a new marking ß' by first removing one token from each of its input places and then adding one token to each of its output places, i.e. ß' = ß — I{t)-\-0{t).
Definition 4 {Extended Petri nets) An extended Petri net is a five-tuple {P,T,I,0,I'), where {P,T,I,0) is a Petri net, and I' : P ^T is the set of inhibitor arcs.
Let (P,T, I, O, I' ,ß) be a marked extended net. A transition t £ T is enabled (with respect to ß) if Vp G I(t) ß{p) > 0 and Vp € I'{t) ßip) = 0. As in the Petri net case, the firing of t will result in a new marking ß' = ß — I (t) -f- 0{t). Notice that if 7' = 0, then the extended net is an ordinary Petri net.
Petri nets) is a seven-tuple
Definition 5 {Generalized A generalized Petri net {P,T,I,0,I',M,S), where {P,T,I,0,I') is an extended net, M CT is the set of max transitions, and S : 'T ^ , where Vp G S {t) there exists a path leading from place p to transition t, is the input set of max transitions. The enabling and firing of non-max transitions are the same as in extended net. A max transition t is enabled in a marking ß if 3p £ I{t) ß{p) > 0. An enabled max transition t may fire by first
W. Peng et al.
removing one token from each place in I(t), then removing all tokens from each place in the input set S(t), then adding one token to each place in 0{t).
Following the notation used in [7], a max transition t will be denoted by a rectangle with an associated copyright sign Notice that our definition here is slightly different from the original definition in [7].
For a marked net with a given initial marking, more than one transition may be enabled and thus it is possible for more than one transition to fire simultaneously. Here we adopt the classic interleaving semantics for firing. The set L of all possible firing sequences starting from a given marking forms a formal language over T [17, 16] (i.e. L CT*, where T* is the Kleene closure of the set T).
Definition 6 A net N2 is a s elf-stabilizing extension of a net Ni (with respect to a given marking) if and only if every firing sequence a generated by N2 from an arbitrary initial marking can be expressed as ßa' satisfying:
(1)	ß e T^ and a' € Tf;
(2)	there exists a finite sequence ß' € T* such that ß'a' is a legal firing sequence in Ni.
Two comments are in order here. First, the above definition does not require the original net Ni to be live or safe. Assume that net A^i is specified by a predicate R. Then by above definition, net N2 is a self-stabilizing extension of Ni as long as starting from any initial marking N2 will sfitisfy R after a finite number of firing steps. Second, the definition implicitly requires that the extension should not interfere with the normal operations of the original protocol. This is reflected in the requirement that ß'a' must be a legal firing sequence in A^i.
Another implication of the above definition is that every terminating net is trivially self-stabilizing extension of itself (taking a' = e). Therefore in this paper we consider net modeling non-terminating communication protocols. As every sequence from T* is finite, for non-terminating protocols Definition 6 has to be changed to use infinite firing sequences over T, notated as T'^, which is the set of all infinite strings over T (cf. [21] for details).
GMhcHhcH
'OH
(a)

01
(b)
'2n-l
Figure 1: A ring net and its two extensions by Ghosh: (a) the ring net; (b) the first extension; (c) the second extension.
3 Self-stabilizing without
interfering normal operations
Ghosh discussed in [8] the issue of self-stabihzing a special class of Petri net models (ring nets and marked graphs). In both cases, inhibitor arcs are used to achieve self-stabilization. Figure 1 shows a ring net and its two self-stabiHzing extensions.
One problem with the two self-stabilizing extensions is that, in both cases, the system explicitly interferes with the operation of the net, even in the absence of operation errors. For instance, in the first extension on Figure l.(b), transition tn drains all tokens in the net and then the absence of tokens in the whole net causes transition to to be enabled and fire. We beheve that in a self-stabihzing extension of any system, the extension should not interfere with the normal operation of the original system and interference will come only after the system enters an illegal state. Incidentally, it is easy to see that both extensions in Figure 1(b) and Figure 1(c) do not satisfy Definition 6. The net in Figure 1(a) can be characterized by the regular expression {tit2 ■ • ■ tn)*■
1 0				
)-1		2		3
			CH	
Pn 'n
Figure 2: An alternative self-stabilizing extension of the ring net
Every normal firing cycle in both nets in Figure 1(b) and Figure 1(c) includes the transition ^o, which is absent in the original net.
A new self-stabiHzing extension of the net in Fig 1(a) is given in Figure 2. As shown by the following theorem, the new extension is guaranteed to operate as usual as long as the system is in legal states and will try to recover from failure as soon as it enters an illegal states.
Theorem 4 The network in Figure 2 is a self-stabilizing extension of the ring net in Figure l.(a).
Proof: Self-stabilization is clearly implied by the following three observations:
Observation 1: if there is no token in the net, then transition ^o can fire and bring the system to a legal state.
Observation 2: assume that there is at least one token in place pi. Transition ti cannot fire if there are tokens in any other places. Tokens in places P3,P4, ■ • ■ ,Pn will eventually reach place Pi-
Observation 3: If there are tokens in both places Pi and p2, then only transition t is enabled. Every firing of t will drain two tokens from the net. If place p2 becomes empty after fs firing but Pi still holds more than one token, firing of ti will move one token from pi to p2- So if initially the total number k of tokens in the net is odd, then after k/2 firings of transition t only one token will be left in place pi. li k is even, then after k/2 firings of transition t the net will be empty.	■
4 Self-stabilizing through decentralized control
In this section we argue and show that time-Petri Nets, introduced in [15], is more suitable to model the self-stabilizing properties of communication protocols.
The main problem with the ring net in Figure 2 is the diflficulty associated with implementing inhibitor arcs that involve non-neighboring places. For instance, checking whether transition to in Figure 2 is enabled practically asks for the verification of the emptiness of all places in the net, while checking if transition ti is enabled demands the verification of the emptiness of places P2iP3r " iPn- In a ring net, such a verification is equivalent to a round of communications for consensus among all the transitions, which is difficult and expensive to implement (considering the fact that such verifications must be performed before every firing of transition ìq or ti).
There are at least two approaches that can be used to cope with the above problem. The first approach is to launch the communications checking for absence of tokens periodically, not before every time to or ti is fired. Although this may reduce the cost of implementing self-stabilization, it also compromises the self-stabilizing property. The second approach is to make use of the time sensitive property of most practical communication protocols.
Timeout is a fundamental technique used in communication protocols. For instance, in a token ring, timeout is used to detect loss of tokens. In sliding window protocols, timeout is used to retransmit previously sent but not yet acknowledged messages. As pointed out in [9], timeout is a necessary condition for communication protocols to achieve self-stabilization. The need of timeout in communication protocols can be attributed to the distributed nature of distributed systems and computer networks. Communication channels are not reliable and messages can be lost or corrupted. Individual machines can crash due to various hardware and/or software failures.
The need of timeout can also be justified by the fact that long delays in communications are normally treated as errors because most users usually will not tolerate an excessively long delay. This points out that communication protocols are de-
pi ti
p2 t2
p3 t3
"CH
pn tn
ul
To illustrate the usefulness of TPNs in expressing delay-sensitive communication protocols, we give a self-stabilizing extension of the ring net, as shown in Figure 3. In that figure, it is assumed that 6 =	and the time parameters for transitions in Figure 3 satisfy the conditions q + q > e, t^ > 0 and t* = t** = 0, and > e+ 11* + q*. The following theorem states that the net in Figure 3 is self-stabilizing.
Figure 3: A self-stabilizing extension of the ring net based on TPN
lay sensitive, not delay insensitive. Therefore delay insensitive protocols, such as the three elegant protocols in [2], may not be suitable for many practical applications.
One elegant model for delay sensitive communication protocols is Time-Petri nets (TPNs), introduced in [15]. Informally, a TPN is just an ordinary Petri net with additional restrictions on the timing of firing of transitions. Formally we have the following definition.
Definition 7 (cf. [15]) A TPN is a PN in which each transition ti is associated with two real numbers t* and t** satisfying^
- tltt* > 0; -t*< c-
The firing rule of a TPN is defined as:
—	If ti has been enabled for a period of time equal to or greater than t^, then ti can fire, using the the same rule for token redistribution as in an ordinary PN.
—	If ti has been enabled for a period of time equal to t**, then ti must fire, using the the same rule for token redistribution as in an ordinary PN.
Intuitively, t* is the minimum time for which transition ti has been continuously enabled before it can fire, while t** is the maximum time for which transition ti has been enabled continuously before it must fire.
^Please note that for historical reeLSons, the notations t* and t** used in this definition are similar to that of the Kleene closure.
Theorem 5 The network in Figure 3 is a self-stabilizing extension of the ring net in Figure l.(a).
Proof: It is clear from the structure of the net that if there is no token in the net, transition t[ will fire after t'* = ö -t- i** -I- ^2* units of time.
Assume that there is more than one token in the net. First we observe that any pair of tokens that are already in place pi and p2 respectively will be removed by the firing of transition t. There are still two cases to consider.
Case 1: there are two or more tokens in places p3,p4, ■•• at some time point. Due to the requirement that t* t2 > 9, one of these tokens must have been in either place pi or p2 when the other tokens arrive at place pi. Therefore the transition t will be enabled after at most 6 time units and the firing of t will remove two tokens from the net. Case 2: there are two or more tokens, one is in place pi (or P2) while the others are in places P3,Pa, - ■ ■ :Pn- If the token in pi or P2 is still in pi or p2 when a token from Pz,- " yPn arrives at pi, these two tokens will enable t and be removed. If the token in pi or p2 is not in Pi or p2 when a token from Pa, - ■ ' iPn arrives at pi, the requirement that i* > Ö ensures that the latter will still be in pi or p2 when the former arrives at pi.
It should be noted that it is not guaranteed that the t'^ will fire only when the net is empty. However, from the above discussion, it should be clear that even if t'l accidentally generates an extra token (this may happen if the existing token has been delayed unexpectedly long by one of the transitions), the net will discard one or both tokens. In the formal case, the system immediately returns to a legal state. In the second case, t'l will regenerate a new token, which again brings the net to a legal state. This completes the
proof.	■
The net in Figure 3 has several distinct properties over the net in Figure 2. First, the only inhibitor arc from place pi to transition tj in Figure 3 connects two neighboring place and transition. This implies that a node need only check the emptiness of its neighboring channel, not all the channels over the net. Second, it shows that self-stabilization of a ring net as shown in Figure 1(a) can be achieved by operating the net at half of the normal operation speed. Third, the convergence rate of of the net in Figure 3 can be easily and accurately estimated. Assume that m tokens were in the net at some time point. It can be easily seen, as from the argument in the proof of above theorem, that after at most {m/2)6 time units, either all the m tokens are drained from the net or only one token is left. In the late case, the net already returns to a legal state. In the former case, transition t'l will regenerate a single token after 9' = 6 + t^* time units, i.e. the net returns to a legal state within 9' -F (m/2)9 time units.
It is also interesting to discuss the convergence rates of the two self-stabiHzing Petri nets in Figure 1(b) and Figure 1(c) with the one in Figure 3. On the surface, the two nets in Figure 1(b) and (c) converge much fast than the net in Figure 3. However, in practice, this may not necessarily be the case. For instance, to implement the n inhibitor arcs in Figure 1(b), transition to must receive some kind of token not present report from each of the n places. This may take a substantial amount of time.
5 Using max transitions
In last section, inhibitor arcs are used to achieve self-stabilization. As pointed by Ghosh, use of inhibitor arcs seems mandatory in order to achieve self-stabihzation. In this section we show that using max transitions together with inhibitor arcs could greatly simphfy the representation of self-stabilizing communication protocols.
Figure 4 shows a net using both max transitions and inhibitor arcs. Transition ti is the only max transition with input set {pi,p2; • • • ,Pn}-
Pi	1 P2 2 pj
Pn 'n
O
Figure 4: A self-stabilizing extension of the ring net using both max transitions and inhibitor arcs
Theorem 6 The network in Figure 4 is a self-stabilizing extension of the ring net in Figure l.(a).
Proof: Similar to the net in Figure 2, the case where the net is empty is handled by transition By the definition of max transitions, the firing of transition ti will remove all tokens from all its input places. Hence every firing of will remove all the tokens in places P1,P2, ■ ' ' iPn-	■
Our second example, as shown in Figure 5, is a self-stabihzing extension of the well known sliding window protocol. This example further illustrates the usefulness of max transitions in specifying and expressing self-stabihzing property. A good introduction to sliding window protocols and their self-stabihzing versions can be found in [20, 9]. In Figure 5, the input sets ^(ta) = {pi,P2,P3,P5}, Situ) = {P7,P8,P4,P6}- The possible error where there are tokens in both places pi and P2 (or p-j and ps) signals errors such as invalid values for variables, which may cause several conditions in a guarded command to incorrectly hold simultaneously, for instance. The max transition ts (similarly for ill) will remove all tokens in places Pi,P2,P3 and p5 before it fires. This is equivalent to reset the various variables to correct and consistent values and clean up the buffers. For simplicity of presentation, the time parameters for this protocol are omitted. Some constraints on the time parameters are obvious. For instance, must be at least equal to tH.
Theorem 7 The sliding window protocol in Figure 5 is s elf-stabilizing.
Proof: As always, the error of total-loss-of-token is handled by the inhibitor arcs. The error of creation of arbitrary tokens is solved through the
wait for ack 1
expect O
discard ack 0
Figure 5: A self-stabilizing extension of sliding window protocol
max transitions ti and t^. If there is more than one token in places pi,..., the firing of transition ti will remove all tokens in the places.	■
It is interesting to point out that Petri nets with max transitions are strictly less powerful than Petri nets with inhibitor arcs. This is evidenced by the fact that the language L*, where L = {a"ò"|n > 0}, cannot be generated by any Petri nets with max transitions. In fact the class of ordinary Petri nets added with max transitions is not closed under the Kleene closure operation. Details about the modehng power of max transitions are being given in a forthcoming paper in [13].
6 External and internal interferences
As evidenced from the previous sections, in-discriminated use of inhibitor arcs and/or max transitions may cause extreme difficulty in implementing self-stabilizing protocols. The concept of "decentraHzed control" used in Section 4 is intended to remedy the problem. We formally propose here the concepts of external and internal interferences, which will allow us to formally quantify the problem.
In general, a component (e.g. a sender, receiver, or a channel) within a communication protocol may be represented by several places and transitions in a Petri net. Let Ci and C2 be two communication components that are not physically hnked. Informally, an external interference occurs when one place (transition, resp.) belonging to Ci is forced to connect to a transition (place, resp.) belonging to C2 by some arc added during the process of self-stabihzing the net. This point has been stated in Section 4, where we argued that the self-stabilizing extension shown in Figure 2 is very difficult and expensive to implement. However, controlled use of inhibitor arcs, i.e. the arcs that don't cause external interference, such the net shown in Figure 3, may minimize the implementation difficulty. Similar arguments hold for the use of max transitions. For instance, although the use of max transitions in Figure 4 gives a more concise representation, implementing the max transition in that figure is quite difficult and expensive. On the other hand, the two max transitions in Figure 5 are quite easy to implement because they involve only places local to the sender or receive process.
A place or transition in a net is said to be internal if it is only connected to transitions or places belonging to the same component. An internal interference occurs when the internal states of one or more components are forced to be connected to the outside during the self-stabilization extension procedure. The first protocol given in [2] is an example of internal interference, where the state change of a process depends on the internal states of one of its two neighbor processes. On the other hand, the sliding window protocol in Figure 4 does not suffer from neither external nor internal interference, because the max transition (ill, resp.) does not refer to any places or transitions in the receiver (sender, resp.) at all. The communication channels there are modeled by four places and none of them is internal.
Both external and internal transitions will cause difficulty in practical implementation, thus should be avoided.
7 Discussions
Using Petri nets as the concurrent model, we have discussed the issue of self-stabihzing communi-
cation protocols. We have argued that (1), self-stabilizing extensions of communication protocols should not interfere with their normal operations, and (2) communica,tion protocols are usually delay sensitive and time-Petri nets is a more suitable formal model (than ordinary Petri nets) to express self-stabilizing property. The possibility and suitability of using Petri nets with max transitions to express self-stabilizing communication protocols have also been explored.
As pointed by Ghosh in [8], the use of inhibitor arcs to represent self-stabilizing Petri nets seems unavoidable. This may raise the question about the tractability and suitability of Petri nets for self-stabihzing communication protocols. We feel that in-discriminated use of inhibitor arcs will at least cause extreme difficulty in implementing self-stabilizing protocols. This point has been stated in Section 4, where we argued that the self-stabilizing extension shown in Figure 2 is very difficult and expensive to implement. However, controlled use of inhibitor arcs, such as the net shown in Figure 3, may minimize the implementation difficulty. Similar arguments hold for the use of max transitions. For instance, although the use of max transitions in Figure 4 gives a compact representation, implementing the max transition ti in that figure is quite difficult and expensive. On the other hand, the two max transitions in Figure 5 are quite easy to implement because they involve only places local to the sender or receive process. The issue of using generalized time-Petri nets to specify and verify communication protocols is worth of further investigation.
It should be pointed out that the notion of re-coverability and recoverable communication protocols proposed by Merlin and Färber in [15] is quite close to (although weaker than) the self-stabihzation concept adopted in this paper.
Both the nets in Figure 2 and Figure 3 are asymmetric. This is in agreement with the conclusions in [5]. In particular, it is not difficult to see that there is no symmetric version of self-stabilizing extension of the net shown in Figure 3. Trying to design such an extension will easily reveal the difficulty and impossibility.
The fact that the self-stabilizing extension in Figure 3 can only operate at half of the speed of the original net indicates that self-stabilization is usually expensive to implement. This also im-
plies, that self-stabilization may be a property too strong and too expensive for certain applications. The bound on the cpnvergeriqe rate for the self-stabilizing extension in Figure 3 is new and it is not known if this is the optimal bound for this type of delay sensitive rings.
The issue of self-stabilizing colored Petri nets is another interesting and important issue. Many complex concurrent phenomena are very difficult, if not impossible, to be modeled by low-level Petri nets. Some others cannot be adequately described by low-level Petri nets at all. Research in the area of high-level Petri nets is intended to remedy this problem [12]. High-level Petri nets are in general more expressive than its low-level counter parts and thus often give more succinct and manageable specifications. More importantly, all the formal analysis techniques developed for low-level nets have been easily extended to high-level nets.
The class of Colored Petri nets (CPNs) is one of the most influential types of high-level nets. Informally, in a CPN, tokens are differentiable -they are typed (i.e. colored). Each place now is associated with a color set - the set of types of tokens that can legally be in it. Transitions are now guarded with boolean expressions. A transition can fire provided that (1) it is enabled in the usual sense (for low-level nets) and (2) the associated guard expression is evaluated to true. Each arc is associated with an arc expression which regulates conditions under which certain types of tokens can be removed (from input places) or produced (to output places). Detailed definition can be found in [11].
Figure 6 (a) is a CPN that models the one-bit sliding window protocol. The color set associated with each place used is obvious and the explanation is omitted. Compared with Figure 5, the CPN is much smaller and succinct. A self-stabilizing extension of the CPN P is shown in Figure 6 (b), where the input set for the max transitions ti and ig are {^1,753} and {p2,'Pi} respectively.
Further research is needed to extend existing methodologies for low-level Petri nets to CPNs and develop new methodologies suitable for CPNs.
(s,x)
(a,x)
(s,x)
(a,x)
Figure 6: A CPN modeling the one-bit sliding window protocol: (a) the one-bit sliding window protocol; (b) the corresponding self-stabilizing extension
References
[1]	F. Bastani, I. Yen, and I. Chen, Class of inherently fault-tolerant distributed programs. IEEE Ti-ansaction on Software Engineering, Vol. 14(10), Oct. 1989, pp. 1432-1444.
[2]	GM. Brown, M.G. Gouda, and C. Wu, Token systems that self-stabilize. IEEE Transaction on Software Engineering, Vol. 14(6), June. 1989, pp. 845-852.
[3]	J. Burns and J. Pachi, Uniform self-stabilizing rings. ACM Tr-ans. on Programming Lang, and Systems, Vol. 11, No. 2, 1989, pp. 330-344.
[4]	L. Cherkasova, R. Howell, and L. Rosier, Bounded self-stabilizing Petri nets. Proc. of the 12th Annual Petri Net Conference, Dec. 1991.
[5]	E.W. Dijkstra, S elf-stabilizing systems in spite of distributed control. Communications of Association for Computing Machinery, Vol. 17(11), Nov. 1974, pp. 643-644.
[6]	Carlo Ghezzi, Dino Mandrioli, Sandro Mora-sca, and Mauro Pezze, A general way to put time in Petri nets. Proc. of the 5th International Workshop on Software Specification and Design, IEEE Computer Society Press, 1989.
[7]	M. Ghodsi and K. Kant, Well-formed generalized task graphs. Proc. of 1991 IEEE International Conference on Parallel and Distributed Systems, Dec. 1991.
[8]	S. Ghosh, Stabilizing Petri nets. Proc. of 1991 IEEE International Conference on Parallel and Distributed Systems, Dec. 1991.
[9]	M.G. Gouda and N.J. Multari, Stabilizing communication protocols. Technical Report, TR-90-20, Dept. of Comp. Sci, The Univ. of Texas at Austin, June 1990.
[10] M.G. Gouda, R. Howell and L. Rosier, The instability of self-stabilization. Acta Informatica, Vol 27, Sept. 1990, pp. 697-724.
[11]	K. Jensen, Coloured Petri nets: a high-level language for system design and analysis. In Advances in Petri nets 1990. Lecture Notes in Computer Science 483, Springer-Verlag, 1991, pp. 342-416.
[12]	K. Jensen and G. Rozenberg (eds.). Highlevel petri nets: theory and practice. Springer-Verlag, 1991.
[13]	K. Kant and W. Peng, On the modeling power of max transitions and high-level Petri nets with max transitions. Under preparation.
[14]	H.S.M. Kruijer, Self-stabilization (in spite of distributed control) in tree-structured systems. Information Processing Letter, Feb. 1979, pp. 91-95.
[15]	P.M. Merlin and D.J. Färber, Recoverability of communication protocols - implications of a theoretical study. IEEE Trans, on Comm., Vol. COM-24, Sept. 1976, pp. 1036-1043.
[16]	T. Murata, Petri Nets: properties, analysis, and applications. Proc. of IEEE, Apr. 1989, pp. 541-580.
[17]	J. Peterson, Petri nets theory and the modeling of systems. Prentice-Hall, Englewood Cliffs, NJ, 1981.
[18]	Proc. of International Workshop on Timed Petri Nets, Torino, Italy, IEEE Computer Society Press, July 1985.
[19]	P.H. Starke, Some Properties of Timed Nets under the Earliest Firing Rule. Advances in Petri Nets 1989, (ed.) G. Rozenberg, LNCS 524, Springer-Verlag, 1989.
[20]	A.S. Tanenbaum, Computer networks. Prentice-Hall, Englewood Cliffs, NJ, 1989.
[21]	W. Thomas. Automata on Infinite Objects. In Handbook of Theoretical Computer Science, Vol B. (ed.) J. van Leeuwen, pp:133-192, MIT Press, 1990.
[22]	W.M. Zuberek, Timed Petri Nets Definitions, Properties and Applications. Microelectronics and Reliability, Vol. 31(4), 1991, pp. 627-644.
Interface Methods: a Means for the Integration of Inheritance in a Concurrent OOP Language
Agostino Poggi
Dipartimento di Ingegneria dell'Informazione
University of Parma, Viale delle Scienze, 43100 Parma, Italy
Phone: +39 521 90 5728, Fax: +39 521 90 5723
E-mail: poggiOCE.UniPR.IT
Keywords: concurrent object-oriented programming languages, interference between inheritance and concurrency, software reuse
Edited by: Rudi Murn
Received: September 26, 1995 Revised: November 26, 1995 Accepted: January 9, 1996
Several researchers have pointed out that inheritance often conflicts with concurrency in object-oriented languages, resulting in inheritance anomalies where redefinitions of inherited methods are necessary in order to maintain the integrity of objects. Several solutions have been proposed for resolving the anomalies; however, some of them are incomplete and do not solve all the possible types of inheritance anomalies. Others make the defìnition of classes complex. This paper copes with the inheritance anomaly problem presenting a solution that minimizes the redefinition of inherited methods without increasing the complexity to write them. This solution is based on the use of a special set of methods, called interface methods, that divide their body from two sets of synchronization constraints respectively used to enable the execution of the body of this method and to disable the methods that cannot be executed after this method. In particular, the paper presents a number of examples that illustrate how our proposal can solve easily the different types of inheritance anomalies.
1 Introduction	of the problem of the interference of inheritance
with concurrency (inheritance anomaly problem) Inheritance and delegation are the means for sha-	(America, 1987; Kafura & Lee, 1989; Tomlinson ring knowledge in an object-oriented language.	^ Singh, 1989; Matsuoka & Yonezawa, 1993). In They are not alternative, but can coexist in a	synchronization often interferes with inhe-same language (see, for example, Vulcan - Kahn	"^^^ce because a local change of the synchroni-et al., 1987). There is an ongoing debate on the	constraints in a class may cause the rede-advantages of inheritance as opposed to those of	^nition of synchronization constraints in the su-delegation. The most common opinion is that in-	Perclass. Moreover, the effect of the local change heritance is more important than delegation beca-	^e not Hmited to the superclass because the use it provides more power in reusability, even if	superclass inherits from other classes and its chandelegation provides more power and flexibility in	g^s are inherited by all its subclasses, sharing because an object can delegate dynami-	Matsuoka and Yonezawa (1993) identified three cally its messages to different objects, while the	"^^in reasons why inheritance anomalies occur, knowledge sharing of inheritance is statically fi-	These three reasons are: i) partitiohing of accep-^gj	table states, ii) history - only sensitiveness of
, acceptable states, and iii) modification of accep-However, delegation has often been preferred j-^j^jg states
in concurrent object-oriented programming languages (Wegner, 1990; Chin &; Chanson, 1991; i) partitioning of acceptable states. The hfe of Wyatt et al., 1992; Agha et al., 1993) because	an object can be described on the basis of the
set of states it can assume. This set of states can be divided into disjoint subsets, called accept sets, according to the synchronization constraints of the object. When a subclass adds a new method, the set of states might need to be further partitioned as regards the set of states of the superclass because the synchronization constraints of the new method may not be properly accounted for in the partitioning of the superclass. For example, the states of a first-in first-out bounded buffer class, called bounded-buffer, offering a method, get, that removes an element from the buffer, and a method, put, that stores an element into the buffer, can be divided in three accept sets respectively named empty, partial and full If a subclass of the bounded.buffer class, called getž.huffer, adds a method, get2, that removes a couples of elements, then the accept set partial must be partitioned into two accept sets, get2-partial and one-element, because it is necessary to distinguish the state at which only one element is into the buffer and so get2 cannot be executed. In this case, the inherited methods must be redefined if their synchronization code involve expressions on accept sets.
ii)	history - only sensitiveness of acceptable states. The synchronization constraints of some methods depend on history information. For example, if a subclass of bounded-buffer, called gget-buffer, adds a method, gget, whose behavior is almost identical to that of get, with the exception that cannot be accepted immediately after the invocation of put, then the state "after executing put" cannot be defined on the basis of the variables inherited from bounded-buffer. In this case, it might be necessary to add a new variable, after-put, to maintain the state "after executing put" and to redefine the methods, put and get, because they must set and reset the value of that new variable.
iii)	modification of acceptable states. The introduction of a new method modifies the set of states under which the methods inherited from the superclass could be invoked. For example, let us consider a class, called locker, that suspends the service of all the mes-
sages through the execution of the method, lock, until it receives and accepts a request of the method unlock and executes it. This class can be used together the bounded-buffer class to define a new class, called lock-buffer. In this case, it may be necessary to redefine the methods inherited from bounded-buffer because the synchronization constraints introduced by locker causes the modification of the acceptable states of get and put since they cannot be executed when their object is locked.
In recent years, research on the inheritance anomaly problem has achieved some important results and some proposals have shown solutions of the inheritance anomaly problem. However, these proposals are based on some synchronization schemes that often make the definition of classes complex.
In this paper, we present an approach to the problem of interference between inheritance and concurrency that solves the inheritance anomaly problem, but uses a simple synchronization scheme that makes the definition of classes easy and natural. The next section discusses the features and the hmits of other approaches to synchronization in concurrent object-oriented systems. Section three presents our solution. Finally, section four presents the main contributions of our work, its relationships with the works presented in section two, and our future research directions.
2 Related work
The possibility to solve the problem of interference between inheritance and concurrency depends on the type of message passing control strategy. In fact, Kafura and Lee (1989) noted that only decentralized message control can be combined with inheritance without restriction of reuse and extensibihty because centrahzed message control groups synchronization constraints into a centralized method which cannot be inherited by a subclass if it almost always differs from the method of the superclass in some synchronization constraints^.
^This is the reason why concurrent object-oriented programming languages that use centralized message passing control usually do not support inheritance (Yonezawa, 1990; America, 1987; Agha, 1986).
One of the first approaches to integrate inheritance with concurrency has been introduced by Hybrid (Nierstrasz, 1987). Hybrid associates a delay queue with each method and this queue can be closed or opened by some other object methods. Nevertheless, when a subclass adds a method, then the methods inherited from its superclass must be modified if they must open or close the delay queue of the new method. This disadvantage is not only present in the cases that involve one of the three types of inheritance anomaly, but in all the cases that a new method modifies the value of a variable involved in the synchronization constraints of some inherited methods. For example, a subclass of the bounded-buffer class introduced in the previous section, that adds a method, last, for removing the last element of the buffer, must redefine both the inherited methods because get must close the delay queue of last when the buffer becomes empty, and put must open the delay queue of last when the buffer contains an element.
ACT-l-l- (Kafura &; Lee, 1989) integrates the become operation of actor's languages with behavior abstractions which denote sets of open methods. Therefore, the current behavior of an object indicates the set of methods that can be accepted and executed. Nevertheless, this approach has two main hmits: it is based on the assumption of closed system and so loses the property of openness of the Actor model (Hewitt & de Jong, 1983), and it does not solve the inheritance anomaly problem. For example, the definition of the get2-buffer class introduced into the previous section must redefined the put method because, while the inherited method can become full or partial, the new method can become full, get2-partial or one-element, and must redefined the get method because, while the inherited method can become partial or empty, the new method can become get2-partial, one-element or empty.
Rosette (Tomlinson k, Singh, 1989) proposes the integration between inheritance and concurrency through the use of enable-sets. Enable-sets are "natural units of inheritance" associating a set of executable methods with an object state. This approach derives from behavior abstraction and presents the following main properties: i) the composition of enable-sets for generating new
enable-sets (a subclass can both inherits enable-sets and use them to define new ones), ii) the pos-sibihty to pass enable-sets to other objects through messages, iii) the parameterization of enable-sets, and iv) the scheduling of messages on the basis of a pattern order dividing messages into classes of priority. Nevertheless, it uses the same synchronization mechanism of behavior abstraction and so it has the same problems to cope with the inheritance anomaly problem.
Guide (Decouchant et al., 1989) proposes the use of method guards. Each method has an activation condition (guard) which determines the states where the method may be executed. Method guards are a more natural and elegant synchronization scheme. The partitioning of states is not a problem, because they are able to directly judge whether the message is acceptable or not under the current state, but they do not solve both the history - only sensitiveness of acceptable states and the modification of acceptable states anomalies because in these cases it often involves the redefinition of a large set of methods. For example, the definition of the gget class introduced into the previous section involves the redefinition of put because it must set the variable after-put, and of get because it must reset the same variable.
Synchronization Actions (Neusius, 1991) are one of the few synchronization approaches that solves all the three types of inheritance anomaly. Synchronization Actions are based on methods divided in four parts: matching, pre, action and post that respectively specify the guard, the pre-actions, the body and the post-actions of the method. Synchronization Actions support intra-object concurrency and use behavior abstractions to exclude mutually interfering operations. This approach strengthens object encapsulation-separating the variables used in the synchronization constraints from the other variables and by imposing that they can be managed only by the synchronization parts, that is, the matching, pre and post parts of a method. However, the fact that the value of these variables cannot be modified by the body of a method often involves duphca-tion of code and variables in the body and in the synchronization parts (for example in the bounded-buffer example, the variables used to indicate the first and the last full position cannot be used
both to access the corresponding element and to check the state of the buffer). Moreover, its parts cannot be incrementally modified or reused in the definition of other methods.
Prolung (1992) solved the problem of the interference between inheritance and concurrency through the separation between synchronization constraints and object methods. He describes a synchronization constraint as restrictions that apply to invocations of object methods. Constraints can be incrementally modified, the restriction appHcable to a given method of a superclass can be increased for the same method of its subclasses, and new constraints can be generated by the aggregation of other constraints. However, the fact that a class can only increase the restrictions of its superclass is often a disadvantage. For example, let us reconsider the locker class introduced in the previous section, the synchronization constraint that disables the execution of the methods different from unlock when the object is locked, has the following definition:
(locked) prevents all_except unlock
This definition prohibits that a subclass of the locker class could include a new method that may be executed when the object is locked. For example, a locking class, introducing a method, inquireJock, that must be executed irrespective of the value of the lock because it inquires and returns the state of the lock, naturally derives from the locker, but cannot be defined as its subclass.
Matsuoka and Yonezawa (1993) proposed a synchronization model that allows both guard-based and accept method set based synchronization schemes to coexist and be integrated, so that the best scheme can be chosen to describe the diiferent synchronization constraints. This approach solves the three types of inheritance anomalies and the association between guards and accept method sets may reduce the re-definitions of the methods belonging to the same set. However, on the one hand, the advantage of this association is limited because a guard which is associated with an accept method set can involve method arguments only if all the methods of this set have those method arguments. On the other hand, in the case of partitioning of states anomaly this approach is always less efficient than guard methods because, while the guard methods approach adds only the new methods and their guards.
this approach adds the new methods and the new guards and refines some inherited accept method sets.
3 Our approach
In our approach, an object, that we call c-unit (concurrent unit), is composed of four main parts, an interface controller, a set of special methods, called interface methods, which serve messages, a set of other methods, which are used by interface methods to perform their tasks, and a set of variables.
Figure 1: The structure of the object.
The interface controller is an active component that gets the first message from the object input queue and serves it through the appropriate interface method when the synchronization constraints
allow, the service. If they do not allow the service of the message, then the interface controller puts the message into a pending queue. At the end of a service, the interface controller checks again if the messages in the pending queue may be served and, if so, serves them through the appropriate interface method (figure 1 shows a graphical description of the object model).
An interface method is composed of three parts respectively called body, guard and transitions. The body part is the code performing the task of the method. The guard and transitions parts define the object synchronization constraints. The guard is a boolean expression that is based on instance variables, interface method arguments and guard parts of other methods, and disables the execution of the body of this interface method when it evaluates to false. The transitions part is a set of couples, each of them is composed of a boolean expression and a set of interface methods. This boolean expression has the same form of a guard with the obvious exception that it cannot contain other guard parts, but it can contain other transitions parts. When a boolean expression evaluates to true, the corresponding set of interface methods cannot be the next executed method. The transitions part is evaluated after the body execution and is used to manage synchronization constraints that depend on historic object state information.
3.1 Object definition and specialization
This synchronization mechanism has been used in a C-I-+ implementation of CUBL - Concurrent Unit Based Language (Poggi, 1994). Therefore, the definition and the specialization of an object and of its methods follow C-f—f- rules. However, the three parts of an interface method can be separately speciahzed.
3.1.1 Example 1: object definition
As example of object definition, let us consider the bounded-buffer class introduced into section 1. Its definition contains the variables defining the buffer structure, the class constructor, and the two interface methods, get and put. Each of these two interface methods is composed of a body that performs the method task and a guard that
defines the state condition where the body can be executed (see figure 2).
class bounded_buffer c_unit {■ , . ,. int buf[MAX]; int in, out; public : bounded_buffer() { in = 0; out = 0;
}
int interface get() { guard (in != out)
// true when the buffer // is not empty, body { int res;
res = buf[out++]; out 7. = MAX; return res;
}
}
void interface put (int item) -[ guard
(((in - out) % MAX) < ( MAX - D) // true when the buffer is not full; 11 (MAX - 1) is the number of // elements that the buffer can // contain, body { buf [in++] = item; in % = MAX;
>
}
}
Figure 2: The bounded-buffer class.
3.1.2 Examples 2 and 3: object specialization
As specialization example, let us define a bounded buffer that introduces a new interface method for removing the last element put into the buffer. We can obtain this type of buffer by specializing the previous bounded-buffer class. This new class, called extended-buffer, introduces an interface method, called last, that has the same guard of the inherited interface method, get, and its body removes the last element put into the buffer (see figure 3).
class extended_buffer : bouiided_buffer { public : int interface last() { guard
guard:get() II true when the buffer // is not empty, body { in = (—in % MAX) ; return buf[in];
}
>
>
Figure 3: The extended-buffer class. Note that the form guard:get() calls the guard part of the get interface method inherited from the bo-unded.huffer class.
This approach allows an incremental modification of guard synchronization constraints. In fact, let us define a bounded buffer which supports writers priority. We can obtain this type of buffer by speciahzing the definition of the bounded-buffer class. This new class, called priority-buffer, adds a synchronization constraint to the guard of get that checks the presence of put requests into the object queue through a variable, called put-queued, managed by the interface controller (see figure 4).
class priority_buffer : bounded_buffer { public : int interface getO { guard
((bounded_buffer::guard:get0 != 0) && ((guard:put() == 0)
I I (put_queued == 0))) // true when the buffer is not empty // and when the buffer is full or // there are not put requests in // the object queue.
}
>
Figure 4: The priority-buffer class. Note that the form bounded-buffer: : guard: get 0 calls the guard part of the get interface method inherited from the bounded-buffer class and that the form guard:put 0 calls the guard part of the put interface method inherited from the bounded-buffer class.
3.2 Inheritance anomalies solutions
This approach solves the inheritance anomaly problem reducing the redefinition of method code and eliminating the modification of the main body of the methods when a subclass introduces new synchronization constraints. The following examples show how our approach solves the three types of inheritance anomaly.
3.2.1 Examples 4 and 5: partitioning of states anomaly
As example of the partitioning of states anomaly, let us define the getž-buffer class introduced in section 1. This class is a speciaUzation of the previous bounded-buffer and its definition introduces an interface method, called get2, whose body gets couples of elements and whose guard evaluates to true when there is more than one element in the buffer (see figure 5).
class get2_buffer : bounded_buffer { public.:
int* interface get2(int* els) { guard
(((in - out) % MAX) > 1) // true when the buffer contains // more than eoi element, body { els [0] = buf [out++]; els[l] = buf Cout++] ; out y. = MAX; return els;
}
}
>
Figure 5: The get2-buffer class.
This approach also gives a good solution when synchronization constraints depend on method arguments. For example, let us define a bounded buffer allowing the withdrawal of a variable number of elements fixed by the external requests. This bounded buffer is obtained by specializing the previous bounded-buffer class. This new class, called mget-buffer, introduces an interface method, called mget, whose body gets m elements from the buffer and whose guard evaluates to true when there are a number of elements into the buffer equal or greater than m (see figure 6).
class mget_buffer : bounded_buffer { public :
int* interface mget(int* eis, int m) ■[ guard
(((in - out) y. MAX) >= m) // true when the buffer contains a // number of elements equal or // greater than m. body { int i;
for(i =0; i < m; i++)
els [i] = buf [out++]; out y. = MAX; return els;
>
}
}
Figure 6: The mget.buffer class.
3.2.2 Examples 6 and 7: history - only sensitiveness of acceptable states anomaly
As example of the history - only sensitiveness of acceptable states anomaly, let us define the ggeLbujfer class introduced into section 1. This class is a specialization of the previous bounded-buffer and its definition introduces an interface method, called gget, that has the same guard and body parts of the inherited get interface method. Moreover, it specializes -put interface method adding the transitions part to disable the gget interface method after its execution (see figure 7).
This approach allows an incremental modification of transitions synchronization constraints too. In fact, let us specialize the previous gget.buffer class introducing an interface method that removes the last element put into the buffer and that, in the same way of gget^ cannot be accepted immediately after the invocation of put. In this case, we define a new class, called gget.bufferl, by adding the interface method last, and a new transition to the transitions of put that disables the execution of last (see figure 8).
class gget_buffer : bounded_buffer { public :
void interface put(int item) { transitions
TRUE disable { gget }; // gget cannot be the // next executed method.
}
int interface ggetO guard
guard:get 0 // true when the buffer // is not empty, body i body, get 0 ;
// calls the body of get. }
>
}
Figure 7: The gget.buffer class. Note that the forms guard:get() and body:get() respectively call the guard and the body of the get interface method inherited from the bounded-buffer class.
class gget_bufferl : gget.buffer { public :
int interface lastO { guard
guard:get 0 // true when the buffer is not empty, body { in = (—in y, MAX) ; return buf[in];
}
>
void interface put(int item) { transitions gget.buffer:: transitions:put() TRUE disable i last }; // gget and last cannot be // the next executed method.
>
}
Figure 8: The ggetJ)ufferl class. Note that the form guard:get() calls the guard part of the get interface method that it is inherited from the gget.buffer class and that the form gget.buffer: :transitions;put() calls the transitions part of the put interface method that it is inherited from the gget.buffer class.
3.2.3 Example 8 and 9: modification of acceptable states anomaly
As example of the modification of acceptable states anomaly, let us define the lock-buffer class. The definition of this buffer can be obtained by simply mixing the definition of the locker class into the definition of the bounded-buffer class. The definition of the locker class contains two interface methods, lock and unlock] the transitions part of lock disables the execution of all the interface methods different from unlock!^-, the guard of unlock avoids its execution when the object is not locked (see figure 9).
Synchronization constraints can be incrementally modified, but can be redefined too. Therefore, problems do not exist in the definition of a subclass of the locker class introducing a method, inquireJock, that must be executed irrespective of the value of the lock (see figure 10).
class locker c_unit {
int locked: public : lockerO { locked = FALSE;
}
void interface lockO { body "C locked = TRUE;
}
transitions
TRUE disable all.except { unlock }; // only unlock can be executed.
>
void interface unlockO -[ guard"
(locked == TRUE) // true when the object is locked, body { locked = FALSE;
>
}
>
class lock_buffer
: locker, bounded_buffer
{ }
Figure 9: The locker and lock-buffer classes.
^Its effect hits all the interface methods of the^ubclasses of the locker class.
class inquire_locker : locker { public :
int interface inquire_lock () { body { return locked;
}
>
void interface lockO { transitions TRUE disable
all_except { unlock inquire_lock }; // only unlock and inquire_lock // can be executed.
}
>
Figure 10: The inquire-locker class.
4 Conclusions
In this paper, we have presented a framework to solve the problem of the interference between inheritance and concurrency in concurrent object-oriented programming languages. This synchronization mechanism has been used into a C-I-+ implementation of CUBL - Concurrent Unit Based Language (Poggi, 1994). CUBL is a concurrent object-oriented programming language, which derives from the Actor model (Agha, 1986) and ABCL (Yonezawa, 1990). This languages includes both passive and active objects, and communication is based on a large set of message exchange primitives which allow broadcasting besides past, now and future messages. CUBL supports physical distribution of the objects in a UNIX network managing communication with external objects via UNIX sockets.
Our synchronization scheme derives from guard methods (Decouchant et al., 1989) adding to them a new synchronization part, that is, the transitions part, to solve the two types of inheritance anomahes that guard methods cannot solve, that is, the history - only sensitiveness of acceptable states and the modification of acceptable states anomahes.
Unlike Matsuoka and Yonezawa (199-3), we attach guards to single methods and not to accept methods sets because it is simpler and more natural to define a method together with its synchronization constraints than to define together all the
synchronization constraints of a class. In fact, when a class contains a large number of accept method sets the addition of a new method may be complex because the programmer must understand what accept method sets must contain this method, and may cause a cumbersome redefinition of the synchronization code when this method must be included into a large part of the inherited accept method sets. Moreover, on the one hand, the transitions part may reduce the redefinition of methods in the cases of history - only sensitiveness of acceptable states and modification of acceptable states anomahes. On the other hand, the association of a guard to a single method simplifies and often eliminates re-definitions of inherited methods in the cases of state partitioning anomaly.
Class and methods can be specialized taking advantage of C-|—I- inheritance rules. In particular, the three parts of an interface method can be separately speciaUzed. The use of C-|-f allows an efficient implementation of interface methods. In fact, its three parts are translated into three inline C-f-f methods and so the call of an interface method is not made heavy by additional method calls. Moreover, the use of program transformation for the implementation of the guard parts reduces the overhead that would be derived from their naive implementation (Matsuoka et al., 1990).
This framework is based on the assumption that an object has at most one active thread and so method executions are serialized. The problem to have several active threads in an object is one of our future research directions and can be simply solved on the basis of the method used by Synchronization Actions (Neusius, 1991). In fact, we can introduce a new synchronization part to be executed after the evaluation of the guard part of the method. This new part will have the same form of the transitions part and its task will be to disable the methods that cannot be executed in parallel to that method.
Other future research directions are the development of real applications to make experiences on the use of such an object model and the refinement of the object model implementation to increase efficiency.
5 Acknowledgements
The work has been partially supported by the National Research Council through "CNR - Progetto Finalizzato Trasporti" and by the Ministri of University, Scientific Research and Technology through "MURST%40 - Sistemi Intelligenti".
References
[1]	G. Agha. Actors, A Model of Concurrent Computation in Distributed Systems. The MIT Press, Cambridge, MA, 1986.
[2]	G. Agha, P. Wegner, and A. Yonezawa. Research Directions in Concurrent Object-Oriented Programming. The MIT Press, Cambridge, MA, 1993.
[3]	P. America. Inheritance and subtyping in a parallel object-oriented language. In Proc. OOPSLA '87, pages 234-242, Orlando, FL, 1987.
[4]	P. America. POOL-T: a parallel object-oriented language. In A. Yonezawa and M. To-koro, editors, Object-Oriented Concurrent Programming, pages 199-220, Cambridge, MA, 1987. MIT Press.
[5]	R.S. Chin and S.M. Chanson. Distributed object-based programming systems. ACM Comput. Surveys, 23(1):91-124, 1991.
[6]	D. Decouchant, S. Krakowiak, M. Meysem-bourg, M. Riveill, and X. Rousset de Pina. A synchronization mechanism for typed objects in a distributed system. SIGPLAN Notices, 24(4):105-107, 1989.
[7]	S. Frolung. Inheritance of synchronization constraint in concurrent object-oriented programming languages. In Proc. ECO OP '92, pages 185-196, Utrecht, The Netherlands, 1992.
[8]	C. Hewitt and P. de Jong. Analyzing the rules of descriptions and actions in Open System. In Proc. AAAI-83, pages 162-167, Washington, DC, 1983.
[9]	D.G. Kafura and K.H. Lee. Inheritance in actor based concurrent object-oriented languages. The Computer Journal, 32(4):297-304, 1989.
[10]	K. Kahn, E. lYibble, M. Miller, and D.G. Bo-brow. Vulcan: Logical concurrent objects. In B. Shriver and P. Wegner, editors. Research Directions in Object-Oriented Programming, pages 75-112, Cambridge, MA, 1987. The MIT Press.
[11]	S. Matsuoka, K. Wakita, and A. Yonezawa. Synchronization constraints with inheritance: What is not possible - so what is? Technical Report 10, Department of Information Science, The University of Tokio, Tokio, Japan, 1990.
[12]	S. Matsuoka and A. Yonezawa. Analysis of inheritance anomaly in object-oriented concurrent programming languages. In G. Agha, P. Wegner, and A. Yonezawa, editors. Research directions in Concurrent Object-Oriented Programming, pages 107-150, Cambridge, MA, 1993. MIT Press.
[13]	C. Neusius. Synchronization actions. In Proc. ECOOP '91, pages 118-132, Geneva, Switzerland, 1991.
[14]	O. Nierstrasz. Active objects in Hybrid. In Proc. OOP SLA '87, pages 243-253, Orlando, FL, 1987.
[15]	A. Poggi. Agents and resources management with CUBL. In Proc. HICSS '94, pages 112121, Maui, HI, 1994.
[16]	C. Tomlinson and V. Singh. Inheritance and synchronization with enable-sets. In Proc. OOPSLA '89, pages 103-112, New Orleans, LA, 1989.
[17]	P. Wegner. Concepts and paradigms of object-oriented programming. OOOPS Messenger, l(l):7-87, 1990.
[18]	B.B. Wyatt, K. Kavi, and S. Hufnagel. Parallelism in object-oriented languages: a survey. IEEE Software, 9(6):56-66, 1992.
[19]	A. Yonezawa. ABCL: An Object-Oriented Concurrent System. The MIT Press, Cambridge, MA, 1990.
The Fringe Bucket-Quadtree, a Robust and Efficient Spatial Data Structure with a High Population Density
R. Maelbrancke and H. O li vie
K.U.Leuven, Department of Computing Science, Celestijnenlaan 200A, B3001 Heverlee, Belgium Phone: ++32-16-327643, Fax: ++32-16-327996 E-mail: {rudim,olivie}@cs.kuleuven.ac.be
Keywords: quadtree, spatial data structures
Edited by: Matjaž Gams Received: June 1, 1995
Revised: September 20, 1995 Accepted: October 10, 1995
In this paper we present three methods for the augmentation of population density in bucket oriented quadtrees. These improvement techniques are incorporated in a new data structure called the Fringe Bucket-quadtree. Analysis of this structure shows a population density of 77%.
1 Introduction
In a number of application areas, such as geographic information systems (GIS for short) and spatial databases, searches involving several attributes are very common. If such multiple-attribute searches are the rule and the single-attribute searches are the exception, multi-attribute indexes are the best choice. Spatial data structures are best suited to represent multi-attribute indexes. An overview of such data structures is given by Samet [12].
Since the number of data represented in a GIS is huge, indexes for external storage media are recommended. For an optimal use of the external storage capacities, it is recommended that such indexes have a high filling degree or population density. The population density is the relationship between the actual number of data items in a bucket and the maximal number of data items the bucket can hold.
In this paper a fringe search quadtree is introduced. Fringe search trees have data references in a Hmited number of nodes at the end of each search path.
The fringe search tree compares to a combination of a node search tree and a leaf search tree. Leaf gearch trees ßtore their data references only in the leaves. Node search trees store data in the internal nodes.
We call a bucket oriented fringe search quadtree
Figure 1: A PR quadtree
a Fringe-Bucket quadtree (FB-quadtree for short). The FB-quadtree has a high population density combined with a low and deterministic number of block acesses.
To argue the way we improved quadtrees, we first give an overview of preceding research results.
1.1 PR quadtrees
In [12], Samet presents bucket Point Region quadtrees (PR quadtrees) as an alternative for Finkel's point quadtrees [3].
PR quadtrees differ from quadtrees in three ways (cf. figure 1).
1.	when a region is spht it is devided into four equal sized squares
2.	the PR quadtree is a leaf search tree; every time a node is split the data are distributed among the four sons
3.	the leaves can contain more than one information item; the number depends on the size of the data buckets the leaves are referring to
Figure 2: A holey brick (A)
Since PR quadtrees have a data structure comparable to point quadtrees, search algorithms can be borrowed [14]. However, the population density is only 46% on average [1].
1.2 Holey Bricks
Spatial data structures have often been compared with their linear ancestors. Especially when bucket trees were introduced, the average population density of 69% in B-trees became a main goal. Nievergelt was one of the first researchers to mention the population density for spatial data structures in [8]. He conducted simulation studies showing a 70% population density. However, problems such as the splitting of buckets other than the overflowing bucket and the dead space problem due to the lack of merging candidates, may seriously degrade the population density. The kdB-tree [11], a spatial B-tree, reached a population density of only 60% for uniformly distributed data. The BD tree [9], also termed BANG file [4], is another major example. However, all these structures are very dependent on the data distribution.
In 1990, Lomet [7] introduced the holey brick tree (hB-tree for short). It opened a new direction in the quest for the 69%.
The main problem in reaching this goal is the way the splitting of the space is performed. When the points are distributed in the form of a cross (cf. figure 2), the region cannot be split up into two regions in such a way that there is a good distribution over the two parts. Therefore Lomet has introduced holey bricks. A region is divided into equal shaped parts if there is no cross distribution problem, otherwise a small part is cut out (cf. figure 2). However, hB-trees have a fairly complex structure which is hard to maintain.
To avoid a complex data structure when using holey bricks, Ouksel has introduced the Nested
Interpolation Based Grid File (NIBGF for short) [10]. The main idea is that on a same level regions are spht into two if a | to | distribution over the two regions can be reached, otherwise a small part is pushed down to a lower level in order to obtain a holey brick. Since the NIBGF uses B-trees to store information, it is a very efficient and robust data structure. A main drawback is the complexity of the algorithms, especially for the deletion.
1.3	Joining Buddy Partitions
Seeger [13] could reach a 74.5% population density [5] by joining neighbour partitions into one bucket in his implementation of the BUDDY-f structure. The fact that the BUDDY-F structure is static restricts its applications.
1.4	Conclusion
We know from Samet that ciuadtrees are easy to maintain and commonly used structures. It has been argued that the alternatives mentioned above are all very complex and difficult to maintain.
In section 2, methods to improve the population density of PR quadtrees are introduced. A data structure, incorporating this population density improvements, the Fringe Bucket-quadtree is defined in section 3 and empirically analyzed in section 4.
2 Augmenting the Population Density
Before we give three methods for the augmentation of the population density of PR quadtrees, we give a short description of the node structure.
2.1 The PR Quadtree Node Structure
The node structure consists of two parts, the discriminators and the keys. The discriminators are kept in main memory, while the keys are stored on block structured external storage media. We represent a node in a PR quadtree as five pointers. Four pointers are used for the North-West (NW), North-East (NE), South-West (SW) and South-East (SE) sons. In case of a leaf node the
DATA-BUCKET
NW NE SW SB
Figure 3: A node of a PR quadtree
na-t
Figure 4: A possible combination of squares in a quadtree
fifth pointer points to the external data bucket containing the keys and the data references. In figure 3 a schematic representation of the node of a PR quadtree is given.
As a PR quadtree always splits in the same way, the coordinates of the square in the quadtree partition can be calculated dynamically. Therefore the coordinates of the represented square must not be kept in the nodes.
2.2 Method 1: The Combination of Nodes
In PR quadtrees, regions are always spht into four equal sized squares. This may lead to underpopulated nodes.
Method 1 combines nodes in such a way that a higher population density is reached. If the nodes are chosen carefully, a fairly good distribution of the points is obtained.
Figure 4 illustrates a combination of squares. The resulting structure can still be traversed in the same way as a PR quadtree. If a node represents a combination of squares, it has more than one incoming edge.
If a node that represents a combination of squares flows over it is split in such a way that the best population density is obtained. Once a node represents only one square, when it flows over it will give birth to new combined nodes. The empirical analysis of the implementation of this method (cf. section 4) shows a population density of 66%.
Figure 5: The transformation a leaf search quadtree into a node search quadtree
2.3 Method 2: Transforming the PR Quadtree into a Node Search Tree
The population density can be augmented if the PR quadtree is transformed into a node search tree.
In case of an overflow, the contents of the node do not have to be distributed over the son nodes anymore. On the contrary, the father node may keep the data, while the sons are ready to accept new data.
If we adapt method 1 to node search trees, an overflowing node gives birth to only one son node which is the combination of four squares. Hence, this combined son node represents a space that has the same size as it fathers one. In that way, the 50% minimum bound is obtained if the data are distributed over the father and the son. This method also solves the cross-distribution problem and the repeated spHtting problem (in case of points that are located extermely close to each other). Data are indeed distributed over the levels instead of over the nodes on one level. In figure 5 an example is given.
Let us analyze the node occupancy mathematically. The lowest level of an original PR quadtree contains at most 3 times the number of nodes in the rest of the tree. We know that the fiUing degree in the lowest level of a PR quadtree improved using method 1 is 66%. If only insertions are performed, the higher levels of a PR quadtree that is improved with methods 1 and 2, have a population density of 100%, leading to
^ ■ 66 -I- i • 100 = 77
O	o
This is more than what Kriegel [5] has obtained with the BUDDY+ structure. Empirical analysis of method 1 combined with method 2 show a population degree of 77.7%.
2.4 Method 3: Restricting Method 2 to the Lowest Two Levels
As separating discriminators and data is essential for good range search performance [6], method 2 is not ideal.
If we restrict method 2 to the lowest two nodes on each search path of the tree,we only weaken a little the separation of discrimators and data. If we define this weakening strictly, we require that on each path towards a key, at most two data buckets should be visited. Since nowadays memory caches are much larger than data blocks on external media, these two levels will not remarkably reduce performance. The empirical analysis of method 3 shows a slightly higher population density than the one obtained with method 2, namely 78%.
3 Fringe Bucket-Quadtrees 3.1 Definition
The multi-attribute keys of the data are represented by n coordinates of an n-dimensional space S.
Preliminary Definitions
-	5 is split repeatedly in n-squares
-	a simple node is a node that represents an n-square
-	a compound node is a node representing a combination of n-squares
-	a compound node that contains 2" n-squares is called complete.
-	an inner node is a node that has no data references
-	a node containing a data reference is called a data node
___Inner luxlc
Simple node
^ Compound node Complete node
Figure 6: The structure of an FB-quadtree
—	a data node that does not represent data for one or more n-squares of the region it represents, is called a hybrid node ^
-	a data node with no sons is called a leaf node
Definition 1 A hierarchical spatial data structure (tree for short) is called a Fringe Bucket-Quadtree, FB-quadtree for short, iff
1.	exactly one node, called the root has no incoming edges; the root represents S
2.	each inner node u has 2"- outgoing edges; each edge represents an 7i-square of the partition into 2" of the subspace represented by v, each edge points to the node (either a simple or a compound node) that contains the represented n-square
3.	each son node of v may have at most 2" incoming edges from v
4.	a compound node does not have sons
5.	on each path from the root to a leaf, at most the last two nodes in the path can refer to the data buckets . Such a bucket can contain at most b keys.
We denote by b the bucketsize of an FB-quadtree. In figure 6 a scheme of the FB-quadtree is shown.
^This case occurs when a search path is crossing the data node while still more than one data node is following this one


a.
DATA-BUCKET
NW
NE SW
SE
Figure 7: A node of an FB-quadtree
3.2	The Node Structure
The node structure for FB-quadtrees is the data structure of the PR quadtree, augmented with a tag for each outgoing edge. A true edge tag determines the presence of data for the n-square represented by that edge in the referred data bucket. Figure 7 gives a schematic view on the node structure for n = 2.
3.3	The Point Search Algorithm
Search for the key k in an FB-quadtree t BEGIN
start from the root;
determine in which n-square k can be placed;
let E be the edge associated with that n-square;
IF the current node is an inner node
THEN continue the search on the lower level in the node reached along E;
ELSE
IF the current node is a hybrid node
and the iJ-tag is false THEN continue the search on the lower level in the node reached along E\ ELSE
BEGIN
search in the data bucket of the current node for k] IF k is found THEN stop ELSE continue the search on the lower level in the node reached along E; END
3.4 The Insertion Algorithm
The insertion algorithm is described top down.
3.4.1	Insert the Key k
Initially an FB-quadtree consists of one simple leaf node. Insert the key k.
BEGIN
perform the point search algorithm {ß denotes the node where the search ends);
IF the father of ß denoted by cß^ß) ha.s vacant positions
THEN add k to 4>iß)
ELSE CASE ß
has vacant positions: add k to ß; break;
is a simple leaf node: CALL double
ß and add k to ß\ break; otherw^ise CALL split or reduce ß
and add k to ß END
END
3.4.2	Double the Data Node
Double ß
BEGIN
create a complete compound node on the level below ß;
move the keys in 4>{ß) that correspond to the n-square represented by ß to the son of/3
set the tag in ^{ß) which corresponds to the incominge edge of ß to false;
IF the son of ß contains less than 50% of keys
THEN move keys from ß to its son until the 50% is reached
END
END
Figure 8 and figure 9 illustrate this case.
Figure 8: Doubling a data node
Figure 9: The creation of a hybrid node
3.4.3 Split or Reduce the Data Node
Split or reduce ß.
BEGIN
IF all keys are located in one n-square of the region
THEN ß is reduced to a simple data node that represents the involved n-square and is doubled
ELSE split /? in a simple data node and either a compound data node or simple data node depending on the arity of ß.
END
The cases are respectively illustrated in figures 10
and 11.
Figure 11: Splitting a bucket
3.5 The Deletion Algorithm
The deletion algorithm can be derived directly from the insertion algorithm: Delete key k.
BEGIN
perform the point search algorithm {ß denotes the node where the search ends);
remove k from /?;
IF the population in ß is less than 50% THEN
BEGIN
IF (l){ß) contains more than 50% of keys
THEN move a key from </>(/?) to ß ELSE
IF a brother compound node of ß contains less than 50% of keys THEN join ß with this brother ELSE
IF /3 is a complete compound node THEN BEGIN
join ß with (f){ß)-, IF (j){(t>{ß)) is a hybrid node THEN set to false 4>{(j>{ß)ys edge tag pointing to ß END
END
IF (j){ß) is a hybrid bucket and ß has no sons anymore
THEN set to false (t>{ßys edge tag pointing to ß
Figure 10: Reducing a bucket
END
Distribution	6= 16	b = 100
Uniform Gauss Diagonal Gauss Vertical Gauss Gauss Clusters	77.33 % [77%, 79%] 78.00 % [77%, 79%] 77.82 % [77%, 78%] 77.82 % [77%, 78%] 77.60 % [77%, 78%]	78.22 % [75%, 80%] 76.90 % [76%, 78%] 74.73 % [73%, 75%] 76.51 % [75%, 77%] 75.86 % [74%, 77%]
Average	77.71 %	76.45 %
Table 1: Mean population density and running mean population density bounds of FB-quadtrees
4 Empirical Analysis 4.1 Input Sets
To test the robustness and behaviour of the FB-quadtree, several input sets were generated. All sets consist of one million points taken from [0..8192[x[0..8192[. We have chosen real values to avoid an upper bound on the height of the tree. Five kinds of distributions have been generated, namely:
1.	a uniform distribution
2.	a Gaussian distribution
3.	a diagonal Gaussian distribution, along the diagonal a small band of gaussian distributed points
4.	a vertical line distribution, as in 3, but along a vertical axis
5.	ten clusters of Gaussian distributions
Distr. / Level	b		1,2,3,4,5,6,7,8,9,10,11,12,13]
Uniform	16 100		0,0,0,0,0,0,97,73,58,0,0,0,0] 0,0,0,0,100,100,56,0,0,0,0,0,0]
Gauss	16 100		0,0,89,85,85,85,85,86,72,61,0,0,0] 0,98,82,87,86,85,85,61,0,0,0,0,0]
Diagonal Gauss	16 100		0,0,0,0,0,0,59,78,85,90,73,59,0] 0,0,0,0,0,53,84,72,90,59,0,0,0]
Vertical Gauss	16 100		0,0,0,0,0,0,97,96,83,90,72,59,0] 0,0,0,0,0,0,47,88,88,61,0,0,0]
Gauss Clusters	16 100		0,7,5,21,25,45,77,86,85,86,68,58,0] 5,4,12,23,40,69,82,85,82,61,0,0,0]
Table 2: The distribution of the population densities (in %) among the levels
Distribution	0 = 2	0 = 4	6-8
Uniform Gauss	85.26 % 85.19 %	81.02 % 81.00 %	79.15 % 79.04 %
Table 3: The population density under several bucket sizes (100000 points)
4.3 Discussion of the Results
The empirical analysis gives us a good indication of the improvements that have been made applying the introduced adaptations to PR quadtrees.
The population density of FB quadtrees is higher than more complicated structures which were invented to obtain a higher population density (e.g. hB-trees, NIBGF, BUDDY, ...). In comparison with the results obtained by Ang and Samet [1] for uniformly distributed data the gain is about 72%: 47% -> 81%. This comparison is shown in table 4.
4.2 Results
In the following tables(l,2) results of empirical analysis are given. To calculate the mean population density, the running average is calculated from the 10000th insertion on. The result given is the running average while performing the 1000000th insertion. The lower and upper bounds of the running average are given too ([minimum %, maximum %]).
To show the influence of the bucket size, the experiment is redone for small ò's and shows an even better filling degree. Remarkable is the high fil-hng for Ò = 2, which means that more than half of the buckets are filled. These results are displayed in table 3.
5 Conclusion and Further Research
We have introduced three methods for the augmentation of the population density in PR-
b	PR quadtree	FB-quadtree
2	46.0 %	85.2 %
4	46.3 %	81.0 %
8	47.5 %	79.0 %
Avg.	46.6 %	81.7 %
Table 4: Population comparison with PR quadtrees
quadtrees. We gained about 70%, resulting in a 77% population density, by combining parts and allowing the two lowest nodes in a path to contain data.
These methods have resulted in a new structure called the Fringe Bucket-quadtree. It is very useful for database implementations. Our further work will consist in the elaboration and the analysis of a generalization of the FB-quadtree for higher dimensions. Current analysis of the octtree case show comparable results.
References
[1]	Ang C. & Samet H. (1989) Node distribution in a PR quadtree. In Design and Implementation of Large Spatial Databases, p. 233-252, Berlin Heidelberg New York: Springer Verlag.
[2]	Bayer R. & McCreight E. (1972) Organisation and maintenance of large ordered indexes. Acta Informatica, 1, 3, p. 173-189.
[3]	Finkel R. A. & Bentley J. L. (1974) Quad trees: a data structure for retrieval on composite keys. Acta Informatica, 4 ,1, p. 1-9.
[4]	Freeston P. (1987) The BANG file. In Proceedings of ACM SIGMOD, p. 260-269.
[5]	Kriegel H. P. & Schiwietz .M et al (1989) Performance comparison of point and spatial access methods. In Design and Implementation of Large Spatial Databases, p. 89-114, Berlin Heidelberg New York: Springer Verlag.
[6]	Lomet D. B. (1991) Grow and post index trees: Role, techniques and future potential. In Advances in Spatial Databases, p. 183206, Berlin Heidelberg New York: Springer Verlag.
[7]	Lomet D. B. & Salzberg B. (1990) The hB-tree: A multiattribute indexing method with good guaranteed performance. ACM Transactions on Database Systems, 15 ,4, p. 625658.
[8]	Nievergelt J. & Hinterberger H. & Sevcik K. C. (1984) The grid file: an adaptable symmetric multykey file structure. ACM Transactions on Database Systems, 9, 1, p. 38-71.
[9]	Ohsawa J. & Sakauchi M. (1983) The BD-tree: a new n-dimensional data structure with high efficient dynamic characteristics. Information Processing, 83, p. 539-544.
[10]	Ouksel M. A. & Mayer O. (1992) A robust and efficient spatial data structure. Acta Informatica, 29, p. 335-373.
[11]	Robinson J. T. (1984) The K-D-B-tree: A search structure for large multidimensional dynamic indexes. In Proceedings of ACM SIGMOD Conference on Management of Data.
[12]	Samet H. (1990) The Design and Analysis of Spatial Data Structures. Addison-Wesley Series on Computer Science and Information Processing. Addison-Wesley Publishing Company.
[13]	Seeger B. & Kriegel H. P. (1990) The buddy-tree: an efficient and robust access method for spatial data base systems. In Proceedings of the 16th International Conference on Very Large Data Bases, p. 590-601.
[14]	Shafferd C. A. & Samet H. & Nelson R. C. (1990) Quilt: a geographic information system based on quadtrees. Int. J. Geographical Information Systems, 4, 2, p. 103-131.
THE NEW YORK ACADEMY OF SCIENCES (NYAS)
Oh Plato, Oh Socrates, you who walked and talked in the grove of Academe, join us in our walks and talks—and take pride in the published records of the new and the true that our Academy gives the world.
—John A. Wheeler Astrophysicist, University of Texas [1]
In this column, Informatica will bring periodically, news and relevant information concerning the largest worldwide academy of sciences—The New York Academy of Sciences. Reports will include the initiatives and activities of the NYAS members related to the editorship of Informatica and the support offered by NYAS to scientific events and communities outside NYAS.
1	The World-wide Relevance of NYAS
NYAS is a proper cosmopolitan academy, according to its membership spread over the globe, its international organization experience, scientific excellence regarding members and publications, and a long—almost 180 years lasting— tradition and democracy. This position is wide before other, nationally conceptualized academies of sciences, in which certain national (ist) preferences are dominating, for example, political impact, national appurtenance, privileged status of local academy members (e.g., their permanent monthly financial rewarding), ethnic (nepotistic) selection, arrangement of rules, etc.
2	A Short Presentation of NYAS
2.1 Introduction
A new member of the Academy [1] becomes part of the oldest and most distinguished scientific societies in America, if not in the world—a society which since 1817 has numbered among its members Thomas Jefferson, John James Audubon and, in this century, more than 40 Nobel Laureates and many other eminent scientists.
The international membership embraces over 47,000 distinguished scientists from every state in the U.S.A. and some 160 countries all over the world [2]. Through individual membership one's knowledge will be enriched and broaden when communicating with colleagues in one's own and related fields, but it will also bring the member a remarkable range of tangible benefits of substantial value from other subjects.
2.2 Board of Governors
The board of the governors in the period July 1995-June 1996 is the following:
-	Chairman of the Board
Joshua Lederberg, Nobel Laureate in Physiology or Medicine, Geneticist, The Rockfeller University
-	President
Henry M. Greenberg, Columbia University
-	President-Elect
Martin L. Leibowitz, Managing Director, Salomon Brothers Ine
-	Treasurer
Henry A. Lichstein, Vice President, Citibank, N.A.
-	Governors
Eleanor Baum,
Dean, School of Engineering, The Cooper Union
Barry R. Bloom, Weinstock Professor, Department of Microbiology and Immunology, Albert Einstein College of Medicine
D. Allan Bromley, Sterling Professor of the Sciences & Dean of Eng., Yale University
Edward Cohen, Chairman & CEO, Ammann & Whitney, Ine
Susanna Cunningham-Rundles, Associate Professor of Immunology, Cornell University Medical College
Bill Grenn, Director, N.Y.C. Housing Development Corporation
Sandra Panem, Vice President, Oppenheimer Management Corporation
Richard A. Rißind, Chairman, Sloan-Kettering Institute
Dominick Salvatore, Professor and Director of Graduate Program, Department of Economics, Fordham University
David E. Shaw, Managing General Partner, D.E. Shaw k Co
William C. Steere, Jr., Chairman & CEO, Pfizer Ine
Shmuel Winograd, Director, Mathematical Sciences Department, T.J. Watson Research Center, IBM
-	Past Chairman
Cyril M. Harris, Batchelor Professor Emeritus of Electrical Engineering & Professor Emeritus of Architecture, Columbia University
-	Honorary Life Governor
William T. Golden, Chaiman Emeritus, American Museum of Natual History
-	Legal Counsel: Helene L. Kaplan, Skadden, Arps, Slate, Meagher & Flom
2.3	Senior Staff
Chief Executive Officer Rodney W. Nichols
Executive Editor, The Annals Bill M. Boland
Controller Vernon A. Bramble
Editor-in-Chief, The Sciences Peter G. Brown
Director, Communications Ann E. Collins
Director, Marketing & Membership Katherine T. Goldring
Manager, Sections and Meetings Matthew M. Katz
Director, Human Resources Leslie A. Mooney
Director, Policy Staff Susan U. Raymond
Director, Education Lori D. Skopp
Director, Human Rights of Scientists Sve-tlana Stone
Director, Conferences Renée Wilkerson (acting)
2.4	Section Chairs and Vice-Chairs
Anthropology Co-Chairs
Linda Bäsch and R. Brian Ferguson Co-Vice-Chairs
Antonio Lauria-Pericelli and Johanna Lessinger
Atmospheric Sciences Chair Stanley Gedzelman Vice-Chair Mark Kramer
Biochemistry Chair Laurence P. Wennogle Vice-Chair Jo Anne M. Saye
Biological Sciences Chair Gertrude Pfaffenbach Vice-Chair Lorraine M. Moran
Biomedical Sciences Chair Victoria N. Luine
Vice-Chair Christina R. McKittrick
Chemical Sciences Chair A.M. Venkatesan Vice-Chair Kang Zhao
Computer and Information Sciences Chair D. Frank Hsu
Vice-Chair Pauline M. Rothstein
Economics Chair Laurence R. Klein
Co-Vice-Chairs Dominick Salvatore and Edmund Phelps
Engineering Chair Joel J. Kirman Vice-Chair James Cohen
Environmental Sciences Chair David C. Locke Vice-Chair John L. Cusack
Geological Sciences Chair Gerald M. Friedman Vice-Chair Daniel Habib
History and Philosophy of Science Chair David Cassidy
Vice-Chair Bruce Chandler
Inorganic Chemistry and Catalytic Science Chair
Lynn Francescani Vice-Chair Robert Beer
Linguistics Chair Avraham Schweiger Vice-Chair F. Frank LeFever
Mathematics Chair Helen Strassberg
Vice-Chair David Seppala-Holtzman
Microbiology Chair Zigmund C. Kaminski Vice-Chair Stephen Morse
Neuroscience Chair Alcmene Chalazonitis Vice-Chair Marlene Schwanzel-Fukuda
Physics and Astronomy Chair Harry Sticker
Vice-Chair David W. Kraft
Polymer Science Chair li'alle Levon
Vice-Chair Abraham Ulman
Psychology Chair Joan Gay Snodgrass Vice-Chair Barbara A. Mowder
Science and Public Policy Chair Frank Fischer
Vice-Chair Maarten de Kadt
Science Education Chair John W. Fedors Vice-Chair Ellen Goldstein
Women in Science Chair Alice Deutsch Vice-Chair Frances Stern
Chairs and vice-chairs data are from [3].
2.5 Scientific Conferences
Each year the Academy sponsors 15-20 conferences where internationally recognized authorities from different specialities gather to discuss some of the most significant research being conducted today. The hallmark of an Academy conference is an interdisciplinary approach of the topics of common interest.
Some Academy conferences are held in Europe (e.g., Czech Republic, Hungary, Italy, etc.). It depends upon the initiative and organization possibilities of the members outside U.S.A.
Every country grapples with the goal of ensuring that science and technology are powerful engines of economic growth. The Academy has joined with the Federal Reserve Bank of New York to present a conference on Technology and Economic Development. The insights will be important worldwide.
Chinese People's University in Beijing, whose son was killed at Tiananmen Square, is compiling a book of names of those who were killed or injured in the incident. She has endured numerous house arrests and death threats by the Chinese government.
The Academy has long been a champion in the field of human rights. In May, the Academy teamed with Columbia University's Center for the Study of Society and Medicine to present a workshop for health professionals.
3 A Short Analysis of the NYAS Membership
For Spring 1995 a picture of the Academy's worldwide membership was provided [5]. With membership growing rapidly during the past three years, the Academy turned to Roper Strach Worldwide, a market research firm, to learn more about members and their views. The study confirms recent changes in the membership and examines members' views of current and prospective Academy programs and services.
Since 1989 the number of members outside the U.S. has risen significantly and now makes up about half the membership. The Academy's approximately 47000 members in some 160 countries around the world are distributed regionally as follows:
2.6 Human Rights
The Academy has an active human rights program, including identification of and protest against human rights abuses of scientists around the world. Andrei Sakharov and Fang Lizhi, who both made their first U.S. appearances at the Academy, are among the scientists who have credited the Academy with coordinating efforts that led to their release from exile.
In 1995, NYAS Human Rights of Scientists Award was imparted to Chinese Academics Xu Liangying and Ding Zilin in recognition of their roles in calling for the establishment of democratic law in China, and for their compassionate support of survivors and other dissidents involved in the Tiananmen Square incident. Xu, who drafted the petition, is a noted physicist and translator of Albert Einstein's works. Ding, a philosopher at the
Eastern Europe	7	percent
Western Europe	22	percent
Asia and the Pacific	12	percent
Africa and the Middle East	4	percent
South and Central America	4	percent
North America	51	percent
In addition, to the U.S., 15 countries, each, are home to 500 or more Academy members. Those countries in descending order are Japan, Germany, Russia, France, Canada, and Italy—with more than 1,000 members each—and Australia, the United Kingdom, Spain, Switzerland, Sweden, Brazil, Argentina, Israel, and Belgium.
The sample used for the survey was stratified into cells that reflect the regional distribution of the members, and it consisted of 3,500 members in 36 countries in each of which there are 70 or more members.
The avarage age of the Academy members is 55 years, with the mean age of members in the U.S. being somewhat higher than that in other parts of the world. The Asia-Pacific region, with the mean age 48, is the only area where the mean age is below 50. Overall, 15 percent of the members are female, but that segment ranges from 17 percent in the U.S. to only 3 percent in Western Europe.
According to the NYAS survey, nine of ten members are employed, and three out of four are employed full-time. Among those employed, the distribution in terms of setting is as follows:
College or university	34	percent
Industry or private business 19	percent
Medical school	13	percent
Hospital or clinic	10	percent
Government	9	percent
Private practice	7	percent
Independent consultant	6	percent
Nonprofit organization	5	percent
Other	2 percent
On the average, academy members belong to 4.5 science organizations in addition to the academy. The mean is highest in the U.S., at 4.8, and lowest in Eastern Europe, at 3.3. Overall, only 6 percent of the Academy members do not belong to any other science organization, with the range from only 3 percent in U.S. to 16 percent in Eastern Europe.
4 A Detailed Presentation of NYAS Publications
The NYAS regular publications are the Annals, the Sciences Magazine, and Focus—the member newsletter.
The proceedings of the Academy's conferences and selected other meetings are published in the Annals series. Each of the approximately 30 volumes published annually provides a sharply focused picture of current knowledge and advanced work. Out of the thousands of scientific publications worldwide, the Annals are ranked in the top two percent of those publications most cited by scientists.
Each book of the Annals is typically priced at $100 or more (some of them at $190), but members may select one volume each year at no charge, plus up to five addiiional volumes as only
$15 each. The Annals make invaluable additions to a personal library at savings that total many times the low annual dues ($105).
The Sciences is a unique bimonthly magazine of all the sciences which has twice won the National Magazine Award for General Excellence. Its highly readable articles have covered such fascinating topics as The Mother Tongue from which all languages derive, No Simple Slumber, exploring the enigma of sleep. Terminal Viruses, that attack computers, How Many People Can the Earth Support?, and so much more.
In the November/December issue (1995, pp. 36-39), the paper Ghost in the Machine is attractive. The cast of characters on the Internet now includes robotic impersonators as well as the multiple personalities of its human users. Like dreams and beasts, the computer stands on the margins of human life. It is a mind that is not yet a mind. It is an object, ultimately mechanism, but it acts, interacts and seems, in a certain sense, to know. After all, people also act, interact and seem to know, yet ultimately they are made of matter and programmed DNA. We think we can think. But can it think? Could it ever be said to be alive?
Each issue is beautifully illustrated with fine art that serves as an enlightening companion to the text. Individuals and libraries worldwide subscribe to The Sciences, but for the NYAS member the subscription is included in the membership.
Focus is the Academy's bimonthly member newsletter. Focus keeps members informed about upcoming Conferences and Annals, reports on Academy activities and programs, news about fellow Academy members and about stimulating events open to members. Focus helps to link a member to the Academy and to more than 47000 other members around the world—in 160 countries.
5 NYAS E-Mail and Internet Services
Currently, the ordinary e-mail to and from Academy is possible. Some of the topics are the following:
Conferences Corporate Membership
conferencetoyas.org corpmeniQnyas. org
Darwin Associates Development Education Focus
Human Rights Junior Academy Lyceum Club Membership Policy Programs Publications Public Relations Sections
darwinQnyas.org developmentflnyas.org education®nyas.org focus@nyas.org humanrightsSnyas.org jracademyQnyas.org lyceumSnyas.org membershipSnyas.org policyQnyas.org publicationsQnyas.org pubrelations®nyas.org
sectionsQnyas.org
The Academy has announced its presence on the Internet. Data are taken from [4].
6 Jhe NYAS 1996 Membership Directory
The NYAS announced the publication of New York Academy of Sciences 1966 Membership Directory. The 1996 edition will completely update the 1994 edition [2]. It will be an important library-quality reference volume for over 45,000 Academy members and will enable members to share information and ideas with their colleagues all over the world.
A complete member listings will include name, academic information, areas of speciality, job title, employer name, address and phone, fax and electronic mail numbers plus residence address and phone number. In addition, NYAS membership year and special Academy honors will also be included.
The directory will include three customized cross-reference sections to help someone to locate fellow members with ease:
-	The alphabetical section—members listed with a complete biographical profile as described above.
-	The Geographical Section—members listed alphabetically under country, city and state.
-	The Discipline/Specialty Section—members listed alphabetically under primary specialty categories.
In addition, the Directory will include helpful information about the Academy.
7	Possibilities Offered to Scientific Events and Communities
The NYAS publishes proceedings of conferences held under sponsorship other than its own in the Annals. If someone is organizing a multidiscipli-nary conference on a topic of current or impending scientific interest and wish to have the proceedings considered for publication in the Annals, he/she can send the proposal to the Executive Editor. The NYAS seeks conferences broad enough so that one topic may be examined under the lens of several disciplines, capturing a picture of a field at a critical point in its development. The Annals are offered to more than 47,000 members and distributed to 750 libraries and academic institutions worldwide. For further information contact Bill Boland, Executive Editor, the Annals, New York Academy of Sciences, 2 East 63rd Street, New York, NY 10021. .
8	A Middle-European View of NYAS Importance
Middle Europe has its Western (capitalist) and Eastern (postcommunist) Part. Especially for the last, it is important to catch the train of scientific and technological development. Why can NYAS play in this respect a significant role particularly in the individual development of scientists and engineers?
As mentioned before, the national academies in the Eastern Europe are elitist and financially privileged communities for the members and they lack national initiative and action for the development of science and technology (S&T), irrespective whether they have an international system of scientists' communication, annals and journals for S&T advancement, education of young and future scientists, a system for protecting the human rights of scientists, academies' international conferences embracing the most relevant fields of development, or not, and, last but not least, the foreign members of those academies do not have the same rights and privileges as the national members. In these academies, virtually, the nationalistic and privileged status of members is in the foreground.
NYAS offers an opposite alternative to national academies. As the reader can see from this report, it is an internationally wide open Academy, where the initiative of each of its members is welcome and appreciated. It is led by distinguished internationally recognized personalities and boards and accompanied by international conferences and scientific publications. NYAS offers a service for human rights, youth education program, and world-wide communication among scientists.
9 Local Clubs of NYAS
Currently, in Slovenia, there are 34 members of NYAS (November, 1995 [2]). In November, 1995, the initiative group for founding a Club was established. The program of the Club is expected to be regularly international, with foreign members and impact.
Members could meet once a month in a domestic and social environment. At the meetings, after a common dinner, the following topics could be discussed:
-	Presentation of a member's professional work, his/her scientific (research, engineering) activity and publications, special individual problems.
-	Scientific and company research activities in the Club country: national science politics, academies of sciences, concourses, projects, scientists, researchers.
-	International scientific activity: meetings with guest researchers (professors), academics, directors from U.S.A. and other parts of the globe.
-	Programs and activities for scientific education in basic and middle schools.
-	Publications of the Club.
-	The possibilities for organizing international conferences, meetings, institutions (American university, foundation) in the Club country.
10 The Active NYAS Members among the Editors, Referees, and Readers of Informatica
Because of the fast growing of the Academy's membership in the last year, it is not possible to get the data about the active members among the editors, referees, and editors of Informatica. The new Academy Membership Directory will be out of print at the end of May, 1996, and will include only membership data up to December, 1995. Although data should not become public in general, according to the agreement of members, an innerly identified group of the active Academy's members could agree on some common initiative concerning the Academy possibilities on one side, and the benefits of Informatica, its prestige, its citing by authors in other journals, and its distribution over the globe, on the other side.
So far, three authors of Informatica decided to express their NYAS membership publicly. The Editor-in-Chief hopes and recommends that other authors follow them. On the other hand, the Editor will appreciate if editors, referees, and readers of Informatica can transmit information to his email box (anton.p.zeleznikarQijs .si).
References
[1]	Invitation of the Board of Governors of the New York Academy of Sciences to Membership in the Academy, NYAS, New York 1995.
[2]	Directory of Members 1994, NYAS, New York 1994.
[3]	The Sciences 35 (1995) Number 5 (September/October, p. 4.
[4]	Focus August/September 1995, p. 5.
[5]	Focus October/November 1995, p. 1-2.
[6]	Catalog of the NYAS Annals August/September 1995, p. 5.
Compiled by J. Šlechta and A.P. Železnikar
Marquis Who's Who
in Science
and Engineering
1996-1997
Introduction
Who's Who in America was founded by A.N. Marquis in 1899.
Academics, scientists, engineers, and researchers in every field are advancing the knowledge each day. As the pace and scope of these advancements continue to grow, communication between scientists of all specializations becomes increasingly necessary. Who's Who in Science and Engineering is designed to facilitate such communication. It provides scientists and engineers, as well as others, the ability to identify researchers and achievers throughout the scientific community.
The Contents of Who's Who
Biographies of 24,000 such individuals will appear in the 3rd Edition of Who's Who in Science and Engineering. While the majority of listees come from the United States, many come from over 115 other nations.
The published biography becomes part of the permanent library collections of many of the world's most distinguished corporations, organizations, and institutions. Among those that have requested copies of the 3rd Edition immediately upon publication are: American Association for the Advancement of Science; Aree Associates, Inc.; Centers for Disease Control; Compuware Corporation; MacArthur Foundation; Lawrence Livermore Laboratory; Massachusetts Institute of Technology (M.I.T.); U.S. Department of Commerce, etc.
Many "vanity publishers" and "membership organizations" include biographical information in their publications regardless of reference-worthiness. In this respect. Marquis Who's Who is not to confuse with other publishers who charge
a fee for publishing a biography, or who require to purchase a book.
Each edition of Who's Who in Science and Engineering provides the most current and authoritative biographical information available on prominent individuals in the fields of science and engineering. The Geographic, Professional, and Awards indexes are additional features within this publication. The Geographic and Professional indexes group Biographees by country, state, city, as well as by field and specialty. Containing 16 distinct fields and over 100 specializations, these indexes provide quick and easy reference to all the biographies in the book. The new and expanded Awards Index lists over 300 award-granting agencies and over 500 awards bestowed upon approximately 1,800 scientists and engineers. This unique index enhances the reference value of Marquis Who's Who in Science and Engineering, and attests to Marquis' commitment to the principle of reference value within all Marquis publications.
Board of Advisors
The 3rd Edition was compiled with the assistance of the following distinguished individuals:
-	Hojjat Adeli, Professor of Civil Engineering, The Ohio State University;
-	William C. Anderson, Executive Director, American Academy of Environmental Engineers;
-	Robert F. Barnes, Executive Vice President, American Society of Agronomy;
-	Bruno A. Boley, Professor of Civil Engineering, Columbia University;
-	William L. Duax, Executive Vice President of Research, Medical Foundation of Buffalo, Inc.;
-	Herman Feshbach, Professor Emeritus, Massachusetts Institute of Technology;
-	Walter Freiberger, Chairman, University Committee on Statistical Science, Brown University;
-	Y. C. Fung, Professor of Bioengineering and Applied Mechanics, University of California, San Diego;
-	Pierre Galletti, Professor of Medical Science, Brown University;
-	Roland D. Glen, Former Director, Association of Consulting Chemists and Chemical Engineers, Inc.;
-	Bryan Gregor, Former Secretary, The Geo-chemical Society;
-	Michael Guillen, Science Editor, ABC-TV, Instructor, Core Curriculum Program, Harvard University;
-	Leonard C. Klein, Secretary, American Mi-crochemical Society;
-	Robert G. McKinnell, President, International Society of Differentiation, Inc.;
-	Frederick W. Oehme, Professor of Toxicology, Kansas State University;
-	Jeremiah P. Ostriker, Director, Princeton University Observatory;
-	Linda L Resnik, Executive Director, White House Conference on Library and Information Science; and
-	Charles Wert, Former Head, Department of Metallurgy and Mining Engineering, University of Illinois.
Biographees
According to the reference value of the Bi-ographee's outstanding achievements. Marquis Who's Who selects the biographical profile for inclusion in the forthcoming Edition. At this occasion, a Certificate of Recognition is commissioned. This certificate is absolutely free, the way Marquis is thanking for biographee's participation. Biographees may reserve their copy of Who's Who in Science and Engineering at a special savings ($60.00) available only to them. This pre-publication reservation is offered solely as a courtesy, without obligation.
Publication Date and Address
Schedulded publication date of the 3rd Edition is April, 1996.
The production order of the new 3rd Edition of Marquis Who's Who in Science and Engineering will be carefully estimated to fill the requirements of the institutions that rely on Marquis for accurate biographical reference materials—schools, libraries, and corporations, to name a few. Once the print run is established, the edition will be closed and will not be reprinted.
The address is:
Marquis Who's Who
121 Chanlon Road
New Providence, NJ 07974
A Request and Advice to the Editors and Authors of Informatica
As the Editor-in-Chief of Informatica is informed, the biographical profiles of some of the Editors of Informatica have been selected for inclusion in the forthcoming 3rd Edition of Marquis Who's Who in Science and Engineering (April 1996) and 14th Edition of Marquis Who's Who in the World (December 1996). The Editor-in-Chief oLInformatica expresses his deep respect with congratulations for such an achievement.
Further, the Editor invites the Members of the Editorial Board and Authors of Informatica to inform him (submitting a note via the e-mail) upon such nominations, selections, international academy and society memberships, professional and other awards, and other important achievements, for the sake of reference which is given to the local authorities in the competition for the Informatica financial support, and for the sake of the international recognition of individuals and the journal Informatica.
At the end of the year. Informatica will publish together with the Contents of the Volume the achievements of Informatica's Editors and Auth-
ors.
Compiled by A.P. Železnikar
Press Release
European Design & Test Conference and Exhibition 1996
The World's Favourite CAE Conference - With a Major International Exhibition
By the number of submissions, ED&TC 96 is the largest event of this type in the world! ED&TC has received a total of 396 regular submissions, from which an international review panel has selected 86 papers for inclusion in the IEEE Proceedings (27% acceptance rate) and 48 User's Forum papers.
The number of submissions has allowed to extend the traditional 4- track structure with a 5th track during a major part of the Conference resulting in 2 tracks on CAD, 1 on Design, 1 on Test and 1 for the User Forum. This year, there have been two major new trends:
1.	The number of papers on mixed/analog digital systems has gone up significantly. Such analog signals within a system may represent, for example, antenna signals of portable phones or sensor and actuator signal in automobile electronics. Thus, design automation for complete systems has to take analog signals into account. It is interesting to know that analog and mixed-signal designs have found special attention in Europe and at the ED&TC Conference.
2.	This year, for the very first time, ED&TC covers the design and design tools for microsystems. CAD tools for the design of microelectromechani-cal systems are emerging and papers will discuss how existing microelectronics CAD tools are to be extended and complemented to allow the design of these miniaturised inter- disciplinary components. The research on testing these components will also be addressed, as well as the boost to the market by foundries. Outstanding papers include contributions on CAD by TIMA, Grenoble in cooperation with T.U. Budapest and T.H. Darmstadt, and on fault modeling by Fh-G, Dresden.
The full programme is available at: http: //Is12-www.informatik.uni-dortmund.de /edtc/96/edtc.html
In addition, the Conference will include a number of innovative contributions to just recently in-
troduced areas:
1.	Hardware/software codesign is now moving from a rather broad overall view of the issues towards a more detailed view of particular solutions. For example, solutions on where software code should be stored in an embedded system in order to optimize the performance will be presented by Tomiyama and Yasuura from Kyushu University, Japan.
2.	Behavioural synthesis techniques (to a major extent originating from Europe) are now being introduced into industry. ED&TC will include a highly-ranked paper by R. Bergamashi et al. (IBM) on how to verify the results of these techniques.
3.	Low-power design continues to reduce the power consumption of microelectronics. New approaches for replacing standard-sized transistors by transistors optimized for low power consumption will be introduced by Coudert (Synop-sys Ine, USA).
4.	IDDQ-testing for CMOS devices is becoming a more mature topic. The Conference will include a highly-rated overview by T. Williams et al. (IBAd!) of what to expect in this area in the next ten years.
5.	With the trend towards more and more complex systems, built-in self test (BIST) techniques are increasingly used. In this area, papers on gate-delay testing and MCM testing have received high scores.
The Technical Programme will be introduced by 3 keynote speakers: M. Jacobs from SIEMENS, G. House from MENTOR GRAPHICS and W. Maly from Carnegie-Mellon University. They will discuss a worldwide perspective on the 3 main themes of the Conference. A European dimension will be brought by G. Metakides, Director of IT Programme at the European Commission, by chairing this keynote speakers session.
ED&TC will feature this year hot topic sessions on system aspects. Each session will include an invited talk and a round table. The topics selected for ED&TC 96 are: microprocessor design, broadband telecom and IDDQ testing in practice.
Once again, ED&TC will feature a User Forum in addition to the four tracks of "hot topics", CAD, Design and Test. The User Forum will bring the latest in industrial and academic experience in circuit design which, together with the Poster session, will allow for intensive discussion with authors. In addition, IEEE Design and Test is co- organising the Panel session on "Mixed-Signal Design & Test: Reality or Illusion-Do We Have the Right Tools?". There are two other panel sessions on the subjects "Submicron Design Tools; What Problems Can We Expect, and Who Will Supply the Tools?" and "DFT is not the Issue - Bad Design and Bad CAD is!". Tutorials will also be available on hot issues in design and test of ASICs and Systems.
Alongside the Conference is a full-scale, international exhibition, featuring the world's leading suppliers of CAE, ASICs, FPGAs PLDs, and Test program development software. A visit to the exhibition is a chance to meet technical specialists from the companies driving design technology. Inside the exhibition area will be a theatre running technical presentations from exhibiting companies, which will be free of charge to all visitors to the exhibition.
A last, but not least, touch is a gala dinner and a visit to the new Grand Louvre including explanations by the Museum guides.
ED&TC 96 is sponsored by EDAA, the IEEE Computer Society DATC and TTTC, and ACM/SIGDA, in cooperation with other European Societies. More information: C. Lopez-Barrio General Chair ED&TC 96 Tel.: -1-34 1 337 4310 Fax: +34 1 337 4212
P. Marwedel
Programme Chair ED&TC 96 Tel.: +49 231 755 6111 Fax: +49 231 755 6116
B. Courtois
EDAA Chair Tel.: +33 76 57 46 15 Fax: +33 76 47 38 14
J. Kenyon
Exhibition Chair Tel.: +44 171 404 0564 Fax: +44 171 831 2057
Machine Learning List
The Machine Learning List is moderated. Contributions should be relevant to the scientific study of machine learning. Mail contributions to ml@ics.uci.edu. Mail requests to be added or deleted to ml-request@ics.uci.edu. Back issues may be FTP'd from ics.uci.edu in pub/ml-list/V<X>/<N> or N.Z where X and N are the volume and number of the issue; ID: anonymous PASSWORD: <your mall address> URL- http://www.ics.uci.edu/AI/-ML/Machine-Learning.html
L. Eggermont
Vice General Chair ED&TC 96 Tel.: +31 40 784 961 Fax: +31 40 786 422
Errata
Due to an error, the name of James Geller slipped from the list of referees in Informatica Vol. 19 No. 4 at page 424 in Acknowledgements. We sincerely apologise.
ICCCN'96 CALL FOR PAPERS
FIFTH INTERNATIONAL CONFERENCE ON
COMPUTER COMMUNICATIONS AND NETWORKS
October 16 - 19, 1996
Double Tree Hotel, Rockville, Washington D. C., USA
Sponsored by DataTech and NASA. In cooperation with NSF,.NIST, USL, USC Communication, IEEE Computer Society, ACM, and IEEE Communication Society.
The objective of this conference is to provide an effective forum for original and fundamental advances in Computer Communications and Networks and to foster communication among researchers and practitioners working in a wide variety of scientific areas with a common interest in improving.Computer Communications and Networks.
Scope
The primary focus of the conference is on new and original research results in the areas of design, implementation and applications of Computer Communications and Networks. We solicit the submission of papers that address novel, challenging and innovative results. Therefore the topics that will be addressed include, but are not limited to:
ATM Networking, Internet Services/Applications, Distributed Multimedia Applications, Real Time Communications, Quality of Services (QoS) Issues, LAN/WAN Internetworking, Interoperability, Personal Communication Services, Network Management, Wireless Networks, Intelligent Networks, Multicast Protocols, Network Security, Optical Networks, Reliable Networks, High Speed Network OAM/Protocols, Video-on-Demand, Traffic Management, Multimedia Human-Machine Interface, Performance Mode-fing/Anaiysis, Communication Software, Protocol Verification/Validation/Testing
Submission
Authors are invited to submit complete and original papers. Papers that may be submitted for consideration include those that have not previ-
ously been published in another forum, or are not currently being published or reviewed by another journal or conference. All submitted papers will be refereed for quality, correctness, originality and relevance. The program committee reserves the right to accept a submission as long, short or poster presentation. Of particular interest are papers which address experiences with concrete Computer Communications and applications. All accepted papers will be pubHshed in the conference proceedings. Authors will be interested to know that special issues of journals containing outstanding papers from the conference are being planned.
Manuscripts should include an abstract and be limited to 5000 words. Submissions should include the title, author(s), author's affiliation, email address, fax number and postal address. In case of multiple authors, an indication of which author is responsible for correspondence and preparing the camera ready paper for the proceedings should also be included. Six copies of the manuscript should be submitted by Friday, March 22, 1996 to Program Co-Chair: Dr. David Lee AT&T Bell Laboratories 600 Mountain Avenue, RA/f 2C-423 Murray Hill, New Jersey 07974, USA lee@research.att.com Tel: (908) 582-5872; Fax:(908) 582-5857
For more information about the conference (as opposed to paper submissions) please send e-mail to ic3n@cacs.usl.edu.
Important Dates
Paper submission deadline: March 22, 1996 Notification of acceptance: May 30, 1996 Camera ready papers due: July 1, 1996
Workshops and Tutorials
Proposals are solicited for tutorials and workshops. Please send 3'our proposal by March 22, 1996 to Professor M. Singhal, the Tutorial Chair, Department of Computer and Information Science, The Ohio State University, 2015 Neil Ave, Columbus, OH 43210, USA, singhal@cis.ohio-state.edu, Tel: (614) 2925839, Fax: (614) 292-2911. To Professor K. Efe, the Workshop Chair, The Center for Advanced Computer Studies, The University of SW Louisiana, Lafayette, LA 70504, efe@cacs.usl.edu, Tel: (702) 482-6876, Fax: (702) 482-5791.
Steering Committee
I. Chlamtac-U. Mass, V. Li (chair) - USC, K. Ma-kki - UNLV, J.S. Meditch - U. Wash, E. K. Park -USNA, R. Pickholtz - GWU, N. Pissinou - CACS/ USL, T. Suda - UC Irvine, J.W. Wong - U. Waterloo
Conference Coordinators
C.	Fayet-INT/France, A. Gaivoronski-ITAL-TEL/Italy, C.-S. Kang-Hannam U./Korea, U. Krieger-DBP/Cermany, H. Saito-NTT/Japan
Publicity Co-Chairs
S. Bhattacharya-Iioneyvvell, Mallikarjun Tati-pamula-BNR
Local Arrangements
D.	Su-NIST
Treasurer
E.	K. Park-USNA
General Co-Chairs
K. Makki - UNLV and N. Pissinou - CACS/USL
Program Committee
S. Aggarwal-SUNY/Binghamton, C. Alaetti-noglu-USC/ISI, M. H. Ammar-GA/Tech, H. Arabnia-U. Georgia, M. Bayoumi-USL, K. Bhat-AT&T, A. Bush-NSF, K. Calvert-GA/Tech, S. T. Chanson-UBC, D. Cohen-Bellcore, T. S. Dillon-LaTrobe U., T. Y. Feng-NSF, O. Frieder (Co-Chair)-GMU, E. Geraniotis-DEEI, F. Golshani-ASU, M. G. Gouda-UT/Austin, Z. Haas-Cornell, M. Halem-NASA, J. Harms-U. Alberta, E. Hahne-AT&T, C.-S. Kang-Hannam U., I. Khan-Qualcomm, J. B. Kim-CSUN, D. Lee (Co-Chair)-AT&T, Y.-H. Lee-U. Florida, B. J. Leon-CACS/USL, F. J. Lin-Bellcore, M. T. Liu-OSU, W. Liu-BellSouth, W. M. Moh-SJSU, R. Miller-UMD, M. S. Obaidat-CUNY, S. Olariu-ODU, M. T. Ozsu- U. Alberta, W. Peng-SWTSU, S. Sahni-U. Florida, H. Saito-NTT, M.C. Shan-HP, U. Shankar-UMD, H. Sharif-UNL, A. Silberschatz-AT&T, V. Srinivasan-IBM, D. Su-NIST, Y. Zhang-Hughes
AIMSA'96
7th International Conference on Artificial Intelligence: Methodology, Systems, Applications Sozopol, Bulgaria, September 18 - 20, 1996 Call for Papers and Participation
General information
The AIMSA conference series has provided a biennial forum for presentation of AI research and development since 1984. The conferences, which are held in Bulgaria, cover the full range of AI, and are noted for their well established tradition of international scientific exchange between Central and Eastern Europe and the rest of the world.
AIMSA'96 is sponsored by ECCAI, European Coordinating Committee for Artificial Intelligence.
Topics of interest (including, but not limited to): Automated Reasoning, Cognitive Modeling, Genetic Algorithms, AI Languages and Architectures, AI Applications, Intelligent Decision Support Systems, Knowledge Acquisition, Knowledge-Based Systems, Knowledge Representation, Machine Learning, Natural Language Processing, Neural Networks, Robotics, Vision.
Submission of papers
Papers should be written in English and be printed on A4 paper (double-sided printing is encouraged), using high quality printers. Format should be single -spaced, 12 point type with no more than 50 lines per page and maximum 5000 words (10 pages). The first page of each copy of a submitted paper should contain title of the paper, names and addresses of all authors, an abstract (100-150 words) and a list of keywords. Electronic and fax submissions will not be considered.
The Proceedings of AIMSA-96 will be published by lOS Press and will be available at the conference.
Four copies of submitted papers must be received before 1 April 1996 by the Program Chair:
Allan Ramsay - AIMSA'96 Program Chair, Department of Language and Linguistics, UMIST, PO BOX 88, Manchester M60 IQD, United Kingdom, Email: allan@ccl.umist.ac.uk. Phone:
(44) (0) 161 200 3108 (direct), (44) (0) 161 2036 3311 (switch). Fax: (44) (0) 161 200 3099
Selection of papers
Each paper will be reviewed by at least two members of the program committee. Selection criteria include accuracy and originality of ideas, clarity and significance of results, quality of presentation.
At least one of the authors of an accepted paper is expected to present the paper at the conference.
The program committee will select one paper of exceptional quality among the accepted papers. The authors will receive the best paper award during the conference.
Program committee
Benedict du Boulay (UK), Pedro Meseguer (Spain), Peter Braspenning (The Netherlands), Ewa Orlowska (Poland), Ivan Bratko (Slovenia), Ivan Plander (Slovakia), Peter Brusilovsky (Russia), Allan Ramsay (UK, programme chair), Christo Dichev (Bulgaria), Ronan Reilly (Ireland), Danail Dochev (Bulgaria), Vassil Sgurev (Bulgaria), Altay Guvenir (Turkey), Mark Stickel (USA), Steffen Hoelldobler (Germany), Dan Tu-fis (Romania), Philippe Jorrand (France), Dimi-tar Vakarelov (Bulgaria), Vladimir Marik (Czech Republic), Tibor Vamos (Hungary), Alberto Martelli (Italy), David Young (UK)
Important dates
Submission deadline: 1 April, 1996 Notification of acceptance: 20 May, 1996 Deadline for final papers: 10 June, 1996 Conference: September 18 - 20, 1996
Organization
-	Bulgarian Artificial Intelligence Association
-	Institute for Information Technologies
-	Union of Bulgarian Mathematicians
with the support of the Bulgarian Society for Cognitive Science and of the Bulgarian Association for Pattern Recognition.
with frequent flights from and to Sofia (the capital city of Bulgaria). There are also direct flights connecting the major European cities with Sofia and Bourgas. Transportation between the conference location and the Bourgas airport will be provided.
Local arrangements
Christo Dichev - AIMSA'96, Institute for Information Technologies, Bl. 29A, Acad. G. Bon-chev St., 1113 SOFIA - BULGARIA, Email: cdichev@iinf.bg, Phone: (359) (2) 707 586, Fax: (359) (2) 720 497 http ://www.aubg.bg /faculty/cs/nikolaev/aimsa96.html
Language
The official language of the conference is English.
Joint event
Second Joint Conference on Knowledge-Based Software Engineering (JCKBSE'96), Sozopol, Bulgaria, 21-22 September, 1996.
The aim of JCKBSE'96 is to provide a forum for scientific interchange among scientists in Knowledge-Based Software Engineering mainly from Japan, the CIS countries and Bulgaria on a broad spectrum of academic research, implementation technologies and application systems.
s
Conference venue
Sozopol is a small ancient and picturesque town on the Black sea cost, famous with its architecture and art exhibitions. It is a favorite place for rest and entertainment to most Bulgarians and many foreigners. AIMSA-96 will take place in a holiday home near the beach and in a walking distance from downtown Sozopol.
Travel information
Sozopol is located 30 km south to Bourgas. Bourgas has an international and domestic airport
THE MINISTRY OF SCIENCE AND TECHNOLOGY OF THE REPUBLIC OF SLOVENIA
Address: Slovenska 50, 61000 Ljubljana, Tel.: +386
61 1311 107, Fax: +386 61 1324 140.
WWW:http://www.mzt.si
Minister: Prof. Rado Bohinc, Ph.D.
State Secretary for Int. Coop.: Rado Genorio, Ph.D.
State Secretary for Sci. and Tech.: Ciril Baškovič
Secretary General: Franc Hudej, Ph.D.
The Ministry also includes:
The Standards and Metrology Institute of the Republic of Slovenia
Address: Kotnikova 6, 61000 Ljubljana, Tel.: +386 61 1312 322, Fax: +386 61 314 882., and The Industrial Property Protection Office of the Republic of Slovenia
Address: Kotnikova 6, 61000 Ljubljana, Tel.: +386 61 1312 322, Fax: +386 61 318 983.
Scientific Research and Development Potential.
The statistical data for 1993 showed that there were 180 research and development institutions in Slovenia. Altogether, they employed 10,400 people, of whom 4,900 were researchers and 3,900 expert or technical staff.
In the past ten years, the number of researchers has almost doubled: the number of Ph.D. graduates increased from 1,100 to 1,565, while the number of M.Sc.'s rose from 650 to 1,029. The "Young Researchers" (i.e. postgraduate students) program has greatly helped towards revitalizing research. The average age of researchers has been brought down to 40, with one-fifth of them being younger than 29.
The table below shows the distribution of researchers according to educational level and sectors (in 1993):
Sector
Ph.D. M.Sc.
Business enterprises	51	196
Government	482	395
Private non-profit organizations	10	12
Higher education organizations	1022	426
Total	1,565	1,029
mainly small businesses. The shortfall was addressed by increased public-sector spending: its share of GDP nearly doubled from the mid-seventies to 0,86% in 1993.
Income of R&D organizations spent on R&D activities in 1993 (in million US$):
Sector	Total	Basic	App.	Exp.
		res.	res.	dev.
Business ent.	83,9	4,7	32,6	46,6
Government	58,4	16,1	21,5	20,8
Private non-p.	1,3	0,2	0,6	0,5
Higher edu.	40,9	24,2	8,7	8
Total	184,5	45,2	63,4	75,9
Financing Research and Development. Statistical estimates indicate that US$ 185 million (1,4% of GDP) was spent on research and development in Slovenia in 1993. More than half of this comes from public expenditure, mainly the state budget. In the last three years, R&D expenditure by business organizations has stagnated, a result of the current economic transition. This transition has led to the financial decline and increased insolvency of firms and companies. These cannot be replaced by the growing number of
The policy of the Slovene Government is to increase the percentage intended for R&D in its budget. The Science and Technology Council of the Republic of Slovenia is preparing the draft of a national research program (NRP). The government will harmonize the NRP with its general development policy, and submit it first to the parliamentary Committee for Science, Technology and Development and after that to the parliament. The parliament approves the NRP each year, thus setting the basis for deciding the level of public support for R&D.
The Ministry of Science and Technology is mainly a government institution responsible for controlling expenditure of the R&D budget, in compliance with the NRP and the criteria provided by the Law on Research Activities. The Ministry finances research or CO- finances development projects through public bidding, partially finances infrastructure research institutions (national institutes), while it directly finances management and top-level science.
The focal points of R&D policy in Slovenia are:
-	maintaining the high level and quality of research activities,
-	stimulating collaboration between research and industrial institutions,
-	(co)financing and tax assistance for companies engaged in technical development and other applied research projects,
-	research training and professional development of leading experts,
-	close involvement in international research and development projects,
-	establishing and operating facilities for the transfer of technology and experience.
JOŽEF STEFAN INSTITUTE
Jožef Stefan (1835-1893) was one of the most prominent physicists of the 19th century. Born to Slovene parents, he obtained his Ph.D. at Vienna University, where he was later Director of the Physics Institute, Vice-President of the Vienna Academy of Sciences and a member of several scientific institutions in Europe. Stefan explored many areas in hydrodijnamics, optics, acoustics, electricity, magnetism and the kinetic theory of gases. Among other things, he originated the law that the total radiation from a black body is proportional to the 4th power of its absolute temperature, known as the Stefan-Boltzmann law.
The Jožef Stefan Institute (JSI) is the leading independent scientific research in Slovenia, covering a broad spectrum of fundamental and applied research in the fields of physics, chemistry and biochemistry, electronics and information science, nuclear science technology, energy research and environmental science.
The Jožef Stefan Institute (JSI) is a research organisation for pure and applied research in the natural sciences and technology. Both are closely interconnected in research departments composed of different task teams. Emphasis in basic research is given to the development and education of young scientists, while applied research and development serve for the transfer of advanced knowledge, contributing to the development of the national economy and society in general.
At present the Institute, with a total of about 700 staff, has 500 researchers, about 250 of whom are postgraduates, over 200 of whom have doctorates (Ph.D.), and around 150 of whom have permanent professorships or temporary teaching assignments at the Universities.
In view of its activities and status, the JSI plays the role of a national institute, complementing the role of the universities and bridging the gap between basic science and applications.
Research at the JSI includes the following major fields: physics; chemistry; electronics, informatics and computer sciences; biochemistry; ecology; reactor technology; applied mathematics. Most of the activities are more or less closely connected to information sciences, in particular computer sciences, artificial intelligence, language and speech technologies, computer-aided design, computer architectures, biocybernetics and robotics, computer automation and control, professional electronics, digital communications and ne-
tworks, and applied mathematics.
The Institute is located in Ljubljana, the capital of the independent state of Slovenia (or S'v'nia). The capital today is considered a crossroad between East, West and Mediterranean Europe, offering excellent productive capabilities and solid business opportunities, with strong international connections. Ljubljana is connected to important centers such as Prague, Budapest, Vienna, Zagreb, Milan, Rome, Monaco, Nice, Bern and Munich, all within a radius of 600 km.
In the last year on the site of the Jožef Stefan Institute, the Technology park "Ljubljana" has been proposed as part of the national strategy for technological development to foster synergies between research and industry, to promote joint ventures between university bodies, research institutes and innovative industry, to act as an incubator for high-tech initiatives and to accelerate the development cycle of innovative products.
At the present time, part of the Institute is being reorganized into several high-tech units supported by and connected within the Technology park at the Jožef Stefan Institute, established as the beginning of a regional Technology park "Ljubljana". The project is being developed at a particularly historical moment, characterized by the process of state reorganisation, privatisation and private initiative. The national Technology Park will take the form of a shareholding company and will host an independent venture-capital institution.
The promoters and operational entities of the project are the Republic of Slovenia, Ministry of Science and Technology and the Jožef Stefan Institute. The framework of the operation also includes the University of Ljubljana, the National Institute of Chemistry, the Institute for Electronics and Vacuum Technology and the Institute for Materials and Construction Research among others. In addition, the project is supported by the Ministry of Economic Relations and Development, the National Chamber of Economy and the City of Ljubljana.
Jožef Stefan Institute
Jamova 39, 61000 Ljubljana, Slovenia
Tel.:-f386 61 1773 900, Fax.:-f386 61 219 385
Tlx.;31 296 JOSTIN SI
WWW: http://www.ijs.si
E-mail: matjaz.gams@ijs.si
Contact person for the Park: Iztok Lesjak, M.Sc.
Public relations: Natalija Polenec
INFORMATICA
AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS
INVITATION, COOPERATION
Submissions and Refereeing
Please submit three copies of the manuscript with good copies of the figures and photographs to one of the editors from the Editorial Board or to the Contact Person. At least two referees outside the author's country will examine it, and they are invited to make as many remarks as possible directly on the manuscript, from typing errors to global philosophical disagreements. The chosen editor will send the author copies with remarks. If the paper is accepted, the editor will also send copies to the Contact Person. The Executive Board will inform the author that the paper has been accepted, in which case it will be published within one year of receipt of e-mails with the text in Informatica I^TßX format and figures in .eps format. The original figures can also be sent on separate sheets. Style and examples of papers can be obtained by e-mail from the Contact Person or from FTP or WWW (see the last page of Informatica).
Opinions, news, calls for conferences, calls for papers, etc. should be sent directly to the Contact Person.
QUESTIONNAIRE
Send Informatica free of charge Yes, we subscribe
Please, complete the order form and send it to Dr. Rudi Murn, Informatica, Institut Jožef Stefan, Jamova 39, 61111 Ljubljana, Slovenia.
Since 1977, Informatica has been a major Slovenian scientific journal of computing and informatics, including telecommunications, automation and other related areas. In its 16th year (more than two years ago) it became truly international, although it still remains connected to Central Europe. The basic aim of Informatica is to impose intellectual values (science, engineering) in a distributed organisation.
Informatica is a journal primarily covering the European computer science and informatics community -scientific and educational as well as technical, commercial and industrial. Its basic aim is to enhance communications between different European structures on the basis of equal rights and international refereeing. It publishes scientific papers accepted by at least two referees outside the author's country. In addition, it contains information about conferences, opinions, critical examinations of existing publications and news. Finally, major practical achievements and innovations in the computer and information industry are presented through commercial publications as well as through independent evaluations.
Editing and refereeing are distributed. Each editor can conduct the refereeing process by appointing two new referees or referees from the Board of Referees or Editorial Board. Referees should not be from the author's country. If new referees are appointed, their names will appear in the Refereeing Board.
Informatica is free of charge for major scientific, educational and governmental institutions. Others should subscribe (see the last page of Informatica).
ORDER FORM - INFORMATICA
Name: ............................................. Office Address and Telephone (optional):
Title and Profession (optional) : ............................................................
.................................................... E-mail Address (optional): .............
Home Address and Telephone (optional): ...........
.................................................... Signature and Date: ...................
Referees:
Witold Abramowicz, David Abramson, Kenneth Aizawa, Alan Aliu, John Anderson, Catriel Beeri, Fevzi Belli, Istvan Berkeley, Azer Bestavros, Balaji Bharadwaj, Jacek Blazevvicz, Laszlo Boeszoermenyi, Ivan Bratko, Jerzy Brzezinski, Marian Bubak, Leslie Burkholder, Frada Burstein, Wojciech Buszkowski, Ryszard Choras, Jason Ceddia, Wojciech Chybowski, Andrzej Ciepielewski, Vic Ciesielski, David ClifF, Travis Craig, Tadeusz Czachorski, Sait Dogru, Georg Dorfner, Maciej Drozdowski, Marek Druzdzel, Hesham El-Rewini, Pierre Flener, Terrence Forgarty, Hugo de Garis, Eugeniusz Gatnar, James Geller, Janusz Gorski, Georg Gottlob, David Green, Herbert Groiss, Inman Harvey, Elke Hochmueller, Rod Howell, Tomas Hruška, Ryszard Jakubowski, Piotr Jedrzejowicz, Eric Johnson, Li-Shan Kang, Roland Kaschek, Jan Kniat, Stavros Kokkotos, Kevin Korb, Gilad Koren, Henryk Krawczyk, Ben Kroese, Zbyszko Krolikowski, Benjamin Kuipers, Matjaž Kukar, Aarre Laakso, Phil Laplante, Bud Lawson, Ulrike Leopold-Wildburger, Joseph Y-T. Leung, Raymond Lister, Doug Locke, Jason Lowder, Andrzej Marciniak, Witold Marciszewski, Vladimir Marik, Jacek Martinek, Tomasz Maruszewski, Florian Matthes, Timothy Menzies, Dieter Merkl, Zbigniew Michalevvicz, Roland Mittermeir, Madhav Moganti, Tadeusz Morzy, Daniel Mosse, John Mueller, Hari Narayanan, Jerzy Nogiec, Stefano Nolfl, Tadeusz Pankowski, Warren Persons, Niki Pissinou, Gustav Pomberger, James Pomykalski, Gary Preckshot, Ke Qiu, Michael Quinn, Gerald Quirchmayer, Luc de Raedt, Ewaryst Rafajlowicz, Wolf Rauch, Peter Rechenberg, Felix Redmill, David Robertson, Marko Robnik, A.S.M. Sajeev, Bo Sanden, Iztok Savnik, Wolfgang Schreiner, Guenter Schmidt, Heinz Schmidt, William Spears, Przemyslaw Stpiczynski, Maciej Stroinski, Tomasz Szmuc, Zahir Tari, Jurij Tasič, Piotr Teczynski, Ken Tindell, A Min Tjoa, Wieslaw Traczyk, Marek Tudruj, Andrzej Urbanski, Kanonkluk Vanapipat, Zyunt Vetulani, Olivier de Vel, John Weckert, Gerhard Widmer, Stefan Wrobel, Janusz Zalewski, Yanchun Zhang
a Knowledge Archives (A Plan 1992; see also Železnikar 1993) and solving the problem of Automatic Knowledge Acquisition (Wah et al. 1993). The third task is the creation of the Electronic Library and Librarian; an Electronic Librarian should be able to analyze the contents of arbitrary discourses (from text-books, scientific papers, legal sources, etc.) in order to satisfy diverse requests of end users from U.S.A. and many other countries (Wah et al. 1993).
These tasks^ reflecting the gradual transition to the informational orientation in computer science pose the problem of creating natural-language-processing systems (NLPSs) capable to analyze practically arbitrary texts pertaining to very many application domains. Widely-applicable NLPSs will be highly complicated computer systems. For effective designing complex technical systems of many other kinds, one uses special mathematical theories, e.g. airdynamics in case of airplanes design and hydrodynamics for constructing ships. Analogically, we need to work up a mathematical theory of natural language (NL) use, or mathematical theory of NL-communication, or mathematical linguocyberne-tics (MLC) for speeding-up the progress in designing NLPSs. Such a theory should be a collection of interrelated mathematical models destined for helping creators of NLPSs to overcome difficulties of logical character encountered in their work (Fomitchov 1983, 1984; Fomichov 1988, 1992, 1993a).
During more than twenty years passing after publishing the known pioneer works of R. Montague on formal semantics of NL, a number of other approaches to the mathematical study of NL-semantics has been suggested. The set of these approaches includes, in particular. Theory of Generalized Quantifiers, Situation Semantics, Discourse Representation Theory (DRT), Dynamic Montague Grammar, Dynamic Predicate Logic, and the works of M.J. Cresswell and G. Chi-erchia on structured meanings(SMs) of sentences. It is shown in (Fomichov 1993a, 1993b) that these approaches don't provide a sound initial basis for developing MLC. The main reasons for this con-
'This paper is a private author's work and no part of it may be used, reproduced or translated in any manner whatsoever without written permission of the author except in the case of brief quotations embodied in critical articles.
elusion are that these approaches don't study the problem of describing conceptual structures of arbitrary real NL-discourses pertaining to science, business, medicine, law, etc. and don't investigate suificiently deeply the role of knowledge about the world in NL use.
In particular, it is difficult not to agree with the opinion of Ahrenberg (1992, p. 7) that "in spite of its name, DRT can basically be described as formal semantics for short sentence sequences rather than as a theory of discourse". This opinion seems to be true also with respect to the content of the monograph (Kamp & Reyle 1993).
Like mentioned above approaches. Situation Theory (Aczel et al. 1993; Barwise 1989, 1992, 1993; Barwise k Cooper 1993; Barwise & Etche-mendy 1990; Cooper 1991; Devlin 1991), Theory of Situation Schemata (Fenstad, Langholm, Ve-stre 1992), Categorial Semantics (see van Ben-them 1992) study only some separate aspects of NL-semantics and don't raise the problem of investigating SMs of complicated real discourses and NL use as a whole.
On the contrary, Integral Formal Semantics (IFS) represented, in particular, (in Fomichov 1981, 1982; Fomitchov 1983, 1984; Fomichov 1988, 1992, 1993a, 1993b, 1994) from the very beginning was aimed at studying general regularities of NL use. One of the main principles of IFS is as follows: the basis of researches aimed at constructing a mathematical theory of NL use is to be a problem-independent formal model reflecting many peculiarities of semantic (or conceptual) structures of sentences and discourses of arbitrary great length and providing a description of some class of formal languages convenient for building semantic representations (SRs) of NL-texts in a large spectrum of applications and on different levels of representation. Let's call such a model a widely-applicable Metagrammar of Conceptual Structures, or a widely-applicable Conceptual Metagrammar (CM).
A widely-applicable CM! should enable us to build formal semantic analogues of sentences and discourses; hence the expressive power of formal languages determined by the model may be very close to the expressive power of NL (if we take into account the surface semantic structure of NL-texts). Besides, a CM is to be convenient for describing various knowledge about the world (Fo-
EDITORIAL BOARDS, PUBLISHING COUNCIL
Informatica is a journal primarily covering the European computer science and informatics community; scientific and educational as well as technical, commercial and industrial. Its basic aim is to enhance communications between different European structures on the basis of equal rights and international refereeing. It publishes scientific papers accepted by at least two referees outside the author's country. In addition, it contains information about conferences, opinions, critical examinations of existing publications and news. Finally, major practical achievements and innovations in the computer and information industry are presented through commercial publications as well as through independent evaluations.
Editing and refereeing are distributed. Each editor from the Editorial Board can conduct the refereeing process by appointing two new referees or referees from the Board of Referees or Editorial Board. Referees should not be from the author's country. If new referees are appointed, their names will appear in the Refereeing Board. Each paper bears the name of the editor who appointed the referees. Each editor can propose new members for the Editorial Board or Board of Referees. Editors and referees inactive for a longer period can be automatically replaced. Changes in the Editorial Board and Board of Referees are confirmed by the Executive Editors.
The coordination necessary is made through the Executive Editors who examine the reviews, sort the accepted articles and maintain appropriate international distribution. The Executive Board is appointed by the Society Informatika. Informatica is partially supported by the Slovenian Ministry of Science and Technology.
Each author is guaranteed to receive the reviews of his article. When accepted, publication in Informatica is guaranteed in less than one year after the Executive Editors receive the corrected version of the article.
Executive Editor — Editor in Chief
Anton P. Železnikar
Volaričeva 8, Ljubljana, Slovenia
E-mail: anton.p.zeleznikar@ijs.si
Executive Associate Editor (Contact Person)
Matjaž Gams, Jožef Stefan Institute
Jamova 39, 61000 Ljubljana, Slovenia
Phone: 4-386 61 1773 900, Fax: -f386 61 219 385
E-mail: matjaz.gamsQijs.si
WWW: http : //www2. i j s. si/"mezi/mat j az. html
Executive Associate Editor (Technical Editor)
Rudi Murn, Jožef Stefan Institute
Publishing Council: Tomaž Banovec, Ciril Baškovič, Andrej Jerman-Blažič, Jožko Čuk, Dagmar Šuster, Jernej Virant
Board of Advisors:
Ivan Bratko, Marko Jagodic,
Tomaž Pisanski, Stanko Strmčnik
Editorial Board
Suad Alagić (Bosnia and Herzegovina) Shuo Bai (China) Vladimir Batagelj (Slovenia) Francesco Bergadano (Italy) Leon Birnbaum (Romania) Marco Botta (Italy) Pavel Brazdil (Portugal) Andrej Brodnik (Canada) Janusz Brozyna (France) Ivan Bruha (Canada) Luca Console (Italy) Hubert L. Dreyfus (USA) Jozo Dujmović (USA) Johann Eder (Austria) Vladimir Fomichov (Russia) Janez Grad (Slovenia) Noel Heather (UK) Francis Heylighen (Belgium) Bogomir Horvat (Slovenia) Hiroaki Kitano (Japan) Stavros Kokkotos (Greece) Sylva Kočkova (Czech Republic) Miroslav Kubat (Austria) Jean-Pierre Laurent (France) Jadran Lenarčič (Slovenia) Magoroh Maruyama (Japan) Angelo Montanari (Italy) Ramon L. de Mantaras (Spain) Igor Mozetič (Austria) Stephen Muggleton (UK) -Pavol Navrat (Slovakia) Jerzy R. Nawrocki (Poland) Marcin Paprzycki (USA) Oliver Popov (Macedonia) Sašo Prešern (Slovenia) Luc De Raedt (Belgium) Jean Ramaekers (Belgium) Paranandi Rao (India) Giacomo Della Riccia (Italy) Wilhelm Rossak (USA) Claude Sammut (Australia) Johannes Schwinn (Germany) Jin Šlechta (UK) Branko Souček (Italy) Harald Stadlbauer (Austria) 'Oliviero Stock (Italy)
Gheorghe Tecuci (USA), Robert Trappl (Austria) Terry Winograd (USA), Claes Wohlin (Sweden) Stefan Wrobel (Germany), Xindong Wu (Australia)
An International Journal of Computing and Informatics Gontents:
Profile: V.A. Fomichov
A Mathematical Model for Describing Structured Items of Conceptual Level
A Logic for Conditional Recommendation
Rule-Generating Abduction for Recursive Prolog
Bottom-up Layout Generation
Informational Frames and Gestalts
A Distributed Client/Server Benchmark for Network Performance Measurement
Petri Nets and Self-Stabilization of Communication Protocols
Interface Methods: A Means for the Integration of Inheritance in a Concurrent OOP Language
The Fringe Bucket-Quadtree, a Robust and. Efficient Spatial Data Structure with a High Population Density
V.A. Fomichov
S.K. Das P. Hammond
K. Hirata
W. Hower
A.P. Železnikar
J. Dujmović R. Kinicki S. Subbanna
W. Peng K. Makki
A. Poggi
R. Maelbranckè H. Olivié
33
43 57 65 95
113 125 135
Reports and Announcements
143