f f f f i n f 0 r m a t i f: a 2
YU ISSN 0350-5596
L
Časopis Izdaja Slovensko društvo Informatika, BlOOO Ljubljana, Parmova 41, Jugoslav i ja
ČASOPIS 2A TEHNOLOGIJO RAČUNALNIŠTVA
IN PROBLEME INFORMATIKE
ČASOPIS ZA RAČUNARSKU TEHNOLOGIJU I
PROBLEME INFORMATIKE
SPISANIE ZA TEHNOLOGIJA NA SMETANJETO
I PROBLEMI OD OBLASTA NA INFORMATIKATA
Uredniški odbor:
T. Alekslć, Beograd; D. Bitrakov, Skopje;	P,
Dragojlovic, Rijeka; S. Hodžar, Ljubljana;	B.
Horvat, Maribor; A. MandSiö, Sarajevo;	S. Mlhalie, Varaždin; S. Turk, Zagreb
YU ISSN 0350^596
LETNIK 12,1988-ŠT. 2
Glavni in odgovorni urednik: prof. dr. Anton P. 2eleznlkar
- . . ' -J	- A V i
Tehnični «urednik: dr. Rudolf Murn
ZaložnfSki svet:	,,
T. Banovee, Zavod SR.Slovenije za_statisti ko, Voearski pot 12, 61000 Ljubi jahši;
A.	Jerman-Blažt e'. Iskra Telematika, Kardeljeva
plosead 24, SlOOO LJubljana; .
B.	KlemenCiö, Iskra Telematika, 64000 Kranj;
S. Saksida, institut za sociòlogijo Univerze Edvarda Kardelja,' 61000 Ljubljana; , .
J. VirTnt, Fakulteta zat elektrotehniko, Trzaäka 25. G1Đ00 Ljubljana. ,-
Uredništvo in uprava;
Časopis Informatika, Iskra Delta,' Stegne Ì5B, 61000 Ljubljana, telefon (061) 574 554; teleks 31366-;:YU f Delta.	,
Letna naroönina za delovne organizacije znaša 24000 din, za zasebne naročnike 6000 din, za Studente 2000 din; posamezna Številka 8000 din.
Številka 2iro računa: 50101-678-51841
Pri fInane i ranju ćasopi sa sodeluj e Razi skovalna skupnost Slovenije - ; .
Na podlagi mnenja RepublTskega komiteja za informiranje fit. 23-85, z dne 29. 1. 1986, je Časopis oproščen temeljnega davka od prometa proizvodov.
Printed Tiskarna Kresija, Ljubljana, in March, 1988
v s E B INA
A.P.Zeleznikar 3 O razvojni potudi raftunalni-
äke-iindustrije M. Cams	12 Approaching the Linit of
M. DrobniO .	Classification Accuracy
N. Cuid B. Zalik .
18 Standards in Computer Graphics	-	.
J, Györkös	24 A Survey of Host Important
I, Rozman	and Outstanding Methods ■for
Tatjana Welzer - Software Engineering ;
A.P.Zeleznikar 31 Problems of the Rational Understanding of Inforoation L, M, Patnaik 47 A Critique on Parallel Compu-R.Goviiidarajan ter Architecture M. Spegel
J. Silc	. ' ;,
D. Surla
Nonika K^apus-Kolar
A. M. Jenkins J. V. Carl is
65 The Relation between a Point
and a.Simple Polyhedron 69 Deriving Protocols from Services In the Finite State Machine Representation 76 Control Flowcharting for Data Driven Systems
P.	Kolbessen 83 Paralelni vektorski procescr-
S,	Mavrió	ji in njihova uporaba I
B.	Mlhoviloviä
■V.	Vouk	94 Veöopravilno okolje za delo v
J.	Ferbežar	realnem Času na rafiunalniku
A.	Brodnik	IBM-PC	'
I. Kononeako 102 Terminologija programiranja v Nada Lavraù	Prologu
J, JlugelJ 107 Sinhronizacija v porazdeljenih računalniških sistemih L, Vuga	114 Matematično sodilo o možnosti
strojne inteligence
info
Published by Informatika, Slovene-SoaTèty for' Informatics, Parmova 41, 61000 Ljubljana, Yugoslavia
Editorial Bqarđ„. : _
T. Alckstö, Beograd} D. Bitrakov, Skopje;	P.
Dragojlovlö, Rijeka; S. HodSar, Ljubljana;	B,
Horvat, Maribor; A. MandZiCi, .Sarajevo;^	S. MihaliC, Varaždin; S. Turk, Zagreb
JOURNAL OF COMPUTING AND INFORMATICS
YU ISSN 0350-5596
VOLUME 12,1988-No. 2
Edi tor-ln-Chief : Prof. Dr. Anton P. Zeleznikar
Executive Editor': ' ' '	Dr. Rudolf- Mu'rn"	: •
Publishing Council:
T. Banovec, Zavod SR Slovenije za statistiko, Vožarskt pot.12, 61000 Ljubljana;
A-.-'Jerinan-BlaZi«, Iskra Te lema tika, Kardeljeva plosead 24 61000^ Ljubi Jana ;	■ .. v
B..Klenienöiö, Iskra Telematlka, 64000 Kranj;' '
S..,Saksida, Institut za- sociologi jo Univerze Edvarda Kardelja,: 61000 Ljubljana
J. Virant, Fakulteta za elektrotehniko, Tr2aska-25, 61000 Ljubljana.
.. .. . /" .Headquarters^; ..
Informatica, ; . Iskra Deità, Stegne 15B, 6TÖ0Ö"" Ljubljana, Yugoslavia. Phoné; 61 57 45 54.
Telex; 31366 yu' delta . " "
Annual Subscription ;Rate: "uSi 30 companies, and US$ 15'for Individuals."
for
Opinions Expressed in the Vontributions are .not' necessarily shared by the,Ed i tor iaTBoard
Tisk: Tiskarna Kresijà, Ljubljana, v marcu 1988 •
CONTENTS
A,P.Zeleznikar	3 On the Developmental Inicia-
tive of the Computer Industry' M, Gams	iz Approaching the Limit of
M. Drobniß	Classification Accuracy
N, Guid
B.	ZAlik
18 Standards in Computer Graphics
J. Gyärkäs	24 A Survey of Most Important
I. Rozman	and Outstanding Methods for
Tatjana Welzer Software Engineering
A . P. Zeleznikar 31 Problema of the Ratibnaì Un-^
derstariding of Information L. M. Patnaik 47 A Critique on Parallel Compu-R.GovlndaraJan ter Architecture M. Spegel	T
J. Silt:,
D. Surla
Monika Kapüa-Xolar
A, M. Jenkins J. V. Carl is
65 The Relation between a Point
arid a Simple Polyhedron 69 Deriving Protocols' from Services in the Finite State Machine Representation 76 Control Flowcharting for Data Driven Systems
P, Kolbezen 83 Parallel Array Processors and S. Mavrió	Their Application I
B, Mihovilovid
V.'Vouk J. Ferbežar A. Brodnik
94 Multitasking Real-time Environment "for the IBH-PC'
I. Kononenko 102 Terrainology of Prolog Pro-Nada Lavrdä	graiitining
J. BugelJ	107 Synchronization in Distribu-
ted Computing Systems I. Vüga	114 A Mathematical Judgement on '
possibility of Machine Intelligence
o RAZVOJNI POBUDI RAČUNALNIŠKE INDUSTRIJEV	INFORMATICA 2/88
Anton P. Železnikar
UDK 681.3.001.2	Iskra Delta
Ta tlanek je avtorjev govor ob otvoritvi Deltinega razvojno-proizvod-nega centra v Iskrini tehnološki dolini v Stegnah pri Ljubljani. Članek poudarja vefikot tridesetletno zamisel avtentične strukture in organizacije računalniške tovarne, ki nI le privesek neke raznorodne, računalništvu tuje industrije, temveC je sodobna proizvodna zmogljivost, ki obsega dovolj širok in tehnoloèko zahteven računalniški proizvodni program, Ta proizvodnja naj bi bila podprta s sodobno zamišljenimi raziskavami in razvojem računalniških in informacijskih sistemov. Ze danes je Iskra Delta povezana z raziskovalnimi, razvojnimi in proizvodnimi srediSči doma in na tujem. V prihodnje naj bi bila nacionalna računalniška industrija tudi deleSna posebne skrbi in podpore vlade, industrije in akademije. Iskra Delta sproža tudi že novo industrijsko raziskovalno pobudo za sodobne tehnoloSke raziskave na področju računalniške znanosti, tehnologije in uporabe. Pod okrlljetn Slovenskega društva Informatika je bil ustanovljen t.i. Forum informationis, ki ga sestavljajo najvidnejši računalniški strokovnjaki in novinarji, z namenom, da bi lahko nudil javno in kritično podporo hitro rastoči domači računalniški industriji. Na koncu članka je sporočen poziv vodstvu in inženirjem Iskre Delte, ki naj bi pospešil intelektualno motivacijo in ustrezno poklicno reagiranje inženirjev.
On the Developmental Iniciative of the Computer Industry.
This paper presenta the opening speech, held by the author, at the occasion of the opening ceremony of the Iskra Delta's Researoh, Developmen t, and Manufacturing Center at the so-called Iskra Technology Valley in Stegne, near Ljubljana. The paper apostrophizes more than the thirty-years growing concept of authentic structural and organizational principles of a computer factory, which is not only an appendix of a heterogenous, to the field of computing strange industry, but is a modern manufacturing facility embracing sufficiently broad and technologically pretentious computer manufacturing program. This manufacturing should be supported by contemporarily imagined research and development of computer and information systems. Already today, Iskra Delta is we 11-connected with research, development and manufacturing places in the country and abroad. In future, the national computer industry should receive the necessary care and support from governmental authorities, industry and acaderaia. Practically, Iskra Delta is also giving the so-called industrial research initiative for modern technological and conceptional research in the area of computer science, computer technology, and to other computer-related application. In the framework of the Slovene Society for Informatics the so-called Forum Informationis, uniting the most distinguished and experienced computer professionals and journalists was established, to offer the public and critical support to the fast growing domestic computer industry. At the end of the article, a call to the managerial and professionally diverse engjneering staff of Iskra Delta is communicated, to rise the necessary intellectual motivation and the corresponding professional reaction.
Slika 1 IzgotaJ). Na sliki je prikazan izgled proizvodno-razvojnega objekta Iskre Delte med izgradnjo. Sprednji del objekta (levo) je v glavnem namenjen raziskovalno-razvo jnemu delu, zadnji del (desno) pa pretežno proizvodnemu delu. V kletnih etažah se nahajajo skladiSöa, na vrhu zadaj pa infrastrukturne enote (kuhinja, menza, ambulanta, predavalnica).
Slika 2 (spodaj). Ta slika ka2e proizvodna montažo in preizkušanje vefiproceaorskih mini-raäunalniä kih sistemov Gemini in Delta 4860, ko ae njihova izdelava te približuje konfini fazi. To so znani in že uveljavljeni računalniški sistemi Iskre Delte iz razreda super miniraöu-nalnikov, z lastnimi modutakimi aparaturniroi in prog ras kimi komponentami.
Slika 3 (desno). Pri izdelavi modulov na tiskanih vezjih, ki je sicer avtomatizirana, so večkrat potrebne tudi dodatne ali korekturne ročne operacije. Tako se moduli	dopolnjujejo	oziroma
popravljajo. Vse to pa zahteva vestno oziroma natančno delo.
Slika 4 f apoda jV. Tiskano vezje z vstavljenimi in elektriöno povezanimi elementi se preizkuäa z uporabo posebne preizkuSevalne naprave, ki opravlja meritve in a tem preverja avtomatično ustreznost vezja. To je visoko učinkovito diagnosticiranje modula na tiskanem vezju, torej ugotavljanje kritičnih odstopov.
SJlira 5 (zgoraj). Na aliki je prikazana proizvodnja mikroraCunalniS hih sistemov Triglav (stolpna ali obmizna izvedba) s 16-bitniroi procesorji Jll, M68010 in I80286. Te konfiguracije je mogoCe raznovrstno dopolnjevati z disketnimi, diskovnimi in trafinini enotami.
Slika 6 (desno). Namizno izvedba mikroraćunainiskih sistemov Triglav je namenjena njihovi uporabi s funkcijo visoko zmogljivih delovnih poataj za različne namene (CAD/CAM, grafika za različne uporabne naloge, komunikacijske mre2e itd.). Sistemi Triglav uporabljajo lastne, tj. domaČe module s tremi razliónimi mi k roproce Bor ji na vodilu VME. To vodilo je industrijski standard. Tudi operacijski sistemi za računalnike družine Triglava so različni in omogočajo uporabo najSirSega spektra uporabniških programov. Ti óperacijski sistemi so Delta-M, Xenix, Unix in MS-DOS. ■

Some of the riskiest work we do is concerned with altering organization structures. Emotions run wild and almost everyone feels threatened. Why should they be? The answer is that if companies do not have strong notions of themselves, as reflected in their values, stories, myths, and legends, people's onJ.y security comes from where they live on the organization chart. Threaten that, and in the absence of some grander corporate purpose, you have threatened the closest thing they have to meaning in their business li ves.
T.J. Peters and R.H. Waterman; In Search of Exellence, p. 77 (Harper & Row, 1981)
Zamisel, ki stno Jo Jugoslovanski računalniški inženirji nosili vefi kot trideset let^ v svojem hotenju in znanju, se v teh dneh materialno uresničuje. Pred nami je prava in na lastnih tehnoloških in razvojnih temeljih/ na dolgotrajni avtentični zamisli' zgrajena in organizirana tovarna računalnikov, ki ni le privesek neke drugorodne, računalništvu mačehovske ali tuje industrije, temveč je sodobna proizvodna zmogljivost, ki obsega dovolj airok in tehnološko zahteven računalniški proizvodni program. Ta nova zmogljivost naj bi se podpirala tudi s sodobno koncipiranim! raziskavami in razvojem, ki naj bi zagotavljale novonastajajoče računalniške produkte, tj. informacijske in računalniške sisteme za svetovno konkurenčni plasma in
' Slavnostni govor ob otvoritvi proizvodno-razvojnega srediSCa Iskre Delte v Stegnah, dne 4. decembra 1987.
' Zamisli o oblikovanju slovenske računalniške industrije so nastajale že v drugi polovici 50-ih let. Na Institutu JoZefa Stefana so potekala interdisciplinarna predavanja, s katerimi se je de facto pripravljal tudi razvoj računalnika, ki naj bi ga proizvajala domača industrija. V 60-ih letih Je ta zamisel ugaäala, saj se je na Zavodu za avtomatizacijo pojavila licenčna montaža (neke vrste proizvodnja) računalnika Zuse Z23.
' Avtentična zamisel računalniške tovarne pomeni med drugim na lastnih možnostih in izkušnjah strukturirano in organizirano proizvodnjo, podprto s sodobno (lastno) računalniško avtomatizacijo tipa CIM (Computer Integrated Manufacturing) in tipa CIC (Computer Integrated Communication) , kot jo uvajajo sodobno organizirane tovarne v avtomobilski (General Motors) in računalniški industriji (npr. avtomatizacija proizvodnje mikroračunal-niSkih sistemov tipa PS in velikih računalniških sistemov pri IBM).
uporabo. To Je vizija in odločenost ob tej otvoritvi in prav zaradi tega Selim kritično poudariti, da bosta svetovno konkurenčni plasma in uporaba mogoča le tedaj, če bo sodobno organizirana znanstveno-produkci j s k a veriga dobivala ustrezne tržno-strateške inpute in le če se bo vzpenjala na skrajne meje znanstvene in tehnološke i no va t i vnos t i ob izredno samokritičnem uresničevanju vseh dejavnosti sklenjene verige raziskav, razvoja, proizvodnje in trženja.
Vsestranska znanstveno-tehnolofika 'povezanost Iskre Delte s svetovnimi in domačimi raziskovalnimi, razvojnimi, tehnološkimi, proizvodnimi in poslovnimi srediSči bo odločilna in mora neglede na trenutne krizne okoliščine slej ko prej ostajati, nastajati in se razvijati kot glavna strateška usmeritev računalniškega podjetja. TakSna usmeritev, ki edina zmore zagotavljati obstoj in prepotrebne razvojne impulze lastne in druge sodobne industrializacije na naSih tleh in v evropskih koordinatah, bo lahko noSena le a strokovno in intelektualno vrhunsko usposobljenimi kadri v vodenju, raziskavah, razvoju, proizvodnji in trženju. V naporih za svoje preživetje in razvoj se bo Iskra Delta vedno znova soočala s problemom zahtevne kadrovske kvalifikaciJe, izkušenosti in intelektualne prodornosti. Značilni kadrovski kompromisi danaànje stagnantnosti, neaktivnosti in disfunkcije ne bodo zadostovali niti za golo preživetje.
Računalniška industrija in razvoj te industrije sleherne države sta danes pomemben integracijski dejavnik na področju narodnega gospodarstva, raziskav, razvoja, proizvodnje in trženja. Ta industrija je zato skrbno opazovana in varovana z državno regulativo, saj je izhodiščna za razrast druge industrije, malih podjetij materialne in programske proizvodnje, trgovine, nacionalnih šolskih, zdravstvenih in drugih	infrastrukturnih	dejavnosti.
Računalniška industrija je temeljna podlaga prihajajoče informacijske epohe. Narodi ali mednarodne skupnosti, ki Se domišljajo in gradijo svoje vizije preživetja, morajo jasno in načrtovano predvidevati svoje vire informaciJskega znanja in tehnologije.
Tudi funkcija današnje Iskre Delte postaja čedalje bolj raznovrstno integrativna na področju poslovnega povezovanja, postaja pa tudi bistveno inicializirajoča skladno in spontano s tistimi potrebami sodobnega tehnološkega in socialnega razvoja, ki zmore odločilno posegati v prestTukturi ranJ e papirnatih vrhunskih raziskav in tudi v navidezno oziroma nebistveno akademsko tehnološko ' napredovanje. Raziskovalna industrijska iniciativa Iskre Delte že oblikuje tisto bistveno polje razvojne relevantnosti, ki bo lahko podlaga ključnim preobratom v razvojni strategiji sozda Iskre in Gorenja in t.i. raziskovalnih skupnosti. Iskra Delta vendarle in to navkljub občutjem krizne brezizhodnosti odpira obetajoče in motivacijsko bistvene
Slika 7 (zgoraj). Na tej sliki je viden del laboratorija za razvoj mikroraäunalniäkega sistema Triglav, in sicer za modul z 32-bitnini mikroprocesorjom Intel 80386. Iz slike je razvidno, kako so svetli laboratorijski prostori bogato opremljeni a najrazliònejèiml aparaturnimi pripomočki za kompleksen razvoj (načrtovanje, risanje, simulacija itd.) vezij in podsistemov. Preko posebnih terminalov je laboratorij povezan z visoko zmogljivimi miniracunalniskimi sistemi lastne proizvodnje (glej npr. sliko spodaj). Ti razvojni računalniki so locirani na veC mestih v proizvodno-raziskovalnem arediäöu Iskre Delte.
Slika 8 (spodaj). RaCunalniäki center projektov za operacijske sisteme in super miniraćunalnike je le eden od petih računalniških centrov v proìzvodno-razvojnem arediSöu Iskre Delte v Stegnah. V ozadju je v sredini slike viden veC-procesorski rsCunalniSki sistem ID Gemini, s skupnim diskovnim obsegom 4,2 Gbyte. Ta sistem je namenjen med drugim tudi razvoju IDA orodij in lastnih operacijahih sistemov iz družine Deltix. Na levi strani slike je viden sistem VAX n/780, ki se uporablja pri razvoju super tniniraCunalnikov (za simulacijo, naCrto~ vanje in druge razvojne naloge).
T^'iv- ■ : - " ■■ ■
„■'t-. '^.^■JS
razvojne perspektive, ki jih letargiCno in nemotivirano okolje že sprejema s tihim odobravanjem, saj vidi in Cuti v njih svoj izhod, razvojno upanje in preatrukturirno zaupanje. V tem spoznavnem kontekstu je razvojna uspeSnost Iskre Delte izredno pomembna za obćutje ihdustri jakih in drugih infrastrukturnih razvojnih moSnosti jugoslovanskih nacionalnih in federalnih znanstveno-tehnoloökih strategij.
V bližnjem razdobju se bo z nadaljnim Deltinem razvojem pojavila tudi v obeh sozdlb pa tudi v strokovni in SirSi javnosti bistveno nova, oživljajoča pobuda. Slovensko druStvo Informatika bo oblikovalo t.i. Forum informationis' , ki ga bodo sestavljali najvidnejši računalniški strokovnjaki in novinarji, kot kritično, problemsko in intelektualno usmerjeno telo za podroóje računalništva in informatike. Delovno polje relevantnosti tega telesa bo raznovrstna strokovna, raziskovalna, organizacijska in proizvodna pobuda. Hkrati bo Slovensko druätvo Informatika reformiralo svoj strokovni Časopis Informatica tako, da bo njegova vsebina bistveno povezana z raziskovalno in razvojno problematiko obeh aozdov, da ae bo skozi časopis razvijala tudi potrebna motivacija za ràziskovalnost, razvojnost, strokovnost in znanstveno-tehnoloéko obveSčenoat raziskovalcev in inženirjev na področju računalništva in informatike. Z odpiranjem Deltine tovarne se tako de facto zaöenja tudi bistveno nova, oživljajoča pobuda strokovne aktivnosti kot nujnost iti kot predhodnica in posledica industrializacije, kot oko in uho potrebne razvojne relevantnosti.
Z vstopom Iskre Delte v računalniško industrializacijo se odpirajo nove možnosti za infornacijako-značilno in spremljajočo drugo industrijo v ožjem in äiräem okolju. Državni,
gospodarski in politični centri odločanja naj bi take in drugačne, smiselne in integracijske pobude in tokove, ki so povezani z informatizacijo ključnih dejavnosti in z njihovim nujnim razvojem, z razumevanjem podpirali. Iskra Delta je eden tistih inovativnih gospodarskih poganjkov, ki je äe sposoben izvenkrizne rasti, motivacije in pozitivnih gospodarskih učinkov. Zaradi tega pričakujem, da bo politična moč podpirala prav svoj lasten, demokratičen razvoj v največji meri tudi tako, da ne bo postavljala umetnih zaprek Deltinemu razvoju tam, kjer bi ga bilo smiselno kvečjemu pospeševati. Seveda pa mora tudi Iskra Delta v prihodnje sprejeti strokovno in vsestransko odgovornost za skladen in ekonomsko učinkovit razvoj računalništva in informatike na naSih tleh in v SirSem evropskem in svetovnem prostoru.
Podobno kot ae s tem govorom obračam na Sirèo slovensko in jugoslovansko javnost, se obračam Se posebej na Deltine inženirje. Predvsem od vas Deltini inženirji, od vaše strokovne zavezanosti, organiziranosti, brezkompromisar-skega izražanja vaakrSnjih kvalitativnih zahtev, od vaSega hotenja in raziskovalno-tehnoloSkega izpopolnjevanja je in bo odvisen razvojni in gospodarski potencial in uspeSnoat Iskre Delte. Slej ko prej se bo potrebno odpovedati konzervati vni in že danes razvojno neperspektivni tehnologiji' in osvajati naprednejšo, za vse nas napornejso in zahtevnejšo tehnologijo in znanje. Znani koncepti kopiranja in golega posnemanja ne bodo več zadostovali. Zato pričakujem, da bo vaSa strokovna in organizacijska glasnost glasnejša, strokovno bolj inte 1ektua1 i z i rana in konceptualno smelejSa, kot je bila doslej; da inženirji ne boste le nemočni opazovalci in nekritični sprejemniki tistih usmeritev, katerim se argumentirano potihoma ali vsaj ne dovolj glasno upirate. Inženirji
* Forum informationis je latinskega izvora in pomeni trg {javno mesto) predočbe (informacije). Pomen foruma je dvojen: informacijsko strokoven , tj . utemeljen s strokovnostjo informacije kot informacije (računalnik je danes pojem za informacijski stroj) in javen, tj. kvalificiran za javno razpravo o računalništvu in informatiki.
• Konzervativna te/inoJo^lja je v svojem bistvu subkulturno oblikovana (funkcionalno in '"oblikovno "dizajnirana"), modificirana, največkrat reducirana tehnologija, ki ne sledi več glavnim, tudi konjunkturnim razvojnim, tržnim in atandarizacij-skim trendom, se pa na značilen nesposobnosten način odeva v svoje zaščitne znake in ščiti tako le sama sebe pred nikomer. TehnoloSka konzervativnost je lahko le trenutna tržna poteza, ki jo je potrebno čimprej nadomestiti s tržno relevantno noviteto.
® Po otvoritvenem govoru' so se pojavili očitki, da je ta govor poziv Deltiniai inženirjem na upor. Moj odgovor na te očitke je bil in ostaja, da se Deltini inženirji dejansko morajo argumentirano in zavestno, torej sistematično in strokovno organizirano, z vso intelektualno silo, ki je vest in zavest sodobnega inženirja, upirati kriznemu napredovanju, ki grozi z razrušitvijo doseženega industrijskega, predvsem pa v tem kompleksu nujnega, vse drugo pogojujočega raziskovalno-razvojnega potenciala in s tem k razrušitvi določenega preživetvenega minimuma, Inženirji kot vlečni konji so moralno poklicani, da vlečejo skupaj z drugimi voz visokotehnoloSkega podjetja iz krize, da gradijo svojo moralno in strokovno motivacijo predvsem tudi na elementu vleke iz kriee. Brez te vleke, ki Je moralna in intelektualna. Je tudi sama populacija inženirjev obsojena na propadanje, na drsenje v letargijo in v skrajni posledici na izumrtje.
Slika 9. V novih prostorih proizvodno-raz vojnega centra Iskre Dette v Ljubljani, v Stegnah 15b, je dobilo avoj prostor tudi uredništvo öasopiaa Informatica. Iz slike je razvidna funkcionalnost proatora, ki je razen 2 uredniškimi mizami in policami opremljen tudi a teriDinaloiD, mikroračunalnikom in pomožnim tiskalnikom. To je hkrati tudi delovni prostor glavnega urednika.
Ob vsem tem je gotovo treba poudarili, da je Iskra Delta materialno In vsebinsko z vsestranskim razumevanjem podpirala delo Časopisa Informatica od leta 1980 naprej. Le pod takimi pogoji je bilo mogoöe vzdrževati utefieno, standarizirano in zanesljivo urejanje in izhajanje časopisa Informatica. Seveda Je pri tem nujno poudariti tudi urejeno razmero v Raziskovalni skupnosti Slovenije, ki je z vaakolelniiD financiranjem omogoCala praktično neproblematično izhajanje Časopisa.
Okolje uredništva Časopisa Informatica je izjemno primerno, saj je uredništvo locirano v eamem erediäCu t.i. raziakovalno-raz vojne enote Iskre Delte; ta enota predstavlja bržkone jedro prihodnjega (korporativnega, tj. Iskrinega) instituta za raCunalnid tvo (in informatiko). Prav s tem pa se vpliv stroka in aktivne publicistične kulture v taki in drugaCni obliki prenaSa tudi na delo vrste raziskovalcev, razvojnikov in inaenirjev Iskre Delte. Uredništvo lahko na ta naCin tudi sistematično vzpodbuja in povezuje potencialne avtorje v Iskri Delti in izven nje. V nove prostore uredništva prihajajo tudi avtorji in Študentje iz drugih krajev, iz zunanjih institutov in jugoslovanskih univerz. Na ta naCin se smiselno oblikujejo interesne povezave nas ih in tujih avtorjev in strokovnjakov.
(Nadaljevanje podnapisa k sliki 9.) Zanimivo je tudi, da krizne razmere ne vplivajo negativno na rast obsega, kakovosti in na distribucijo časopisa Informatica. Število prispevkov naraSCa in urejanje se ne sooCa veC a problemom premjahnega obaega, katerejra spodnja meja je bila ob ustanovitvi Časopisa v letu 1977 postavljena na 72 strani za eno Številko. Čedalje veCja je tudi pripravljenost avtorjev, da piftejo svoje prispevke v anglefikem jeziku, s Čemer se bistveno poveCuje komunikativnost Časopisa oziroma prispevkov posameznih avtorjev s tujino.
V bližnji prihodnosti bo Časopis Informatica nekoliko spremenjen, NaCrtuje se sodobnejša oblika oziroma vsebina platnic. Z razvojem t.i. namiznega urejanja in izdajanja pa bi bilo mogoCe vpeljati tudi izpisovanje vseh tekstov z laserskim tiskalnikom. V tem primeru bi morati avtorji svoje Članke dostavljati v uredništvo (Se) na disketah oziroma z elektronsko poŠto. Na ta naCln bi dobili Časopis, ki bi bil v vseh ozlrih (In po moanostl ob nebistveno spremenjenih stroSkih) tudi tehnološko kvalitetno oblikovan. Ob veem tem je seveda potrebna tudi spremenjena uredniška politika, ki bi zagotavljala kvalitetni dotok prispevkov In morda tudi pogostnejSe izhajanje Časopisa (v prvi fazi bi bilo mogoCe brez posebnih naporov izdajati letno Sest namesto dosedanjih Štirih Številk v enakem obsegu).
Na koncu tega zapisa velja omeniti, da je potrebno navajati novi naslov uredništva, ker se dogaja, da posta vraCa pošiljke, ki uporabljajo stari naslov. Novi naslov je:
Časopis Informatica, Iskra Delta, Stegne 15Đ, 61000 Ljubljana.
raziskovalni, razvojni, proizvodni, finanöni in prodajni - ste nosilci in vleòni konji visokotehnolodkega podjetja. Prav zato naj bi tudi vsi drugi - vsi delavci Iskre Delte in uporabniki Deltinih proizvodov - vlekli v vami in ne proti vam*.
Današnja otvoritev računalniške tovarne in raziskovalno-razvojnega centra ob njej je praznik SirSega okolja neglede na to, kako se to v tem trenutku mani fest ira^ . To je zgodovinski korak v domafii industrializaciji računalništva. Zato končujem s spoštovanjem, priznanjem in z zahvalo vsem, ki so na tej dolgi poti načrtovali, pripravljali in nosili kamne gradnike' .
Hvala!
' Manifestacija otvoritve računalni&ke tovarne Iskre Delte je pokazala značilno nerazumevanje in g tem soc!Ìa.lno izoliranost slovenskega univerzitetnega, znanstveno-institucionalnega, gospodarsko-zborničnega in političnega estalishmenta do dejanskega tehnolo&kega .razvoja in s tem do bistvenih preZivetvenih možnosti slovenske in z njo vred jugoslovanske populacije. Kljub prisotnosti najvišjih predstavnikov slovenske znanosti in umetnosti, njenega predsednika akademika Janeza Milčinskega, visokih vojaSkih osebnosti in vrhunskih posameznikov strok, je lahko Mija Repovž naslednjega dne resignirano ugotovila:
"Ce bi pomen neke tovarne presojali po goatib iz političnega vrha, ki naj bi s avojo navzočnostjo otvoritvenemu ceremonialu dali pečat družbene skrbi in poaor/josti, Je bila za ta podalpski kot Jeseniška Jeklarna dogodek številka 1 zadnjih desetletij, novi razvojno-proizvodni center Iskre Delte pa epizodica, ki naj ne moti velikega duha ..."
{Neelaatićna in soft pravila. Delo, stran 2, 5. decembra 1987).
' K načrtovanju, pripravljanju in noSenju kamnov gradnikov bi morali dodati fio trpljenje, tragično dinamiko in dramatizacijo epopeje hitre rasti podjetja Iskre Delte in same graditve tovarne in njenih brezizhodnih posledic. Vrsta vodilnih delavcev je bila postavljenih v iskanje izhodov in je občutila nečloveške, brezobzirne in skrajno nehumane (totalitarne) pritiske, pod katerimi je doživljala tudi svojo lastno deformacijo in razcep. Doživljala je svojo lastno nehumanost in izgubljenost (suicidnost), MiJa Bepovä* priznava Janezu Skrvbeju, generalnemu direktorju Iskre Delte vlogo jugoslovanskega Stevena Jobsa. Osebno sem prepričan, da bi vsak direktor, ki je s svojo osebno odgovornostjo zgradil tako veliko računalniško podjetje in s
svojo äirino omogočal razmah vrhunskih industrijskih raziskav in razvoja^«, moral imeti na voljo vsaj en izhod, ki bi bil priznanje (ne nagrada) njegovemu požrtvovalnemu, izčrpljajočemu delu, v katerem nikoli ni videl tiste lastne koristi, ki so jo lahko zaradi njegovega dela pobirali drugi. Skrajno neetično in za tehniäko inteligenco nesprejemljivo je atalifiče, da lahko določena višja struktura, popolnoma birokratsko, post-totalitarno, po logiki svoje totalitarne samo-poklicanosti atigne ustvarajalno osebnost v poligon svojih brezobzirnih in ruftilnih manipulacij.
• Mija Repovž: Računalništvo na nas način. Delo, Sobotna priloga, stran. 18 (5. decembra 1987) .
1® A. P. Zeleznikar: Parsys Expeditions to New Worlds. Informatica 11 (1987), No. 3, 76-80.
V tem svojem članku sem opozoril na posebno, seveda odločujočo, atrateäko smiselno vlogo generalnega direktorja Iskre Delte, Janeza Skrubeja: "... Quite at the beginning, Iskra Delta has introduced and incorporated strategic thinking and acting in its managerial decision making. As a fast growing enterprise in the field of computer industry, it has been confronted not only with the very basic organizational and technological problems of modern computer industry of the developed hemisphere, but alao with specific problems of a technologically and even civi 1 i zationally (socially, ideologically) underdeveloped environment. In these times, up to this day, the general director of Iskra Delta
- Mr, Janez SkrubeJ -was not only the real strategist of the company, but also the optimistic fighter, organizer, believer of the progress, and the carrier of several hard, arduous, and exhausting business situations and developmental processes of the company. And all of this in irregularly and unforeseeably changing circumstances of the underdeveloped hemisphere. Thus, it is to say, that he was the main initiator and supporter of the innovative-and independent Parsys project.
APPROACHING THE LIMIT OF CLASSIFICATION ACCURACY
INFORMATICA 2/88
UDK 519.226.3
Matjaž Gams, Matija DrobnIč Jožef Stefan Institute
ABSTRACT. One of the recently developed systems for machine learning (GINGSYS) significantly outperformed all compared systems including theoretically optimal Bayesian classifier, which was the second in both tests. We tested several options in Bayesian classifier to investigate the teal cause for nonoptimal results and to estimate the upper limit in classification accuracy. The conclusion is that while it is possible to achieve even higher classification accuracy with suitable parameter adjustment in Bayesian classifier, it seems that GINESYS practically achieved the optimal classification accuracy.
Sistem za empirično u&enje GINESYS je v praktičnih meritvah presegel primerjane^sisteme z Bayesovim klasifikatorjem vred. Podrobna analiza kaže, da je dosežene reeultate mogoče preseči, vendar so iže zelo blizu optimalne meje.
1. INTRODUCTION
SWITCHES
status
Kachine learning is a quickly developing area of Artificial intelligence (Winston). According to the major inference type used it can be divided into rote learning, learning from instruction, learning by deduction, by analogy, from examples and from observation and discovery [Carbonell et al; Hichalski]. The scope of this article is learning from examples or Empirical Learning (EL). The aim of EL is to induce general descriptions of concepts from examples (instances) of these concepts. Examples are usually objects of a known class described in terms of attributes and values. The final product of learning are symbolic descriptions in human understandable forms. Induced descriptions of concepts, representing different classes of objects, can be used for classifying new objects. EL systems basically perform the same task (classification) as statistical methods and can be directly compared to them from the point of classification accuracy. On the other hand, EL systems offer further advantages, namely a) explanation during classification of new examples and b) the insight into the laws of the domain by observing classification rules. Explanation during classification (a) is important since it enables the usee to check the line of reasoning and verify the system's decision. The knowledge base (b) can be viewed as a new representation of the domain knowledge, which can be of great value to domain experts, especially in domains that are not yet well formalized and understood.
2. a simple example
For a simple example let us consider a case where we have a device with 8 binary switches representing 256 legal combinations. Device reports errors in some combinations and we want to find out what subsequence causes them.
1 2 3 4 5 6 7 8
1 1 0 1 0 1 0 1 0 1
1 0 1 1 1 1 0 0 1 0
error ok
ERROR OK
error
Table 1. Device reports error in some combinations of switches. Which subsequence of switches causes them?
Probably the most common answer in EL systems would be, that error is reported when switches 5 and 8 are on (»1).
In practical tasks EL systems deal with domains with 10 to 10.000 examples (typically some hundred) with 2 to 500 attributes (typically ten or some ten) [Breiman et al; Qulrlan; Lavrac et al]. Attributes can be real, integer or categorical with many possible values.
3. EMPIRICAL LEARNING
The whole process of empirical learning consists of four steps:
-	preprocessing of learning examples,
-	construction of a classification rule,
-	classification of new instances and
-	analysing the laws of the domain.
Detailed description can be found elsewhere, e.g. [Kononenko] or [Gams, LavracJ with detailed overview of some well known algorithms - C4 [Quinlan], CART [Breiman et al], ASSISTANT 86 (Cestnik et al], CN2 [Clark, Niblett], aqIS [Carbonell et al}. We shall formally represent here only a domain area and a classification rule.
A set of learning examples L - {(x,c)} consists of pairs .'.,c), where x is a vector (denoting propercies of the object) in a
measurement space x and c represents index of the class of example x.
the
Components of vectors x are called attributes or variables. The values of attributes can be numerical or categorical.
A classification or a decision rule đ(x) is a mapping which maps every x from X into some c from C or into the probability distribution (pl,p2,..pj) where pi is a real number between 0 and 1.
A classification rule d(x) splits the whole space X into spaces X1,X2,...XJ, such that for every Xi only a certain subset of d(x) is relevant.
The syntax of a classification rule d(x) is;
<d>	Si- <Rule> 1 <Rule> and <d>
classification rule
<Rule>	<Class> if <Cpx>
rule
<Cpx>	<Sel> | <Sel> and <Cpx>
complex
<Sel> selector
Atr <op> <Values>
<Values> It- val ( val or <Values> values
<Class> !!- 112|3...|J class
<op> ■ J I- < I , I > operators
Atr corresponds to the name of the attribute and Val is a categorical or numerical value.
This syntax is transformable into DNF and is similar to the syntax of most rule-based systems or expert systems [Waterman, Hayes-Roth]. Note that the actual syntax is slightly more complicated [Gams].
4. DOMAIN DESCRIPTION
We performed practical measurements on two real-world domains. Data were obtained by I. Kononenko and represent descriptions and diagnoses of patients from the Oncological Institute Ljubljana. The only correction was replacement of missing values by the most probable ones for a given class. More detailed description is in [Gamsi, here we present only cumulative data about these domains:
Domain 1
number of attributes	18
no. of possible values per attribute 2-8
( average 3.3 ) . number of classes	9
total number of examples	150
distribution of examples amongst classes number of examples in CI to C9 12 .34 56789 2 1 12 8 69 53 1 4 0
Domain 2
number of attributes	17
no. of possible values per attribute 2-3
( average 2.2 ) number of classes	22
total number of examples	339
distribution of examples amongst classes
number of examples in CI	to C22
1 2 3 4 5 6 7 8	9 10 11
84 20 9 14 39 1 14 6	0 2 28
12 13 14 15 16 17 18 19 20 21 22 16 7 24 2 1 10 29 6 2 1 24
importance of attributes - Al to A17 (counting how many examples overlap when omitting the i-th attribute)
123456 7 89 80 80 58 85 60 53 65 55 68
10 11 12 13 14 15 16 17 63 55 60 53 54 57 65 65
5. GINESYS
importance of attributes - Al to Aie of them is redundant
none
5.1. ALGORITHM DESCRIPTION
The top level description of GINESVS (Generic iNductive Expert SYstem Shell) is as follows;
repeat
initialize Rule; generate Rule; add Rule to d(x);
L - L - {examples covered by Rule) until satisfiable(d(x))
In this general view GINESVS represents a prototype of a unifying algorithm for empirical learning covering many other systems. In a slightly more specified description we obtain the following algorithm!
repeat
generalise Rule; repeat
specialize Rule until stop(Rule)f postprocess(Rule); add Rule to d(x);
L - L - (examples covered by Rule] until 6atisfiable(d(x))
The main difference between other EL systems and GINESYS is in "confirmation rules". Basic idea of confirmation rules is using several sources of information for classification. That seems to be common practice in every day life. For example when we try to predict the weather, we look at the official weather report, but also look at the sky and ask our neighbour. The implementation of this idea in GINESYS is that instead of using only one rule for classification several rules confirm or confute the first one. In case of a confrontation between these rules the Bayesian classifier is consulted as a method of a conflict resolution (Waterman, Hayes-Roth]. One confirmation rule in our simple example in Table 1 could be ; Error is reported, when switches 3, 5 and 8 are on. This rule could be redundant or even wrong, but on the other hand it could be the only correct onel From examples in Table 1 it if not clear which of these possibilities is the right one, so both (and other) rules are
14
Stored and consulted. In more detailed tests [Gams] it was shown that this method of consulting several rules (> using different kinds of Information) significantly improved classification accuracy.
5.2. COMPARATIVE RESULTS
A detailed comparison was made with other well known EL systems in two noisy medical domains (oncology). Table 2 shows results in classification accuracy.
GINESYS BAYES
other systems
domain 1 69.9 68.4 67.3
domain 2 51.9 50.1 48.7
Table 2. Classification accuracy measured as the percentage of correct guesses.
While GINESYS achieved bost results and Bayesian classifier the second ones, none of the compared systems [Gamsl outperformed results in the last cow In Table 2. These results ace actually an avecage over ten cuns on randomly chosen 70% of data for learning and remaining 30% of data for testing. In further tests (t-tests, [Gams]) it was shown that the number of tests) distribution and the difference between classification accuracies was sufficient to ensure that differences are a result of some deeper cause (e.g. better algorithm) and not a chance choice.
Other measurements proved superiority not only from the point of classification accuracy, but also in generality, complexity of classification rule and explanation [Gams]. GINESYS and other algorithms discussed in this paper were implemented In Pascal on VAX 11/750.
5.3. IRREPROACHABILITY OF MEASUREMENTS
He argue that our measurements are
irreproachable (unbiased) since;
all systems were measured on exactly the same data
-	no "cleaning" of data was performed
-	no special form of data was allowed
-	no unusual method of measuring classification accuracy was used
-	no domain dependent parameters were allowed
-	the number of data and tests was sufficient (t-tests) to avoid chance choice
-	results were strictly checked and verified by many supervisors from the program source level to the level of classification trace.
accuracy as other statistical methods, in our measurements some of the EL systems, especially those without special mechanisms for noisy domains, gave unexpectedly poor performance compared to the results published by the originators of algorithms. Since our implementations of those systems were the same as published, several	possible
explanations remain. It might be that actual Implementations use some unpublished extra features, maybe the domains used foe testing were especially suitable for specific algorithms etc.
The authors of this article also find questionable comparing between the system and the expert, since we regard EL systems mainly as a helping tool and not as a stand-alone program. The other reason is that fair comparison between machine and human is extremely difficult. The correct comparison should be (system + user) : user.
In most complex realistic domains mechanisms for dealing with noise are of greatest importance as Independently discovered in [Breiman et al; Kononenko] and it is not realistic to achieve even tolerable results without them [Kononenko; Gams].
6. BAYESIAN CLASSIFIER
6.1, THEORETICAL FOUNDATIONS
The concept of the Bayes rule is one of the most important concepts in the field of classification and also learning. For the data drawn from a probability distribution P(A,j), the most accurate rule can be given in the terms of P(A,J} and this rule Is called the Bayes rule. It is normally denoted by dB(x).
precisely, suppose that (x,y), x« K, y« Y is a random sample from the probability distribution P(A,j) on X x C, i. e., P(x «■ A, y=j)"P(A,3). Then we define dB(x) as the Bayes rule if for any other classifier d(x),

Let us assume that X	is H-dlmensional
euclidean space and	for every j, j=l, ... ,J, P(A/j) has the probability density fj(x) and for sets AC X
Then we can prove the following theorem;
5.4. DISCUSSION ABOUT RESULTS
Some of the systems for empirical learning achieved good results in practical testing In several real life domains, practically approaching or even outperforming domain experts and statistical methods [Kononenko; Hichalski, Chilausky; Breiman et al; Gams]). Hore acceptable is the opinion [Breiman et al), that although all methods are more or less domain dependent, EL systems in general achieve about the same classification
where J is the number of classes and P(j) is the prior probability of the class j. The proof can be found In [Breiman et al).
in practice, neither the P{j) nor the fj(x) are known. The three most common classification procedures, used to approximate the Bayes rule by using the learning sample data, are discriminant analysis, kernel density estimation and k-th nearest neighbour. Accuracy of two of them have been compared with the results of Glnesys on both domains.
6.2. PRACTICAL IMPLÉHENTATIOMS
6.3. A PRACTICAL EXAMPLE
The k-th nearest neighbour method [Fix, Hod9e6) was implemented as simple as possible. The algorithm eearches through the set of learning examples and determinates distance between learning and test example as the number of mismatches in- their attribute values. Test example Is then classified by its first nearest neighbour, and if there are more equally distant neighbour, the last one found is picked for classification. It is so called	Nearest-neighbour	classifier
iBatcheles 1974], Although it is so primitive, this method classifies test examples with 72.9% average accuracy in the domain 1, what is even better than GINESYS. However, in the domain 2, which is far more complex, the classification accuracy is only 40.4% what is considerably lower than that òf the other methods.
The following approximation of the Bayes rule (Clark, Niblett; Kononenkoj is one of the most commonly used. In general, the rule is farmed as

At this point the assumption is made, that all attributes are independent :
mAijc)
\}nAiic]
(11
when classifying a new example we need to evaluate formula (1), One practical solution when dealing with categorical values is to store all factors into a 3-diraenslonal table TB[i,j,k] with the following indexes :
i - attribute index j - attribute's value's index k - class index
TB(i,j,k] is the number of examples in the learning set with the properties, denoted by index values.
When evaluating formula (1) during classification of a new example, one of the factors can be 0. The result can be either 0 or undefined. The solution in the second case is obvious - delete this attribute from the formula. The same solution is sometimes used when the result is 0.
In Table 2 GINESYS (without domain dependent parameters or other adjustments [Gams]) achieved higher classification accuracy than the practical implementation of theoretically optimal Bayesian classifier. The reason for this must be in practical Implementation, especially in
a)	approximation of probabilities from the learning set,
b)	assumption, that attributes are independent,
c)	practical solutions to numerical problems.
Problem (a> can be discarded, since all systems [Gams] processed exactly the same data. But it could be the case, that ditferent classifiers (also different implementations of Bayesian classifier) are more and other less sensible to the number and distribution of input data.
Problems appearing during the evaluation of formula (1) can be shown in a simple example. Let us try to classify examples el and e2 from data obtained from Table 1.
el - 0000 0000 e2 -10111011
P{OK/ü)=l
P{ERRORlel) = t
P{ERRORfa)
None of the two examples gives the sum of all classes equal to 1. Even if we delete all columns having 0 we obtain results like P(OK/el) - 2.6. Furthermore, in case of example e2 the probability for class OK is greater than for the class ERROR, although example e2 is the same as example e5 from Table 1, belonging to class ERROR. However note that the nearest neighbour method would classify correctly in this case,
A small number of examples, is insufficient for most statistical methods and also foe Bayesian classifier. In real measurements in domain 1 and 2 the number of examples was always greater than one hundred and was considered sufficient. Nevertheless these counterexamples	show that	better
classification accuracy is possible.
7. ADJUSTING PARAMETERS CLASSIFIER
IN
BAYESIAN
Probabilities used in evaluating formula (1) are approximated by prior probabilities in the learning set, what yields some error in classification. The formula is then evaluated as follows
pm

(2)
where
ai is the number of examples of the class c with the same value of the i-th attribute as the test example, b is the number of examples of the class c,
ci is the nurober of all examples with the same value of the i-th attribute as the test example, and d is the number of all learning examples.
When dealing occur during
with noisy data, errors may evaluation of formula (2). Two methods have'been usèd to avoid this errors.
7.1. OMITTING THE UNRELIABLE FACTORS
The main idea of this.method is that' the accuracy of estimations in formula (2) grows 'with the nfjmber of examples. Therefore, if b
oc d (only b in practice) is smaller than parameter HINH, which is set before evaluation, then the factor with this b is omitted during evaluation of formula (2) and
»Btimated by its shows the results in classification accuracy with different values of HINN.
WHlAWVeW	W dX Vid ^Wk'i
the class probability Is es prior probability. Table 3 st
KINN :	domain 1	domain 2
0	66.4 ( 0.0)	50.1 ( 0.0)
1	66.4 (11.1)	50.1 ( 4.5)
2	66.4 (47.6)	49.7 (31.8)
3	69.3 (52.2)	49.7 (32.3)
4	68.7 (56.7)	49.8 (33.6)
6	68.0 (60.0)	50.9 (47.7)
10	66.7 (75.6)	48.7 (60.5)
IS	67.1 (77.8)	46.2 (72.7)
Table 3.	Classification accuracy	measured as
Liie peEctiiita^e oc cocreci; guesses, values in brackets are percentages of classifications with prior probability.
It is interesting that accuracy is almost independent of the number of classifications with prior probabilities and it decreases only if we classify approximately 75% of examples this way. Yet we can see that accuracy on both domains reaches its maximum when approximately 50« of claesificatlons are done by the prior probability of classes and this maximal accuracy le near the accuracy of GIHESYS.
\ EPS :	0.00	0.01	0.05	0,10
hinn ;				
0	0.0	0.3	2.7	3.8
2	0.0	1.4	1.1	1.1
4	0.3	1.2	0.9	0.3
6	-0.4	0.0	-0.4	-0.4
Table 5. Increase of classification accuracy by combination of both methods in domain 1 (basic accuracy 68.4%).
\ EPS :	0.00	0.01	0.05	0.10
MINN ;				
0	0.0	2.7	2.8	1.8
2	-0.4	1.9	2.2	2.0
4	-0.3	1.8	2.1	2.0
6	0.8	2.7	3.2	3.2
Table 6. Increase of classification accuracy by combination of both methods in domain 2 (basic accuracy 50.1%).
The accuracy of GINBSYS is in both cases exceeded by more than 1% what is also the difference between basic Bayesian classifier and GINESYS. Yet there is a problem which combination of BPS and hinn values to choose and how far this decision is domain independent. Therefore, GINESYS can still be considered to reach the practical upper bound of classification accuracy.
7.2. ADJUSTING ZERO FACTORS
8.
EXTERNAL RULE CLASSIFIER
directed
bayesiam
A problem occurs, what to do when ai in the formula (2) is 0. One solution is to omit this factor from the formula. Another idea could be to set ai to a very small number EPS in such case. The idea is that this zero is the result of domain noise and, with more learning examples, we would sooner or later find such example and therefore we made almost no mistake and we also don't lose the information contained in the distribution of other attributes. The results of this method are shown in Table 4.
EPS :	domain 1	domain 2
0.00	68.4	50.1
0.01	68.7	52.8
0.05	71.1	52.9
0.10	72.2	51.9
0.50	24.2	27.7
1.00	2.2	7.5
Table 4. Classification accuracy achieved by adjusting zero factors in formula (2) by some small value EPS.
In both cases this method achieves accuracy higher than GINESYS. But, rapid drop of accuracy also shows that this method is very sensible to the value of EPS. Whenever we set zero factor to some value different from 0 we introduce an error into the evaluation of formula (2) and if the value of EPS is too big the results are no more reliable at all.
Bayesian classifier itself does not derive any explicit rules and therefore rules generated by some other system (in our case ginesys) can be used to control the evaluation of formula (1). Two such methods have been tested. The idea of the first one is that any (more or less successful) rule denotes a complex of attributes which are logically connected and therefore a deviation from the optimal Bayesian classifier is somewhat corrected. The second one is an attempt to cross GIHESYS and Bayes together to yield better results.
8.1.
CLASSIFICATION ATTRIBUTES
with
important
This method uses rules, generated by GIHESYS. During evaluation of formula (1) the rule which matches the current example is searched for. If it is found, we calculate adequate probabilities by searching through the table for entire complex and not by decomposing the attribute complex to basic attributes. On the other hand, if the matching rule is not found, undisturbed evaluation of formula (1) follows. The results are shown in Table 7.
Measurements show that introducing of externally generated rules into Bayesian classifier only slightly disturbs its classification accuracy.
7.3. COMBINATION OF BOTH METHODS
In this case, both methods described in 7.1. and 7.2. are used together. First we look whether b and d are bigger than HINN and than we set zero factors in formula (2) to BPS where needed. The results of measurements on both domains are shown in Table 5 and Table 6.
6.2. CLASSIFICATION ATTRIBUTES ONLY
with
important
The main idea of this method is to use important attribute complexes	in
classification if possible. Por each example classified we first search for the matching GINESYS rule. If such one is found, it is used for classification, if not (when only Null rule of GINESYS is found), the
classification is carried out by formula (1). The results are shown in Table 7.
Bayes
Rule Directed Bayes Important Attr. & Bayes
domain 68.4 68.4 69.3
domain 2
50.1
49.2 47.9
Table 7. Classification accuracy measured as the percentage of correct guesses.
9. COMPARISON BETWEEN EL STATISTICAL CLASSIFIERS
SYSTEMS
AMD
Let us summarize conclusions from previous paragraphs:
It is possible to further improve classification	accuracy	of
implementations of Bayesian classifier, even to overpass best results of GINESYS.
-	Among compared methods without domain-independent parameters	GINESYS performs best and is very close to the practical limit in classification accuracies in measured domains.
Statistical classifiers are basically unable to perform explanation during classification and to build a human understandable knowledge base. Besides these, other disadvantages can be pointed out [Bceiraan et al; Gams Is
-	they can not deal with domains with small number of learning examples;
-	it is difficult to deal with unusual situations (deleting by 0, unknown values, ... ) ;
-	their results vary according to the suitability of problem domain.
It is only fair to notice that more advanced statistical methods eliminate some of these disadvantages. However these properties remain basically unchanged.
Another reason for so good results of EL systems like GINESYS compared to statistical methods is shown in Figure 1. Real-life complex domains probably contain logical laws which cover greater areas regardless of noise in given examples. On the contrary statistical methods depend on variations of
probability distribution. In Figure 1 the correct probability distribution for classes 1 and 2 is presented by bold lines. Dotted lines represent probability distribution, obtained from given examples. Because of the fact that probability distribution is more sensible to chance choice it is possible that dotted lines 1 and 2 overlap causing incorrect classification.
Figure 1; A graphical representation of one of the possible reasons why GINESYS performs so well compared to statistical methods.
10. CONCLUSION AND DISCUSSION
older systems for empirical learning (EL)
outperformed the statistical methods from the point of explanation during classification and possibility of building a human understandable knowledge base, while it was in some cases reported that older EL systems outperformed statistical methods as well as domain experts this opinion is not undoubtedly shared with the authors of this article, nore acceptable is the conclusion [Breiman et al], that the best EL systems achieve about the same classification accuracy as statistical methods. Nevertheless it seems that the new breed of EL systems with GINESYS as one of the most promising representatives outperforms statistical methods even in classification accuracy (at least in so far measured domains).
ACKNOWLEDGES
We are grateful for suggestions to prof. Ivan Bratko.	Marjan	Petkovsek	provided
mathematical background for our analysis. Students Izidor Jerebic, Borut Znidar, Aram Karallc and Darko Zupanic were of great help in programming tasks. Igor Rononenko and the Oncological institute Ljubljana enabled us to do the measurements on real-world domains. This work was partly supported by the Slovene Research Council and COST-13. Research facilities were provided by the "Jozef Stefan" institute.
REFERENCES
Breiman L., Friedman J.H., olshen R.A., Stone C.J. (1984)! "Classification and Regression Trees", Wardsworth Int. Group. Carbonell J.G., Michalski R.S., Mitchell T.H. (1983): "An overview of Machine Learning", in Michalski R.S., Carbonell J.G., Mitchell T.M. (ed.). Machine Learnings an Artificial intelligence Approach, Tioga Publishing. Cestnik B., Kononenko i., Bratko I. (1987): "ASSISTANT 86: A Knowledge Elecitation Tool For Sophisticated Users", Progress in Machine Learning, Sigma Press.
Clark P., Niblett P. (1987); "Induction, in Noisy Domains", Progress i-n Machine Learning, SIGMA Press.
Kononenko I. (1985): "Razvoj sistema za induktivno učenje ASISTENT", magistrsko delo. Fakulteta za eletrotehniko, Ljubljana. Kononenko I. (1985): "Strukturno avtomatsko učenje". Informatica 3, str. 44 - 56. Gams M. (1987): "Principi poenostavljanja v sistemih za avtomatsko učenje", doktorska disertacija, Ljubljana.
Gams M., Lavra« N. (1987): "Review of Five Empirical Leaning Systems Within a Proposed Schemata", Progress in Machine Learning, (ed. Bratko I., Lavrac N.), Sigma Press. Lavra5 N., Varšek A., Gams M., Kononenko I., Bratko I. (1986): "Automatic construction of the knowledge base for a steel classification expert system". The 6th International workshop on Expert Systems, Avignon. Michalski R.S. (1987): "Machine Learning", Tutorial 6, IJCAI 1987, Milano. Michalski R.S., chilausky L.R. (1980): "Learning by Being Told and Learning from Examples: an Experimental Comparison for Soybean Disease Diagnosis", Policy Analysis and information Systems, Vol. 4, No 2. Quinlan J.R. (1986): "Induction of Decision Trees", AI Summer Seminar, Dubrovnik. Waterman Đ.A., Hayes-Roth F. (ed.) (1977): "Pattern-Directed Inference	Systems",
Academic Press.
Winston H.P.	(1984):	"Artificial
Intelligence", Addison-Wesley.
STANDARDS IN COVPUTER GRAPHICS
INFORMATICA 2/88
UDK 681.3.083.7
Niko Guld, Borut Žalik Tehnical Faculty Maribor
Abstract: In this paper standards in computer graphics are described. fit -first the reasons for evolution these standards are given and then the ways of accepting the international standards are presented. Afterwards the evolution phases of the graphical standards under ISO and ANSI are interpreted and current stage of particular standards are given. In proceeding the place of graphical standards and standard proposals in a graphical system are shown. Finally, the position and rale of the graphical standards in a modern CftD system is presented.
Povzetek: V ćlanku podamo pregled standardov v raćunalnižki grafiki. Najprej opišemo vzroke za razvoj teh standardov, nato pa prikaieno poti, preko katerih nek predlog lahko postane mednarodni standard. Zatem predstavimo raevojne fase ISO in ANSI standardov ter podamo trenutne razvojne stopnje posameznih standardov za raćunalnižko grafiko. V nadaljevanju opiéemo mesto grafitnih standardov oziroma predlogov standardov v grafičnem sistemu. Naaadnje podamo mesto in vlogo grafitnih standardov v modernem CflD sistemu.
1. THE BEGININQS OF STANDARDS DEVELOPMENT
Today, a large number of different graphical hardware and even more different graphical software exist. A big part of this graphical software is device-dependent. The consequences aret
1.	It is impossible to exchange graphical software between different graphical systems
2.	There are problems by installation of old programs on new graphical equipment, although it has been suplied by the same producer, etc.
Because of these problems an idea has been apps^rsd to make a device-independent graphical packet. Advantages_of this device-independent graphical packet are:
1.	It could serve different device generations.
2.	Programs could work on different graphical systems.
3. Programmers could immediately different graphical systems.
work
4. Graphical systems are distinguished only by quality, price, and efficiency.
Of course, this device-independent graphical packets have also some weaknesses. They are slower than device-dependent and more memory space is needed. Because the power of computers is rapidly increasing and their prices are decreasing, advantages of standardization are going over its weaknesses. People) who are opposite to standards in computer graphics, affirm that standards are against inovations. It is clear, when a standard is accepted, it could not be changed immediately.
Portability of api i cation programs could be achieved in some different ways /ENDE84/'t
with development of computer languages,
with extension of existing program languages with graphical features or
with libraries of graphical subroutines which could be linked into application program.
Experts from the field of computer graphics have chosen the last possibility by the construction	of	international
device-Independent	graphical	standard.
However, it is least elegant of all but it Is the best way to awoid confusing in structures of program languages. The place of the device-Independent graphical standard in a graphic system is shown by figure 1,
APPLICATION PROGRAM
DEVICE-IMDEPEHDENT ORAPMICAL PACKAGE
DEVICE		ORAPHICAL
DRIVER		DEVICE 1
DEVICE		QRAPHICAL
DRIVER		DEVICE N
Figure 1. The place of graphical standard in graphical system
The development o-f graphical standards began in the year 1974, when the Qraphics Standard Planing Cainmittee <6SPC) was found by ACM SISQRAPH ("ftssDciation for Computing Mashinery Special Interest Sroup on Graphics"). This committee met with other international members, Involved in computer graphics in Seillac <France> in 1976. This meeting had a great influence on first draft standard called" Core Systeoi. It was introduced on SI6GRAPH 1977. Two years later, on SIGGRAPH in 1979, an improved version of Core was appeared.
Soon after that a new group has been found by German Institute -For Standards DIN which has been worked on a neM graphical standard basis. The group has been directed by Jose Encarnacao and it prepared in 1977 a draft standard called GKS (Graphical Kernel System).
Two propositions were apeared by ISO in 1979: Core and GKS. Working Group WI32 by ISO decided that only efforts on BKS continued. GKS was much more simple, it was 2D, and it was intended for raster devices. On other side Core was 3D and destined for vector devices. The first draft proposal of GKS was made by ISQ in 19B2. QK8 was accepted as an ISO standard in 1985.
In 1981 SIGGRAPH SSPC committee was disbanded and passed over to the ANSI X:3H3 committee, which was founded in 1979.
XIH) gtipkkal lUMirdt
XlHl.l »Hl.t PHI GS (Funi)
S»««lllettlM)
MHt.) CGUCGU
(Luguigi Blndlagi)
OHl.t SKS
»HM (WU40WI)
Fi gurE
2. )C3H3 Tehnical Committee and its subcommi ttees
It is similar for International Organization for Standardization (ISO). ANSI is only a secretariate in ISO's Technical Committee TC97. Working Group WG2 in subcommittee SC21 is responsible for graphical standards. Its sign is ISD/TC97/SC21/WB2. '
Some standards which are set up by ISO or ANSI', are effective, but others are even ignored. From this point of view standards could be considered as de facto and formal standards /STRAST/.
For example, IBM's Color Graphics Adapter (CGA) is a de facto display standard for PCs, just as the IBM PC is a de -facto standard for personal computers. Neither CGA neither IBM PC was formal standards, but market factors has adopted them as standards.
Among formal standards we distinguish successful and unsuccessful ones. A case of formal standard, which has been widely used, is RS-232-C. On the other hand, who has heard about ANS X3.23 standard for keyboard layout. This standard has been totally eclipsed by de facto standards, first by the IBM Selectric layout and later by PC and AT layouts.
3. DEVELOPMENT PROCESS OF STANDARDS
GKS has become a basis for many other proposals of standards including PHIGS, CQM, and CGI.
I6ES, as a standard for transferring CAD/CAM data bases, has been developed in completely another way than Core and BKS (through others ANSI committees)i IGES was accepted as an ANSI standard in 1981.
2. WHO SETS UP THE STANDARDS?
That a proposal becomes a standard, there are several phases it must go through. It takes much more time for standardi zati on process in computer graphics than for standards from other areas, because the projects are very large and completly new.
Evolution process of an ISO standard
Evolution phases of an ISO standards are /BGNOaS, EiDNOBi/i
American National Standard Institute (ANSI) does not set up the standards, but it only whaches over the process, through which the standards are accepted. ANSI has to notice if a standard draft is acceptable by most wide part of industry. Only such standard could be adopted and used in industry and other institutions. ANSI adopts a standard as a national standard when it is acceptable by most companies and organizations.
ANSI consists of by several committees. So the ANSI X3 is the standards development committee for information processing and has about 3i3 committees, each wi t);i about 15 to 80 members /BQNQSè/. One of them Is )t3H3 tehnical committee, which ie responsible for computer graphics standards. X3H3 committee consists of 6 subcommittees, which is showed by figure 2 /STRA86/.
More than llM participants, representing about aä companies (CalComp, Control Data, DEC, HP, Honeywell, IBM, Intel, Tektronix, TI, etc.), attend )(3H3 meetings.
the first; New Work Item proposal (NWI); discussion about new project is started, when subcommittee (like SC21) or a member body (like ANSI) makes a proposal. Representati ves of different countries decide if they accept the definition of the work item and if the'work is continuing on this proposal. This stage can take 5 to 8 months.
the second: Working Draft- (WD)s document could be in this stage 6 to IS months;
the thirds Draft Proposed (DP); this stage can take 12 to 14 months;
-	the fourth: Draft International Standard
(DIS)! document could take place in this stage for 9 to 12 months;
-	the final: International Standard (IS),
Evolution process of an ANSI standard
Evolution stages o-f an ANSI standard differ from stages of an ISG standard and are followingi
-	the firsts Standing Document 3 CSD-3) is an initial proposal which can take no less than b months)
-	the secondi Working Drafts; X3H3 prepares a series of working drafts that are circulated among X3H3 members. This stage typically takes several years.
the third: Draft Proposed American National Standard (dp ANS); this stage takes é to 10 months;
- the fourth! Public Review; document could be in this stage 8 months or more, which depends an the number of public reviews. At least two public reviews are required by X3H3.
the finali Final Approval takes 6 to 9 months.
The current stage of graphical standards under ISO and ANSI is shown by table 1 /BONOBA, SELEB7/.
Table 1. The stage of graphical standards project
Project EKS
GKS Fortran SKS Pascal EKS Ada
ISO status
IS 7942 published in Avgust, 198S.
Known as ISO DIS B631/Ì. CIS ballot closed in Avgust, 198à.
Known as ISO DIS 8^51/2. DIS Ballot closed in August, 198à.
Known as ISO DP 06S1/3. Second DP ballot closed in Apri 1 , 1966.
ANSI status
ANS X3.123-1985. Published in October, 1985.
ANS X3.124.1-1903. Published in October 19B5.
ANS X3.124.2-1987. Public review closed in May, 1987.
ANS s<3. 124.3-19BX . Public review closed in 1986.
GKS C
GKS-3D
Not yet an ISO standard language. UJD available now (SC21/N669).
Known as ISO DP B80S. Second DP ballot closed in March 1986.
ANS X3.l24.2-19aK. Public review will be -finished by October,19S7.
Public review finished in 1980.
SKS-3D Fortran
Known as ISO DP 8806.
Public rewiew finished 19B6.
GKS-3D Pascal
Not yet available.
PHI GS
WD finished in 1986.
ANS X3.144~198m. Second public review finished in 19B7.
PHI GS Fortran
UID availablt
(SC21/N667>
Second public review finished in 1987.
PHIGS Ada
WD available (SC21/N819)
Public review finished in 1986.
CSM
(former VDM)
IS 8632 published in 1987.
ANS X3.122-1986 published in 1986.
CGI
(former VDI)
DP began in 1986.
ANS X3.161-198«. Public review finished in Juny, 1987.
4. THE PLACE OF THE GRAPHICAL STANDARDS IN THE GRAPHICAL SYSTEM
Six known could bE /DEUSa4/!
standards (suggested or accepted) devi ded into three chategorles
1.	Core, BKS, and PHIGS represent an aplication prograoiming interface (API). This API standards are usually implemented as a set of the external procedures and an application programmer could link them into his application code.
2.	IQES and CGM are used by transfering and storing the graphical information.
3. CGI represent interface.
graphical
device
Figure 3. Interfaces of the graphical system
These tree classes could help us by de-fining common -features o-f current and future standards with regard an performance, price, and usefulness of graphical so-ftware and hardware. A comparison and valuation of the graphical standards is not easy, because the majority of them are very compi e» and because they are comming from different areas. Figure 3 gives a review of the standard graphical interfaces.
The most important part o-f each language and device independent graphical system Is a member o-f the BKS standards family. These are GKS <Sraphical Kernel System) or its 3D SKtension or a standard for dynamical manipulation with graphical data structures PHIGB (F'rogrammer's Hierarchical Interactive Graphics System}. Functions of graphical system are eiiactly defined by these graphical standards and because of this reason they are also called functional graphical packets. They are completly language and device independent.
ftn aplication programmer is able to access -functions of these packets through a language binding. It has to adopt language independent functions of graphical packet to design and particularities of each high level program language (ada, C, pascal, fortran, basic).
functional graphical packet by tneta-file generator.	The metafile contents is
interpreted by metafile interpreter. Metafile could be interpreted directly by CGI or by functional graphical packet (6KS has particular types of workstations intend for manipulation with metafiles).
In presentation of interfaces of graphical system many authors mark also connective links between individual interfaces. They call them interfaces, too. Connective link presents a set of all functions, which the interface on the higher level of hierarhy of graphical system can access from the interface, which is Iower.
Far enamples A connective link between a language binding and a aplication program i s an exact declaration of all functions, which aplication programmer can include in his programs. There are declarations of all parameters and their types, which are used in the functions.
5. BRflPHICAL STANDARDS AND THEIR CAD SYSTEM
POSITION IN
The communication between a language and device independent graphical packet and graphical workstations is controled by CGI (Computer Graphical Interface). This standard defines functions and format for this communication.
Current graphical devices are not able to receive the CGI format and interpret its functions directly yet, so drivers are needed. But next-coming graphical devices are going to be driven by CGI format directly.
In capturing, storing, and transfering of graphical in-formatian standard CGM is involved
(Computer Graphical Metafile). Pictures are saved into metafiles, and are captured from
The most important aplication area of computer graphics is certainly CAD. Graphical system in the CAD system has to care about:
the graphical presentation objects and
of constructed
- about the user.
graphical interaction with an
If figure 3 is extended for a CAD aplication, a structure of CAD system is reached and it is shown on figure 4 /ENDEBi/. Older CAD systems have been usually put into only one product. Individual pieces of them have been neither
evident neither accessible by the user. Trends o-f next-coming CAD systems are to make the graphical system visible also to the user, so that he ccjuld add di-fferent graphical devices and transfer graphical data between different graphical systems. But this is possible only, if a graphical system consists o-f the standard interfaces.
A core of a CAD system includes functions far modeling,	presenting,	calculating of
constructed models, and a modul for interactive dialog with an user. The most important part of a CAD system is a CAD data base, which saves all information about created objects. A core of a CAD system is only a superstructure of the graphical system, which takes care about objects presentation on graphical devices and for graphical interaction with its standard interfaces. A modern CAD system has his own program's interface. So an user can reach the functions of the CAD system from high level program language and has an opportunity to solve and to present his awn specific demands.
An exchange of data constructed by core of the CAD system (data are not only graphical) between systems of different supliers could be done by standard CAD data interface. The first such standard is IGEB (Initial Graphics EMchange Specification), but just now more new exchange data formats is being developed (PDES, PDDI, SET, STEP) /CADSS, ENDE86, WILS87/,
6. CONCLUSION
All problems of device-dependent graphical software have been solved with introducing the graphical standards. The suppliers of graphical hardware and software are aware of this and today some of these standards are accessible even on PC computers (for example GKS level 2b>.
In regard of development phases of graphical standards and their language bindings we can expect, that all GKS language bindings (with exception the language bindings far C, because it is not a standard language yet) will become the international standards in a short time.
The basi cal request for EKS-3D is fully compatibility with GKS. Currently, there are di scusions about compatibility among PHISS and GKS. It seems, that there will not be a compatibility at all, because GKS uses only one-leveled	graphical	data structure
(segments), while on other hand PHIGS manages with hisrarhical data structures, which are suitable for presenting the graphical models. PHIGS is intended for tirae-demand applications and so it more than likely will not be available on PC computers.
SPECIFIC ADDITIONS
PROGRAMMING ir^mRFACE
Core of CAD «yttem
MODELLING
PRESENTATION
CALCULATION
DIALOGUE
other CAD tyttem*
LANGUAGE BINDINGS
DEVICE AND LANGUAGE INDEPENDENT GRAPHCyU- PACKAGE
Standard Piclure Exchange Format
CGM
Other
ir graphical »yttemt
COMPUTER GRAPHICS INTERFACE
DEVICE DRIVER
CURRENT
GRAPHICAL
DEVICE
FUTURE
C»U>HICAL
DEVICE
Figure 4. A place of the standards itr CAD system
C6M has already became an inter-national standard in its elementary version. The work is proceeding now on an Kupansion o-f CGM, that it could support also GKSM (GKS meta-file) -format (this is format in which SKS saves graphical	in-formatlon through logical
workstation MO and MI)>
The evolution o-f CQI has been already started. There is a long way until CGI as an international standard will be accepted, because this standard should, not limiting the development o-f the graphical hardware. So we can expect a great interest and in-fluence o-f manu+acturers o-f the graphical hardware.
Re-ferences
B0NO85 Bono, P.R.,
"A Survey o-f Graphics Standards and Their Role in In-formation Interchange", IEEE Computer, Vol. 18, No. 10, October 1985
EONOai Bono, P.R. ,
"Guest Editor's Introduction Graphics Standards", IEEE CG&ft, Vol. b, No. 8, August 1986
CflOas CflDCAM Communications,
"A Practical Guide to Exchanging CADCAM Data Using the IGE5 Format", Department o-f Trade and Industry, 19B5
DEU584 Deusen, E.,
"Graphics Standard Handbook", CC exchange. Laguna Beach, 1984
ENDE84 Enderle, 6., K. Kansy, and G. P^a-ff , "Computer Graphics Programming", Sprlngei—Verlang Berlin, Heidelberg 1984
ENDESà Enderle, 0.,
"Inter-faces -for Storage and Communication o-f Computer Graphics In-formation", from the book Hopgood,F.R.A, R.J.Hubbold, and D.A. Duce,"Advaces in Computer Graphics", Eurographics Seminars, Springei—Verlang, Heidelberg, 1986
SELEST Selective Update;
"Two draft standards available -for public review",
IEEE CG & A, Vol.7, No. 3, Mach 1987
STRA86 Straayer, D.H.,
"Setting Standards",, Computer Graphics World, November 1986
WILSB7 Wilson, P.R.,
"A Short History of CAD Data Transfer Standards",
IEEE CS5<A, Vol.?, No. 6, June 1987
A SURVEY OF MOST IMPORTANT AND OUTSTANDING METHODS FOR SOFTWARE ENGINEERING
UDK 681.3.006
József Györkös Ivan Rozman Tatjana Welzer University of Maribor
With the presentation of most important methods of softwareengineering we want to contribute to the bettor knowledge, application and development tools that demand a systematic procedure for the software design. We have mentioned basic methods of requirements engineering and structured design which were already developed in the seventies. It was not earlier than in the eighties when complex tools for computer aided design of software were formed from these methods. In their essence, these methods may be devided into those which solve the problems by analysing the data flow and those which solve the problem by decomposing the data structure. The most important factor to estimate the suitability of application of a method is the level of covering the phases in the software life-cycle. The behaviour in real-time environment and the possibility to create consistent entiti-relationshlp models for more complex information systems are important in dependence upon the system.
2 objavo pregleda pomembnejših metod programskega inženiratva èelimo prispevati k boljèemu poanavanju, uporabi in razvoju orodij, ki narekujejo sistematičen pristop k snovanju programske opreme. Kaniaali smo temeljne metode inženirstva zahtev in strukturnega snovanja, ki so nastale že v sedemdesetih letih. Sele v osemdesetih letih so se Iz njih izoblikovala kompleksna orodja za računalniško podprto snovanje programske opreme. V osnovi se metode delijo na tiste,	ki k reševanju sistema pristopajo z analizo toka
podatkov in tiste, ki to iavajajo z razgrajevanjem strukture podatkov. NajpomembnejŠa postavka pri	ocenjevanju
primernosti uporabe neka metode je stopnja pokritosti faz življenjskega cikla programske opreme. V odvisnosti od sistema pa je pomembno tudi obnaSanJe v 'real-time' okolju in možnost kreiranja ćvrstih entitetno-relaciJskih modelov za kompleksnejše informacijske sisteme.
0. Introduction
A multitude of actions leading to a consistent, realiable and well documented software is called Software engineering. In the attempt to normalize thè "multitude of actions", several methods have been developed from the sixties onwards. Regarding their applicability and above all their "usableness", certain methods did not reach their climax earlier than In the eighties and most often as a base for automatic tools for software design. Let's see a brief survey of methods of software engineering /Kell87/
1) the sixties may be called also the period of crisis In the software development because it was recognized that it is impossible to create effective programs without a slstematic procedure. In this period the philosophy of Structured programmine was formed ( important author^ : Böhm, Jacoplni, Dljkstra ). ii) ;ln the seventies the previously mentioned recognition caused that various technologies were developed ( structured anallsys, design )
and term software life-cycle was introduced. In this period, new methods lived only in scientific circles, therefore this period is called "the period of searching for panacea" ( Important authors	Constantine, Myers,
Yourdon, Ross, DeMarco, Sarson ). ili) in the eighties, the methods of the seventies experience a revival. With the Integration of some methods a series of effective and commercially successfull automatic tools for software development appeared.
In the introduction let us consider the term " software life-cycle". The software life-cycle covers six basic steps /Porc83/:
-	Requirements Anallsys
-	Functional Specification
-	Design
-	Implementation
-	Validation
-	Maintenance
1. Bequlrements Engineering a3 a Part of Software Engineering
When we are solving problems by means of software we make a photo-copy of the real world in a logically clipped form. This form must substitute the reality in certain characteristics that are required. Therefore In the complex process of software engineering the requirements specification is of essential importance. As a rule, in this process are envoived the developer of the project and the customer. The process of requirements engineering, that is the cooperation mentioned, can be gathered into four principal points /PressS?/ :
i) recognition of the problem il) development and synthesis
iii)	specifications
iv)	survey evaluation
The coordination between the developer of the project and the customer is made by the analyst ( who is often called the system analyst ), the system engineer, the programmer/analyst and the like. He must distinguish himself by his adaptability, ability to	abstraction,
communication,■ knowledge of the environment ( hardware, software ) in which the project will be realized.
Picture la shows the process to accord and define the requirements, ( see /RossBS/ ). This process, called the reader/author cycle, is a part of the method which will be considered in detail later ( SADT - Structured Analysis and Design Technique ).
A READER :
- clarifies terminology - checks completeness DIAGRAMS	- checks consistency
- assures accurancy - corrects syntax
REACTIONS
Picture la "Reader/author" cycle of SADT Method
Besides SADT method several other methods for software analysis and implementation of specifications are developed. This is the requirement engineering. Each method posesses a specific procedure and' therefore also a different notation. The methods pass the above mentioned four points and must meet the following three demands:
a) Understable presentation of ' informational and functional domain in order to analyse the requirements of the problem,
Information flow shows the way of data transformation when the data pass through the system. As the result of studying the information flow the data structure for the processed system is obtained.
The functional requirements /Roma86/ describe the dependence of the components upon each other and upon their environment. The whole system, the program or the element of hardware can appear as a component. A conceptional model is the result of fulfilment of functional requirements. Its level of understability should be adapted to the environment for which is determined. Non-furictiv.inai . requirements cause a problem because they can exert an essential influence on the complexity of
design. Many difficulties cannot be known in the early phases of design ( it is difficult to determine e.g. the influence of the required level of software reliability). The software reliability is closely connected with the powerful testing tool. It is difficult to foresee the influence of the "human factor" and the respondence of the finished product to the errors. It is impossible to formalise the process of maintenence completely.
b)	Division of the problem into understandable and surveyable partitions.
The division of the problem into easier and more understandable parts is carried out by a vertycal hierarchical decomposition or with a functional decomposition in the horisontal hierarchy.
c)	Logical and physical presentation of the system.
The task of the requirements engineering is to clarify what should be realised and not how It should be done. Logical presentation of the problem forms the base for software system design. The analyst will not include the phisycal presentation until the logical presentation la faultless. It is necessary to define e.g. exceptions in the hardware configuration, system for database management or specificity of the applied operating system.
In the process of analysing the user's requirements a document Is formed which is often called speci-f icstcms . A standard is existing for this kind of document ( The Hat ion»! B'jreau of Standards, IEE£ Sf artdac o' Nif. S30-19B4 ), but the automated methods of software engineering do not follow it entirelly.
It can be concluded that the task of the requirements engineering is to introduce a systematic and inform notation of the Information and functional analysis to solve a software system. According to the software life-cycle, mentioned in the introduction the first two points are met by the requirements engineering. The development trends in the eighties support the development and application of computer aided integrated tools for software engineering : CASE Computer Aided Software Engineering, They are overbuilt and the most effective among them meet the development all phases of the software life-cycle, In the following chapters, some tools will be treated in detail.
2. Methods of Software Engineering 2,1. Basic Division of Methods
The basic division is taken from /Press87/. The selected division implies to the phase of system analysis, thus it is undependent from the design method. The latter, is chosen according to the ' obtained functional decomposition ;
1) duta floti oriented methods ii) data ^triicture oriented methods
In literature the third group is called au/rc.imfli-fpd methods . It is true that in this group the methods were made as automated ones and the philosophy of SADT method ( author D.T. Ross ) served as the starting-point to design methods from the first two points. Nowadays many methods from the first two mentioned groups are automated In the form of CASE tools. SADT method will be described in detail in a separate, third chapter.
Our description of methods is limited ( following our opinion ) to most effective and
most popular ones. The criterion of effectiveness la how many phases of a software life-cycle are covered by the method.
2.2. Data Flow Oriented Methods
All methods of this class have in common that they perform the structural analysis by means of data YIon charts. So they give preference to the data transfer through the system over the information structure on which the system is based, The syntax of data flow charts Is taken from /Gane79/ bubble chart and mainly from /DeMa79/.
Tom DeMarco describes very accurately . the processing of analysed system until the functional decomposition is reached. The continuation of processing by means of Btruotural design is taken from /YoufVd/. On the base of literature cited automated tools from the field of software engineering are constructed. The stress is laid on the system analysis by means of data flow charts. Nowadays the following products are awailabile on the market :
-	ANALYST/DESIGNER TOOLKIT; product of the firm Yourdon Press, New York
-	HP Teamwork SA/SD/RT; product of the firm Hewlett Packard
-	TEK CASE; product of the firm Tektronix
-	CASE; product of the firm Microtool, Berlin
SASĐ ( Structured Analysis Structured Design ) is the common name for the methods described. It is typical for SASD that it covers all , phases of life cycle in the software development C "life-cycle modeli" /KellBT/). SASD offers a good survey over the project and enables a team work. An effective functional decomposition from the top-down is the reason for it. As in the of this method a series of understandable tools and those being near to the man is used ( a detailed description is given in the following chapters ) the method permits immediate correction of the existing faults.
Automated tools always offer an up-to-date version to all members working on the project what is of special importance when individual members of groups do not work in the same place.
Data storage ; the symbol means a file or an Interface where the data are stored or from where the data are obtained.
Picture 2.2.1.1.a Basic elements of DFC
In order to follow the information flow best when analysis is carried out it is advisable to name the processes at the end. Data flows ( information flows 3 that condition the necessary process form the base and not vice versa. Unfortunately, automated tools require to name the processes immediately. The picture 2.2.1.1.b shows the decomposition into the depth of an imaginary problem by means of DFC.
DFD3		
13.1 r		v'v"^
/ \ /		
f r / f		V
DFD3.3		
		"asi-*
0FD4 ^^	
	
CjL.	
	
1	
	
---^.jjl---	
	—si?
^__	
2.2.1. 2.2.1.1.
System Analysis of SASD Data Flow Charts
Picture 2.2.1,1.b Decomposition into the depth by means of DFC
2.2.1.2. Data Dictionary
Each arrow in the data flow chart means one or more data elements of a piece of information. The data element /DEMA79/ cannot be decomposed into its components. Therefore it forms a basic element of the data flow. In the data dictionaryevery data flow should be decomposed into elements. To do this a special notation Is needed;
The data flow charts can be used on every level of system abstraction. They consist of four basic building blocks ( see Picture 2.2.1.1,a ) from which the fundamental system model is constructed first. Then it is decomposed into several more detailed and understandable charts. The basic model is called level 01 of the data flow charts ( further called DFC ).
fxternai entity represents the source or destination of system data. The system data can be the elements of hardware, interactive intervention of the usere or connectoin with other software systems.
Pr<?cess : this sign is Used when data are transformed in such a way that new data are obtained or the existing ones are converted.
Data floN : serves to connect other basic elements of DFC. The arrow shows the direction of the flow. Each flow should possess its own uniform name.
symbol	meaning
-	equals
+	logical and
	logical or
{ I''	n iterations of
	bracket contens
( )	optional data
* *	comments
In the design of information systems that are aided by powerful data bases the data dictionary and data store /Gane79/ are needed to model the relational shema in a normal form ( Codd's normal forms ) /5how87/. The fact that the relation shema of the data base of the system designed can be formed by the structured analysis / Gana79/ essentially contributes that this method can be used. Here we find a linking point with methods that are concentrated on the data structure.
E.g. the automated tool TEK CASE alone forms the data dictionary to such an extent that the necessity to describe the undefined data flows is shown. In the data dictionary the syntax of the record Is controlled automatically.
2.2.1.3. Functional Docomposition
With the data flow chart and the data dictionary it is satisfied to the information domain of the problem analysis. The transformations ( processss ) in the product are described aa a functional domain. For this purpose structured natural language, most often " Structured English is used. Such clippéd language is called Program Design Language -PDL.	^
Let's show the formulation of the functional decomposition by means the data flow chart /PreosS?/ - Picture 2.2.1.3.a.
Selected DFD bubble
Information
about processing
Apply structured English
Functional description
Picture 2,2.1.3.a. The Process of Functional Decomposition
The dictionary of the language used for the description of processes should contain english words in the imperative form, expressions from the data dictionary and reserved words for the description of hidden logic ( for this purpose the usage of decision tables is recommended ). The sintax of the language usad for the description of the processes pertnits simple sentences such as : POT, GET, HEQUEST, etc, and obligatory	includes	activity .	based
constructions for the description of sequence, choice and repetitions ( IF, CASE,\ REPEAT, WHILE, etc. ).
2.2.2. System Design SASS
System design of the SASD method is taken from the source /Your79/. The transition from the data flow to the first phase of structured design, which is called structured charts, can be made in two ways. The first and most often used way la the transform analysis The second way, called the transactloD analysisis because of its exaggerated formaliaation less often used.
The nucleous of Transform analyslsia that the data flow charts are devided into the afferente part, central or transformation part and efferent part. The basic three components of the structured design are described. In the chapter 2.2.2.4. the example of the transition from the data flow charts into the structured chart Is described by means of transform analysis.
The design by means_ of operation analysis is more effective from the transform analysis only in those oases, when the centre of the transformation cannot be uniformly identified.
This is the case in split data flow charts ( e.g. several output flows ). The process that splits the input flow is 'called transaction centre. Around this centre a / suitable structured chart is organized. Let's see the basic tree phases of structured design :
a) Structured Charta
The data flow chart shows a network of processes that needn't be connected In the final program in the same way. The model of the system in the form of software modules is ensured by structured charts. The term 'module' dendes procedures or functions in the target programming language.. Thus, the structured chart directly shows the hierarchical structure
of software.
More about the syntax of structured charts can be found in the literature mentioned /Your79/, The most important characteristics are described by the words :
-	rectangles are used to denote modules. External modules, e.g. libraries are denoted by double vertical lines.
-	arroiis connect modules into various hierarchical leves.
-	arroHs fjith circlets denote communication interfaces between modules; the direction of the arrows shows the direction of the control paths and data items.
special symbols give information on the procedurality of the system; these are Iterations, conditional choices between modules, common blocks.
b)	Data Dictionary
The data dictionary used in. the structured design is the same than that used in the structured analysis. Here it is completed with new data obtained from the structured chart.
c)	Description of Modules
The task of the description of modules is to explain the activity based characteristics of the modules. A pseudocode is obtained which is the direct Input into the program implemented in the selected program language.
A simple example to create and follow the knowledge base is given. The picture 2.2.2.a shows the data flow chart of the problem that was devided into three basic parts, according to the tr sTtsfor mat ion analysis.
central, part
e-f-ferent. part
error, report
Picture 2.2.2.a Division of the data flow chart Into three parts
Each part in the data, flow chart has a corresponding functional part in the tree of structured chart. The root of the tree is a new functional component which connects parts ( picture 2.2.2.b' ) . The central part of the division of data flow chart can also play this role. The starting hierarchical structure of the problem is obtained. As the processes are not described acurately by the individual functional components a further factoriz'ing within individual divisions in the data flow charts Is needed. This is done by top-down design.
Picture 2.2.2.b Structured Chart
2.2.3.	Implementation and Testing in SASĐ
Implementation of the system is based on the top-down procedure and adding C incrementation ) of new modules. Incremental procedure demands that each module should be developed and tested separately and then in combination with other modules. In such procedure the member of possible faults is decreased and the testing costs of the system are lowered. The optimization of the implemented system should be done as late as possible and only in cases when performance tests not uncet the requirements. In this case the optimiaation should be concentrated on selected modules because it exerts a bad influence on important characteristics such as performance, applicability, reliability and simple maintenance of a well designed system. In the field of testing and evaluation of software reliability the tendency is to automate the tools with uniform criteria. Additional information on practical access to this problem is found in literature /Roam87/.
2.2.4.	Application Maintenance and Owerbullding In SASD
If SA5D is formed through all phases into a concluded form, an exhaustive reference-book is given. The documentation on original design is given by data flow charts and structured charts, the definition and organization of data is given by data dictionary; an accurate description of processes and module structure is given, too.
Eventual corrections in testing modules should be stored as a certificate on suitability of the system and as an aid to overbuilding. The maintenance of the system is simple. The influence of hardware changes or the modified requirements of the user can be thorougly studied in the documentation through all development phases of the system.
2.3. Data Structure Oriented Methods
As we learn from the title of this chapter these methods lay a greater stress on the construction of regular data structure than to the course of data structure. Two methods are mentioned that specifically solve the problem.. They have four common points:
i)	each method demands that the key information object, that is the entity, and the operators or the process should be determined at the beginning;
ii)	the second common point of these methods is that their starting-point is a hierarchical structure of information;
ili) the data structure should be described by basic procedural constructs - these are the sequence, decision and repetition;
iv) these methods contain effective tools to convert the problem from the hierarchical data structure into the suitable software structure.
2.3.1. Wamler - Orr Method
From the theory of Warnier-Orr charts the DSSD method ( Data Structured System Development ) was developed. Vfarnier - Orr charts implement a hierarchical analysis of information domain. The global picture presented as multitude is decomposed. It is further decomposed into a series of submultitudes. Everithing is interconnected by basic procedual constructs. Braces are used to devide hierarchical levels. A quick and effective explanation of the meaning of the actions which are possible by Warnier-Orr notation is shown in picture 2.3.1.a /Higg79/.
WHY ?
implementation of the task
WHAT ?
beginning of the task
end of the task
WHEN ?
Picture 2.3,1.a
HOW ?
Directions charts
in
Warnier-Orr
We can see that the reading of the events from top of the chart complex downwards gives a sequence of temporal course of events. In the left direction the explanation of events is 'globalized'. To the right, there is a microlevel of described system.
The first step in the system analysis is to di!fir,e> the exit of the process. Then the logical data structure and later the corresponding physical structure are defined. In this phase the entity-relationship model can be formulated. The design of physical model follows from bottom-up but we should not think that Warnier-Orr method is based on botton-up design because the whole logical composition was performed from top down. Practical experiences of design with this method are described in literature /Gyor86/ and /Rese06/.
2.3.2. The Jackson System Development Hethod
The JSD ( Jackson System Development ) method /Jack83/ ensures a methodical way to design complex problems. The expected design result is an objective and repeatable system which is independent from the creativity of the
designer. The designing process can be adjusted and is heavily influenced by the user. The basic idea of this method is that it is possible to design for the user an interesting part of the real world by means entities and actions. The entities are basic elements that can be recognized by the user. Actions are implemented over the entitles described and can be changed from one state to the other ( the tezm 'states' means various phases in the life cycle of entities ).
With the definition of entities and actions and with the formulation of structural shema the design phase of a model is concluded. In the following phase a suitable distribution and definition of Individual functions is made. The functions are Included into the existing model at a definite moment and under definite circumstances which ensure a correct operation of the functional model.
JSD has a very rigid delìmination between design and implementation. The application of the existing hardware and program languages by-means of which the designed system will be realized in a classical way or by JSP ( Jackson System Programming ) will be not included earlier that in the last phase, that is In the phase of implementation.
JSP is also a data-oriented method. It is meant for program design and it is based on the structure of input and output data. The basic ideas of program design by means of JSP are transfered into the design of greater problems and systems. This means that JSD forms an extension of JSP, Everithlng what is used in program design according to JSP can be used in system design according to JSD. The common points and differences between the two methods are shown in the following table ( Picture 2.3.2.a).
action
JSP
JSD
description of the problem
terms of sequence processes
terms of sequence processes
the model	structured	a multitude of
consists of	charts	unlinked proces-
ses for the description of temporal behaviour of entitles
description of the problem is concluded by
implement.of adding and impl. uniform proc. of processes structure	which form the
problem described
implementation
problem can be described immedlatelly
description of the problem is converted into suitable form
Picture 2.3.2.a Comparision between JSD and JSP
The JSD method is meant to design system in which special attention should bo paid to temporal condition. JSD is used to design a wide spectrum of applications based on 'online' or 'batch' manner. These applications are taken from the real world and temporal extension Is of supreme Importance. From the literature /Came86/ it is seen, that this method is most often used In Great Britain where many systems realised by means of JSD are in operation. The method is widely used in Western Europe and less often in North America.
3. Structured Analysis and Design Technique SADT >
This method was developed in the seventies, in the firm SofTech, Inc. when they searched for an effective tool to describe the software architecture of large systems. Douglas T. Ross /Ross77a/ and /Ross77b/ was the first who wrote on SADT. The latter source deals with the applicability of this method for extremely activity based and data-strong systems. SADT is cited as a reference for methods such as Yourdon, Jackson, Warnier~Orr, Petri nets, The multitude of PDL-s (Program Definition Language ), paeudo languages and for typical data-oriented tools Codasyl, Entity Relation Attribute and otliers.
SADT is very effective in the early and lata phases of software life-cycle. The detailed design mekes a bottle neck. SADT can overcome It with the inclusion of poverful languages for the definition of process ( PDL ). The most effective tool of the SADT method Is treated in detail. These are 'box-and-arrow' charts which describe activity based and data aspect
of the analiaed system. These charts are called graphic language of the structured anallsys ( in SADT, of course ).,
3.1. Graphic Language of SADT
Structured Analysis
The basic guideline in the structured analysis is uri>Jerst.-ibiliiy and simplicity . Therefore this method uses the decomposition from top down to the most simple basic elements. The .graphic - language Is based on the str^ìct^ircd ' ^t; Uly J is boxes ( further called SAB ). In the picture 3.1.a the extensiveness of such basic element is shown. The features of individual SAB are described with ICOM ( input, control, output, mechanism > codes. Each SAB has its own number. On every level SAB can be decomposed into hierarchical lower components ( the relationship parent - child; like in data flow charts ). The decomposition cannot be done on the lowest level { atomic level ) because shown system must be entirely understandable.
WHY
CONTROL N
INTERFACES
BOX IS
FORM OS
TRANSFORM
CHANNEL FOR
SHARING /
MECHANISM HOW
DATA
= ACTIVITY
{DUAL OPPOSITES )
Picture 3.1,a The extensiveness of SAB
The wpdei is called a collection of interconnected charts. the model /RossBS/ defines: M is the model of A if M can be used to answer the questions that are put about A. The quality of the model is determined by the extent of questions and suitability of answers".
Models consist of a multitude of SAB by which the decomposition of system is made. Two kinds of konnections, shown in the picture 3.1.b are known : support connections { support arrow ) which are actually global nests in both models and call connections ( call arrow ) which can be imagined as a call of subprograms, in the terminology of program languages. The model syntax permits also that the recursion is shown.
The graphic form of requirements definition can be formaliaed by RML ( requirements modelling language ).
3.2. Applicability of SADT
Besides having influenced the development of very effective methods the SADT idea is successfully used in the development of projects dealing with various fields ( formulation of requirements in various fields of multinational societies, design of complex business	systems,	development	of
telecomunlcation systems and unfortunately many complex military programs ). It cannot be said on which field is SADT most successful. With
MORE GENERAL
DIABRAn = WHOLE / / BOX - PART J _ _// ARROW " INTERFACE
LJ^
ICOM CODES
DETAIL OF BOTH V/A3Z AMD X/A33
Picture 3.1.b Decompósition and multiple model
combinod activity/data models and thorough hierarchical decomposition the method covers a vide sphere of requirements, The author of the method, D.T, Rosa, aknowledges that the design and automation of design of detailed .analysed system present is bottle-neck. A very accurate requirements definition and	excellent
documentation made the method popular to design large systems.
4. Couolusion
In this paper an informative presentation of some methods that proved themselves in the development of software systems is given. In spite of the fact that the methods like PSL/PSA ( Problem Statement Language/ Problem Statement Analyser ), TAGS ( Technology for the Automated Generation of Systems). lORL language C Input/Output Requirements Language ) and SREM { Software Requirements Engineering Methodology ) are not treated we should be aware that these automated methods are widely used in the USA.
.^utcmatej methodff - each automated method is based on certain formalism. The requirements for automated methods must be very strict because possible errors in early development phases are in exponential increase in the following phases. It is said that those methods are good that with incorrectly presented requirements give no final results and the designer can not be mislead. In such cases the procedures within method should give results and this effect should be increased by automation. The creativity of the designer and his ideas still represent the dominant tool. The method should only orient him correctly, warn against the faults and formulate his ideas.
Trends o-f CASE tools: dfn'e 1 opmerit Besides the fact that the tool should be based on a proved formal design method which, supports the development of a program in all its phases, other requirements should be taken into account as well. These are dependent on the environment in which tool will be used.
In general, we can speak about three imortant trends in the development of CASE tools. First we speak about the suitability of tools for the real-timf? system development. Methods deriven from data flow charts often include the real time extension. For the present state of
the development of CASE tools it is tipical that too little attention is paid to the requirements of mathemeatica^ modelling and simulation. The necessity for immediate and detailed knowledge of hardware makes trouble the real-time systems are interrupt or event driven.
The second trend is to develop such tools that can model the data base by means of which the system is aided symultaneously with the development of the system.
The development trend of CASE tools aided by the artificial irite 11 igence is gaining in importance. Expert systems can be used to control syntactical and mainly semantical errors of automated tools. In order to fulfill the communication between the designer and the tool as much as possible object - oriented methods are developed. They enable a natural connection between the designed model and the reality described by the model /BorgSS/. An interesting problem arises when we try to design unstructured Al software systems by conventional methods of software engineering. Ideas to solve this problem are found in literature /PartSe/. The author warns us that for the present it is impossible to offer an accurate and formalized method to design AI software systems. LITERATURE :
/Borges/ A.Borgida,S.J.Greenspan, J.Mylopoulos, "Knowledge Representation as the Basis for Requirements Specification", IEEE Computer, april 1985 /CameSe/ J.R. Cameron, "An Overview of JSD",
IEEE Software Engineering, 1986/2 /DeMa79/ T. DeMarco, "Structured Analysis and System Specification", Prentice Hall, N.J., 1978
/Gane79/ C.Gane, T.Sarson, " Structured System Analisys Prentice-Hall, Inc., Englewood Cliffs, New Jersey, 1979 /Gyor86/ J.Gyorkos,T.Dogsa,I.Roaman," WarnierOrr Diagram Application Experiences", eth International Symposium Computer At The University, Cavtat. 1986 /Hlgg79/ D. A. Biggins, " Program Design and Construction",Prentice-Hall,Inc., New
Jersey, 1979 /Jack83/ M.A. Jackson, "System Development",
Prentice Hall International, 1983 /Kell37/ G.Kellner ( CERN Geneva ), Papers from " CASE Seminar Tektronix", Wien, 1987
/PartSe/ D.Partridge, " Arifical Intelligence Applications in the future of software engineering", Patridge/Ellis Horwood Ltd., Chichester, 1986 /Porc83/ M.Porcolla, P.Freeman,"Ada Methodology Questionnaire Summary", ACM Software Engineering Notes Vol 8 No 1 Jan 1983 /PresS7/ Roger S.Pressman,"Software Engineering ", McGraw-Hill Book Company, New York, 1987
/Rese85/ I.Rozman and others: Research report on Informatyon systems and Artificial Intelligence, 02-052^/796-65, Faculty of Technical Sciences Maribor, 1936 /Roma85/ G.C. Roman, " A Taxonomy of Current Issues in Requirements Engineering ", IEEE Computer, April 1985 /Ross77a/D.T. Ross, "Structured Analysis: A Language for Communicating Ideas",IEEE Trans.Software Eng..Vol SE-3, Jan 1977 /Boss85/ D.T.Ross,"Applications and Extensions
of SADT ", IEEE Computer, April 1935 /Rozm87/ I.Rozman, T.Dogsa,"Empirical software Reliability Models", The 2nd Beijing International	Symposium	On
Computerized Information Retrieval, Beijing China, 12/1987 /Shov87/ P. Shoval, M.Even-Chaime, "Data base ahema design: An experimental compa-rision between normalisation and information Analisys". DATA BASE, vol. 18, 1987
PROBLEMS OF THE RATIONAL UNDERSTANDING OF INFORMATION	INFORMATICA 2/88
Anton P. Železnikar
UDK 681.3.068	Iskra Delta
Character^tic problems of mathematical ineufficiency and inadequacy appear in the rational, especially mathematical expression of information, i.e., in the expression of proceBses of informational arising. The mathematical formalism, which was quite sufficient for the purposes of describing some well-determined linguistic constructions, does not satisfy in its traditional form, when the reality, which is arisingly changing) unforeseeable, and developing, has to be expressed. Because of such (dynamic) informational phenomenology, a critical view in the groundwork itself, which constitutes the classical symbolic or formal logic, is necessary. Besides, it is necessary to omit the rigid and for today's circumstances of thinking too hard and unacceptable mathematical definitions, as for instance, the logic itself, relation, set, function, differential, integral, category, etc. All this doès not mean that mathematics is losing its previous role, but it becomes evident that mathematics can no longer seize constructively (substantially, completely, or credibly) and scientific-developmentally into new realms of cognitive development.
The contribution of this article certainly remains . essentially far from a new, more adequate formalization as is the classically mathematical one. However, many problems are much better enlightened. The mathematical logic, proceeding from the thinking of the previous century and from the first half of this century, is historical-developmentally useful. However, its groundwork has to be reformulated in such a manner which will enable an easy derivation of various general and concrete logics. Within this concept, the contemporary formal logic of mathematics and mathematical systems became only particular cases and, in their essence, conceptually too reduced. Prom the context of this article it is evident how particular logical, relational, set-theoretic, functional, differential, integral, categorial cases of notions are bounded (limited) and inadequate and also, how their dynamic reformulation would be possible. But, this article represents only a soft introduction into this new discourse and an investigation of particular mathematical inadequacies. The goal of this preliminary discourse is, of course, to lay out the way which leads to something termed informational logic. In the framework of such new logic, it will become possible to develop essentially changed concepts of set, relation, function, category, etc. as was the case in today's and yesterday's mathematics.
The context of this article is certainly proceeding into the direction of a general criticism of those kinds of rational understandings that in the most cases deteriorate into the rigid rationalism and that, like traditionalism, set essential blockades against the progressive thinking. Thus, also of these kinds of rationality it ie true that after a period of development they degenerate into their counter-information which is nothing more than ir rationalism and, lastly, an unacceptable dogmaticism for the progressive thinking,.
0. Introduction	Is a rational understanding versus a
rationalistic one in the form of a loose definition of information in a broader sense In the beginning there was information. The	(as discussed in £2, 3, 4, 6)) at all possible?
word came later, Tiie transition was achieved	What are the possibilities of developing, a
by tAe development of organisms with the	genera! (not only the classic, information-
capacity for selectively exploiting	theoretical) concept of information in a quasi-
information in order to survive. and	mathematical (mathematically non-traditional)
perpetuate their kind,	way (e. g. through the informational formalism
of symbolic logic, set, function, differential, Fred I. Dretske [1] vii	derivation, integral theory, algebra, geometry,
theory of categories, etc,)? What does the
clasBical (oommunicational, stochastic) theory of information represent today [6]?
It ÌB evident that before a further discussion concerning a rational or even rationalistic view of information is possible, the mathematics-characteristic way to a rational (mathemational instead of mathematical^ formalization	(abstraction	and	the
corresponding symbolism} of information has to be enlightened. Aready simple examples can ahow why the classic rationalistic (mathematical) way is abysmal and how the deficiency of the discussed essential, rationalistically valid concepts is coming to the surface.
1. Informational Models
2, The Contemporary Theory of Information
Communication theory purports to teil us something about information, if not what it is, at least how much of it there is. ... this is controversial. ... The mathematical theory of information may be an elegant device for codifying the statistical features of, and mutua/ dependencies between, those physical events on which communication depends, but information has to do, not with vehicles we use to communicBte, but with what we communicate by means of them. A genuina theory of information would be a theory about the content of our messages, not a theory about the form in which this content is embodied.
Philosophers ... still seem disposed to think about knowledge, perception, memory, and intelligence with a completely different set of analytical tools: evidence, reasons, justification,	belief, certainty, and
inference. There is, consequently, a serious communication problem.
Fred I. Dretake [1] viii
Several simple and sophisticated models dealing with the rationaliatic formalization of informational subjects and objects can be distinguished. Scientifically, the most direct rationalistic informational model is the so-called theory of information. Its concepts are narrowed into mathematical meaaures of information and can be numerically "calculated" by different mathematical functionals. The theory of information constitutes the so-called naive	(rationallstically	reductlonal,
comprehentionally simplistic, Informationalty quantitative) model of information and represents merely (exclusively) the field of relevance of the so-called generalized theory of (technological) communication systems.
The naive model of information can be advanced already by redefining the components of the generalized communication system in a more complex way, e.g., by raising the question of the nature (the essence) of information source (arising of information or coming of information into existence), information channel (propagation of information through biologically modulated and filtered paths), information coding (various phenomenological transformations of information), information (or merely data) transmission and reception in living and artificial environments, etc. On this way, examples of living information models (philosophical investigations) can be helpful in the development of a critically much more adequate (informationally non-naive) discourse concerning information, than that following from the classically narrowed information theory. Last but not least, a general model of information can be imagined, embracing some essentials of already existing rational, rationalistic and intuitive models of information.
Fred I. Drelske HI 40
Information as a notion has different meanings in different environments (for instance, in different lingual cultures and in different scientific disciplines). Information does not have the same meaning in the realm of common sense, communication theory and engineering (technical sense), crime investigation, theory of information (mathematics), information technology (computer science), science (linguistics, information or even in a general information (information as information).
information retrieval), philosophy
Today's theory of information or information theory, which is founded firmly on mathematical, telecommunicational, and computer methodologies (and technologies), is rational-istically traditional and comprehends, for instance, the following applicative and theoretic areas:
-	application of information theory,
-	complexity of information,
-	communication systems,
-	cryptography and security,
-	data networks,
-	detection and estimation,
^ distributed information processing,
-	error-correcting coding,
-	multi-user information theory,
-	pattern recognition,
-	processing of audio and visual signals,
-	Shannon's theory,
-	source coding,
-	stochastic processes, etc.
Today's theory of information is grounded on merely some characteristic mathematical and technical concepts, but in spite of this fact, it has created many useful concepts by itself as a specific discipline. It must be stressed that information theory, in principle, ignores the problems of informational arising (e.g. coming of information into existence, irrespective of the nature of informational source).
Within the theory of information the so-called information	theorist's	definition	of
information is used in its mathematical {or engineering-technical) sense. This concept has the name information which is (narrowed) enough for the purposes of the theory of information.
However, it does not embrace a common idea of information appearing within different living verbal languages or even living organisms,
Information can be defined in mathematical terms in several ways. The amount or quantity of information is a measure of time or cost of transmitting messages from the information source to the destination. The information source is extremely restricted to be capable of putting forth n probable messages, so that information can be defined as log n and generalized to the entropy function. Information capacity of a channel (the number of distinguishable signals in a time unit> gives a measure of how long it takes to transmit the message generated by source.
An information source has only the role of information supplying arid, in fact, it is not concerned with information arising or with the fundamental problem of how and why information is coming into existence and what information represents when it arises as information in general. The information source simply delivers codified information where the arisen information was coded by a source coding procesa, eo it can be fed into the channel. Decoding is therefore needed to transfer information from the channel to the receiver. Sometimes, because of noise in the communication system, the information obtained by the receiver is erroneous. This is the concept of the noisy channel.
In information theory, one of the most important fields of' mathematical science, the amount of information is estimated by means of entropy. Several characterizations of entropy have been given by Shannon. Information theory combines various methods of probability, statistics, functional analysis, Fourier analysis, and algebra,
3. Logic qua Information
philosophical, scientific, or mathematical inforraationism.
In ite foundation, logic (the Greek 'logos'' has the meaning of speech, word, mind, thought) is a philosophical discipline dealing with valid (true, correct) forms of thought and with methods of scientific cognition. In this sense, logic is the study of the structure and principles of reasoning or of sound argument, with the aim of establishing the truth of propositions in the form of deductive and inductive inference. There exist several other kinds of logical inference as studies of reasoning which may be termed the doontic logic, logic of norms or logic of imperatives, logic of relevance and necessity, modal logic, logic of language, logic of believing, knowing and inferring, logic of inconsistency, epistemic logic, and, last but not least, informational logic. However, in its narrower sense, logic is the study of the principles of deductive inference, or of methods of proof or demonstration. Informational logic is the study of informational principles including also information and Informing as methods of informational inference through the coming of information into existence.
Informationally, logic is merely one of the possible doctrines of reasoning which is termed to be rational. The science of deductive logic has its roots in the conception of establishing propositions by means of such arguments that it would be irrational to reject their conclusions, having accepted their premises. The nature of logic as information will depend on the natures of truth, knowledge, and cognitive abilities of a being, and informationally, on the being's total information (a. being's metaphysics). However, imperatively, logic as a cultural and as a populational information can (or better must) remain characteristically rational, i.e., culturally and popuiationally doctrinal (rationalistic).
Logic ia tlie scienco of the puro Idea; pure, that IE, because the Idea is in the abstract medium of,Thought. ... Logic might have been defined aa the science of thought, and of its Jaws and c/iaracćej-jstic formg, ... If we identify the Idea with thought, thought muet not be taken in the sense of a method or form, but in the sense of the aelf-developing totality of its laws and peculiar terma.
G. W. F. Hegel [9] 25
3/0
Logic is close to a doctrine of essence, i.e., of the absolute. This doctrine is purely informational and arises, for instance, as common sense, belief, reasoning, awareness, and like .that in living cortices. Similar as language, logic as philosophical, scientific, or mathematical discipline concerns particular informational domain dealing with essence as a measure of absolute truth in its own realm. In this respect, logic belongs in the class of
3/1
The creator of European logic was Aristotle and his sucoesaorB, the philosophers of Megsrian-Stoic school, who have established the principles of deductive logic. At the beginning of the 17*" century, F. ■ Bacon introduced the idea of a new, inductive logic. The predecessor of the so-called symbolic logic was Leibniz, but its most distinguished creators were G. Boole, G. Frege, B. Ruasel, B. Carnap, A. Tarski, etc. In his Begritfschrift (1879), Frege introduced quantifiers and the treatment of concepts by analogy with mathematical functions. This enabled him to unify the logic of propoailione (propositional calculus) with the study of those logical relationships which had previously been treated in the theory of ayllogistn. Frege's approach was adopted by Russe! and Whitehead in the writing of Principia Mathematica, where the focus of attention was shifted firmly away from terms to propositions and their relations.
Even prior to Fregean innovations, the study of logic had become increasingly mathematical. The
question is how logic can become an informational discipline. Mathematics and mathematical logic are merely specific forms of information. However, one would like to construct a system of lo^ic which would adequately embrace the realm of information. Within mathematical logic it is possible to treat a formal logical system as just another system of algebra giving rise to an algebraic structure that can be studied using mathematical methods. In the following sections we shall try to reveal some "logical" problema of informational algebraization of logic.
LogiCi especially the mathematical one, is a logical construction. It uses its own language, which is narrowed into the rationalistic domain of deductive and inductive Inference. Mathematical logic is a language which is formalized by specific symbolism as a form of gibberish of symbolic logic.
The effective logic uses junctors and quantifiers (logical connectives or logical operators) to construct propositional formulae (logical expressions). These formulae use logical variables and logical constants (logical operands). Junctors and quantifiers are symbols representing certain 'devices, independent words, prefixes, suffixes, and inflections in any language that enable one to discern the grammatical structure of a sentence.
In order to represent the logical structure of a sentence, junctors and quantifiers are introduced in some places of the sentence. The logical symbols thus have a precise (constant) function. The most commonly employed operators are negation, conjunction, disjunction, implication, the existential and universal quantifiers, and the true and false symbols. A logical proposition (formula, expression) is either true or false. In this way, a two-valued logic is constituted. Certainly, U is possible to define alao a multi-valued logic using adequate logical operators.
acceptable too.
The logical quantifier 'there exists at least one' is in the strict informational sense non-logical too. That which exists inf ormationally is continuously changing, coming into existence. Thus, the informational transformation
there exists at least one x —>
there arises x
is reasonable. However, information can arise merely if its arising (Informing) already exists (otherwise, x cannot arise).
(x) has the meaning 'for all x which exist*. As pointed out, this quantifier could be infor mationally introduced for all those cases whose arising is in the domain of a generative set (a generative set G is defined by its characteristic informational function). For this purpose, the denotation A* C with the meaning 'for all which arise within the generative set (?', would be appropriate.
Similarly, the existential quantifier 'there exists at least one x' could have the meaning 'for those X which already arise' (in a particular case, they can exist as unchangeable information).
In metamathematics, the existential (there exists at least one) and the universal logical quantifier (for all) are bound to a fixed existence. However, in information, this bindingness concerns the coming into existence. In the informational sense, these quantifiers could be used as 'at least one x is coming into existence' (instead of 'there exists at least one x') and as 'for all those * which are coming into existence' (instead of 'for all i'). Evidently, the existential and universal logical quantifiers are losing their general meaning in the informational realm, for they represent cases of informational constancy (statics, datingness).
3/2
3/3
Let us consider the universal quantifier fxj with the meaning 'for all	Informationally,
this quantifier is questionable, for it can exist (or arise) 'at best one case of x'. In fact, this quantifier is pretending that ail x of some domain (set, class, category) have a common property or, in other words, are equal concerning this property. In this manner, the universal quantifier hides a kind of informational equalization. Thus, we can have the	following	reasonable	informational
transformation:
for_all X ~> at best one case of x
At this point, we are coming to the question whether it is at all possible to introduce reasonable	informational	quantifiers.
Certainly, it is possible to say that this and that information is in the process of arising. Simultaneously, it means that the existence of information, its development, its uniqueness (two equal cases of information do not exist) and, in the future, alao its vanishing or its transformation have reached such a degree of change that the identity to its previous identity is more or less not recognizable. As already mentioned, for the arising of information j, the quantifier A In the form
In this sense, the logical quantifier (xj passes into the informational one with the meaning at_beat_one_ca8e_of, which could be denoted by E? (E! is already used for 'there exists exactly one'). If we understand that the quantifier for_all x has in fact the meaning for_all X whÌGh_have_a_common_proper ty, then fxj as a quantifier is informationally
Aj
can be introduced. This means that information J is in the process of arising. If n cases of information arise, there could be written
A 2, i, U
For instance, for all informational cases belonging to a generative set G, one can write
AjYj belonge_to G)
In a sense the arising quantifier A by itself points to the generative nature of set G (informational consistency between A and G).
3/4
The next possible question concerns the negation of information. Informational negation is, of course, not logically (in the sense of mathematical or formal logic) exclusive in respect to the observed informational original. This means that information i and its negation .'i (the sign ' is used to denote the negation) do not represent to each other such a contrast (controversy) that, in the sense of classical logic, it would hold
J OK 'i = true and 1 AND 'S = false
In informational logic, j and	can exist
simultaneously, although the essential question arises: what is "i in respect to i.
In the case of a simple predicative (affirmative) information (is-sentence), the answer to the last question ia still possible, for instance, 'Peter is a wolf and 'Peter is not a wolf. But, in a much more interwoven information, the notion of logical negation is loosing or losing its sense. One can only state the fact that information and its negation are two (spacial, temporal, logical, informational) cases of information which, in ge.neral, are different.
In no case, informational negation can be an exact negation of the informational original in the classic logical sense. Negation as information is arising in the sense of informational arising similarly as the informational original. When ii arises from I'l, one. can write
theory, ■ union and intersection. In an informational realm, it is quite easy to imagine the so-called union of informational cases; however, the informational intersection is faced with the problem of searching equal pieces	of	information	in	different
informational cases. The informational intersection is based on equalisation of informational pieces as they appear in different information.
In cases of logical quantifiers and logical operations, it is possible to observe how the rationalistic symbolism is losing its logical steadiness. Maybe that rationalism as a form of modern cognition and understanding in philosophy and also in technology became inadequately rigid, untrue, and nonsensical (in itself non-rational) to the degree where it cannot stay any longer in the exclusive position as a prevailing principle of scientific and technological progressing. The searching of principles standing outside of rationalism is certainly on the way of surpassing those doctrines which block the progressing and more critical and differently organized thinking and also scientific and technological constructing.
3/6
Besides logical operators, the so-called modal logics use various modal operators. These operators are constant or, in a limited form, variable statements (propositions). For instance, for 'a believes p' one cane introduce a L p. However, for 'agent e believes p' one can choose L, p. When a belief after some time becomes the norm, it can be set Ki p with the meaning 'agent a knows p'. In this way it is possible to construct languages of ' logic and give semantics to them [11].
4. Relation as Information
4/0
it = Ajj;
However, the quantifier A which simultaneously has the property of operation of arising, is arising by itself. And, because A is information by itself, it becomes
AA
as arising of the quantifier A. This leads to a form of the chain of arising, for instance,
in - Ain-J - AAj«.i = ... - A"-'jj;
This informational ex'p.resBion represents a recurrent autoinf ormational and alloinfor matio-nal operation.
The Latin relatio (-onis, f) means report, proposition, motion; throwing back; returning; repetition. Today, relation has the meaning of the act of telling or recounting, an aspect of quality that connects things, reference, etc. Obviously, relation is a characteristic form of information. In mathematics, relation is a propositional function of two or more arguments defining, for instance, exactly those elements belonging to a set. These elements satisfy the propositional function of the relation, for instance, in concern to the components appearing as elements of the set.
4/1
3/5
The moat common logical operations are disjunction and conjunction or, in the set
Mathematics has developed several notions of relation. One of the most general relations is, for instance, the relation to be the element of a set. This relation seems to be mathematically-one of the most natural. In general,
mathematical relations can be determined by means of other relations, operations) esiatentials, quantifiers, axioms, predicates, and/or	verbal,	abstract,	intuitive
determinations, etc.
The notion, which concerns relation, is the so-called relational set. A relational set R consists of elements which are pairs (a, b) of some elements a and b belonging to some sets. Now, the relation ß which is represented by the relational set A, can be understood as operation Jl over elements a belonging to a set, in the way that it produces elements b belonging to a set. The expression
a R b
has the meaning that b will be produced by Jl from a.
conceptually transposed into the formalìBm and abstract understanding of information. Informing as a process of information can be comprehended as relation, i.e., as the moat general informational operation. As pointed out, this operation (relational information) is called Informing,
Let us examine the most general case of the relation we call Informing /• In general, there is
1 I J
This reflexive relational expression has the meaning that Informing / produces information i from information i. A more constructive relational expression is, for instance,
i I il
4/2
where ii arose by I from / and includes the so-called oounter-inforniation c.
What is the relevance of the relation in an informational realm? The most obvious informational relation is, for instance, that information, in one or another case (place, time, process, phenomenon), is itiformationaily different. If information is taken as a unity or an autonomous entity, then informational inequality between one and another informational case is the most natural relation. In the course of human social development, as controverny, opposition, antithesis, counter-information became usual, evident, and consciously regular ways of thinking, the controversy of informational difference (as the most evident relation in an informational realm) arose as an informational principle (as the counter-possibility), too. This way of inference led to the notion and relation of iatormatioaal equality. Although informational phenomena are naturally different, they can be abstractly counter-informed as equal. A characteristic similarity of observed objects ceased as an abstract way of inference into their (informationally dazzled) equality.
The relation of equality <in the mathematical sense) cannot exist between two informational cases (informational forme and informational processes). The relation of equality belongs obviously to the most abstract (unbelievable) cases of the real world. However, on the other side, the relation of equality is the foundation of each, quite arbitrary symbolism. Symbolic equality qua symbolism is in the domain of informationaUsm (irrationalism, ideology,	state-of-mind )	[3],	This
informational distinction for the case of the equality relation is essential, because also the so-called sign of equality '=' will be used in further explication of the mathematical or forma] informational (mathemational) background or field of relevance. In ail our cases, the sign of equality will represent only a formal, abstract (theoretical) symbolic equality.
4/3
The concept of the mathematical relational set and its understanding as operation can be
As J changes and arises, so does I, In this way it is meaningful to set the relational expression
In
many
/ i U
scientific
(mathematical)
and-
technological cases 7	is a constant
(unchangeable, non-arising)	information. Under
these circumstances it is	possible to create
the so-called deductions!	chain (transitivity of relation J) of the form
i I i, I i, I i) I Ì4 ...
or, expressed in a shorter form, i I* i, ...
In the case where informing I is arising, the deduction schemes may become extremely complex and informationally interwoven, for instance,
i i I
i I ij h it Xi ia U ié ...
I il h JI h il I, ...
In the first line (chain) we have a kind of mathematical (definitional) inconsistency, for (i, I) appears as an element of relation i (i generates Informing I from i). In the second chain, relations are Informings Ii, It, I), etc,; however, in the third chain, relations are i,, it, ii, etc., and both chains are mutually dependent (developing). The second chain generates cases of information j,, ij, il, Ì4, etc. by Informings I, It, It, Is, etc. where the third chain generates Informings by cases of information generated by the second chain by these Informings.
S. Sets as Information
... aemanticB lies outside the mathematical theory of inforiaation.
acope of the
Fred I. Dretske [1] 42
The eet as a mathematical notion arose by the work of G. Caator. A set ia a collection of objects of thought or intuition with a certain realm, taken as a whole. However, this is not a strict definition of the notion of a aet. Each object in the collection ia called an. element or member of the set. The point of the set theory is not bo much to explore the nature of a set as relations among elements of a set and the set as a whole.
Cantor's naive set concept leads to various logical paradoxes. His set theory was reconstructed as the axiomatic set theory, which is considered to be free from paradoxes.
5/1
Mathematical sets can be defined in several ways. If possible, elementa of a set can be simpty listed down. Finite and infinite sets can be defined by means of the so-called characteristic (or definitional) functions, which are logical predicates for elements of a set. A set is a definitional statement which determines the relation of belonging between elements of the set and the set. In this way, a set is merely static (or unchangeable) information of a collection of set's elements. The set-definitional predicate is a mathematical-logical statement, and this statement is definitlonally constant (stable, unchangeable). For a set definition, the main problem is how to construct the adequate characteristic function to obtain the desired collection of elements.
In the set-theoretical discourse, the definition of the set as information is basically very important, too. If a set consists of elements representing informational entitles, these elements can	hardly be
informationally completely independent of each other. It means that besides this set several relational sets can exist, determining the informational connectives among set elements. On the other hand, sets as informational entities' In their reality can only be variable and not defined once for all. This reality claims for the possibility to define a set as a definitionally variable entity. In this case, the logical predicate representing the so-called characteristic function of the given set has to be replaced by an informational function for which predicative variability is not exceptional, but (informationally) regular.
If thè so-called informational set claims for its variable definition, the set-definitional predicate or set-characteristic function roust be a definitionally variable (non-mathematical or quasi-mathematical) predicate or function. In the next section we shall show some problems of defining informational functions.
Let us now introduce the concept of the so-called generative set ' which, in a slightly modified form, was proposed by the author already in 1975 [7]. A generative or, now, informational set fi-set for short) of
informational elements (i-elements) performs as an information generator, producing information which represents elements of this set. However, this set generator or i-set is not a generator in the usual sense, for generated elements are not simply accumulated, but can exist as generated, they can vanish, or can be changed through their life time. Formally, the definition of an informational set can be written as
i~aet = (i-element [ i-functioiì(i-eìem€nt)}
In this formal expreealon i-faactioa(i-element) represents a function i(i) where i stands for information. We shall- discuss informational functions more in detail in the next chapter.
5/2
Ideology, as treated in [4], is a complex example of informational function (informational form and/or informational process). An informational function can be used to define objects (elements) belonging to a generative set. Thus, it is possible to choose a proper, subideological function for creating sets concerning Ideological matters.
Let us look at an example of a generative set and the dependence of its structure on information constituting its characteristic function (informational set-definitional predicate). Let be given an informational pool of people belonging to a particular population. Obviously, this pool is a generative set.
Let us create an ideological subset of people whose behavior or suspicion of such behavior satisfies the characteristic function 'Human, who ia class enemy'. It is more or less clear that in the course of time, the characteristic function of the ideological subset will depend on informational clrcumatancea (complex informational environment), ariaing alao in an unforeseeable way. These circumstances will contribute informationally to the structure of the subset characterized aa 'Human, who ia à class enemy'. It is possible to imagine how this characteristic function of the subset is arising, changing, and, under unforeseeable circumstances, vanishing or, in the course of change, disappearing. In this respect, as one-can understand from the previous example, sets concerning any reality, even the most abstractly (ideologically) understood ones, in principle are generative. They are only particular cases of information or, comprehending their subatantiality, are informational (ideological, scientific, mathematical, etc,).
5/3
The concept in the informational fuzzy aet theory would be that an element has the information of membership in a fuzzy set as a generative set. The form of the information of membership can be in the most simple and logically reductional case that the information of membership is a probability or a mathematical measure belonging to an interval
of real values. But, the information of a fuzzy set membership can be any type of information, Informing, counter-information, embedding, etc. In this case, the information of membership becomes a kind of dynamic potentiality.
A fuzzy set has to do with the so-called universe of discourse flOJ- A fuzzy subset of a universe of discourse is a function
/; y —> [0, 1]
On the sets of all fuzzy subsets defined on the universe Ui operations of union, intersection, and inclusion can be defined, using the	informational	(also	mathematical)
predicates of maximum, minimum, and 'lower or equal*.
The concept of the fuzzy set shows how the membership of a set element depends on further information concerning this element. In this respect, the notion of the fuzzy set is nothing else than a discussion on the membership of an element. In this discussion, the most important fact is not so much the fuzzy set itself by its elements as it is the so-called membership set, which carries information and, within this information, the possibility or even potentiality for an element to be the element of a fuzzy set. In this way, the concept of the fuzzy set is reduced to the problem of setting information concerning the membership of set elements. In a way, the membership set of the fuzzy set is a set of characteristic functions approving harder or softer belonging of an element to the fuzzy set. As an arbitrary set, through its characteristic function, also the fuzzy set becomes more and more not the set itself, but its characteristic function. In this course of the discourse, the set concept becomes a set-general information, which could constitute the set as a concrete set realized as a collection of all possible elemente belonging to a set.
6. The Notion of Information qua a Quasi' Mathematical Function
... Communication theory does not teJl us what infonnation is. It ignores questions having to do with the content at signals, what specific information they carry, in order to describe how much information they carry.
Fred I. Dretske [1] 41
variable determine the values (completely arbitrary functions).
of another
6/0
The term function was introduced into mathematics by Leibniz (in 1694) to refer to certain line segments whose length depend on lines related to curves. The modern functional notation f(x) had been used in 1734 by Clairaut and by Euier, who defined functions as analytical formulae constructed form variables and constants. Cauchy introduced the terms of independent and dependent variables (1821). Dirichlet (1837) generalized a function as a correspondence In which the values of one
Today, a function has the meaning of mapping or univalent and also multivalued correspondence. A letter x, for which a name of an element of a set X can be substituted, is called a variable, and X is called the domain of the variable. An element of the domain of a variable x is called a value of x. A letter which stands for a particular element is called a constant.
6/1
The arising of information calls for a concept of the function which as the definition of a function is variable (arising-like ). First, let us try to define information by the so-called definitionally static function, i.e., by a sort of quasi-mathematical functional concept. In this case, a function is meant to be a mapping of a set of informational values of the functional argument x into (or onto) a set of Informational functional values (y). In principle and in general, Information can be formally (approximatively ) conceptualized as a circular system of two metainformational equations:
y - y(x) and X = x(y)
This system is the metainformational expression of the (approximative) nature (information) of information. Within these equations, x has the following meaning:
-	in the first equation, x is the original (starting) information;
-	in the second equation, x (left from ' = ') is the counter-information; and
-	in the second equation, jt (right from ' = ') is the so-called embedding (embedding Informing, embedding counter-Informing) of the counter-information y into the original information x (Ićft from '=').
The meaning of y is as follows:
-	In the first equation, y (left from ' = ') is the countei—information, arising from the original information x through the so-called Informing (counter-Informing) y (right from '=') and
-	In the second equation, y is the counter-information, which will be embedded into the original information x (left from
' = ') through Informing (counter-Informing) X (right from '=').
This system of equations is questionable in many respects and some of them will be illuminated in the following discuasion.
6/2
Through symbolic playing with mutual substitutions of equations, one can obtain, for instance,
y - rMrM ... (r) ... )))) and X = x(y(x(r( ... (X} ... W)
This system represents the comins of information Into existence through several cycles of Informing, where each cycle is understood to be a composition of counter-Informing and the embedding of information. The later system of functional equations is not very clear and the functional structuring of information, Informing, ■ counter-information, and embedding remains hidden, in general. So, a slightly different symbolism can be introduced, where the processes of informational generation and embedding are marked by capitals (X, Y). These capitals represent functional "prescriptions". Instead of the later system, there are
y = Y(X(Y(X( ... {y) ... )))) and * = X{Y(X(Y{ ... M ... ])))
This quasi-mathematical system does not show explicitly the coupling occurring between the consequent prescriptiona X and Y when the arising-embedding cycles are taking place. In this metainformational representation, the arising of X and Y cannot be explicitly identified, even this arising is implicitly present in the last metamodel. This model can be analyzed (indexed) in the following way;
y» = Yo(xa}, XI - Xù(yo)ì yt - Yi(xi), XI = xi(y,):
ya = Ya(Xa), Xatl = Xa(yn);
where Jn the consequent lines the so-called cycles Xt and Yt^i are listed, representing counter-Informing and embedding and where i belongs to the discrete interval [1, n + 1}. This case can be transformed into a clearer form
y. = r.rÄ-irf.-iCÄ-iC ... (yo) ... ))});
X.« = Xa<Y, CX-jfJV-jC ... <xo) ... )))}
At least, it is evident that the consequent Xk and Yt vary from one f-cycle to another.
6/3
A function y - y(x) is definitionally constant if its functional prescription (its definition) is given once for all, i.e., if it does not change during, its existence. It was pointed out that information needs a concept of variable functional prescription. Because of this, a prescription in the form
yt<x) tor ao <= X < at, y(x) -	yi(x) tor at <= x < at,
ya(x) for ar-t <- * < as,
where [a«, an] is an informational interval, is not adequate. This type of function represents merely a sequence of constant functional prescriptions yj, ... , ya.
Let i be information (for instance, a being's total information) when in the framework of this information. Informing I . is coming into existence. Informing I is a generative processing product of information j. A trivial description (formula) of this generation would be / :: i(i). This formula informs that information / has generated Informing I from (or on the basis of) information i. In this case, information i appears as the generator of I (informational host of Informing I, in the sense that / is the argument and the producer to itself) producing counter-information c. This fact can be denoted by c = I(i). Counter-information c can now begin to be embedded into information i. Again both, information i and through the Informing I generated counter-information c, have now to generate the information of embedding E, by which counter-information c will be embedded into source information i, i.e., E - ì{i, c). Embedding information (Informing of embedding) E embeds the arisen counter-information c into already existing information J in such a way that a new total information ii arises. This is denoted symbolically by j'j - E(c, i). In this way the following, so-called minimal informational system is obtained;
i;
J = iW; c = Id); E = id. c); il - E(i, c); ii;
This system is called minimal (or the first approximation) because Informing I and embedding E are not considered as argumentative information. As soon as information appears (arises), it can be inf ormationally active (function-like) and/or the argument for other information simultaneously. In this sense, the form id) in the second line of the upper system is not trivial. It represents a form of the eo-called implicit (implicitly hidden) autoreflexiveness (autoreflexive implicitness) of information. Prom this line on, the following lines of the system are incomplete in the sense of an adequate I and E consideration. Thus, let us set the following, more completed scheme (the so-called second approximation):
i;
I = i(i): c = Id. I);
■ É = id, I, c);, Jj r E(i, c, E); it:
This informational system is enhanced; however, it is far from the	so-called maximal
approximation. In .this, the so-called second approximation there does not appear the explicit autoreflexiveness of information, which will be introduced in the next, the third approximation. Although Informing J and embedding B are autonomous items of information, their functional action is still under supervision (observation) of the entire information	j.	Consequently,	counter-
information c arises irrespective of the
produced Informing / also through the influence of information j and, thua, the third equation of the system becomes c -	I). In this
equation,	represents the symmetrical
functional connective (mechanism), where already arisen Informing I is under the influence of i and information j is under the influence of J, In general, information ìb always under the influence of its own arising and vice versa. In this way the later scheme can be advanced into the system (the third approximation )
i;
I = ria, I);
C = I'cd, I, C>;
E = i'EO, I. C, E);
Jj r E'i,(i, c, E, ii};
iji
In this scheme the explicit autoreflexivene'ss of information is coming into the foreground. The avitoreflexive explicitneas appears in the second, third, fourth, and fifth line of the scheme in the form / - ,..(,.. I); c - ...(,,. c); E = .,.(.,. E), and il = ,„(... il). In some way, the implicit autoref lexivenesa becomes also more complex, for instance) in / = i'lUi I)t information i and Informing I are implicitly autoreflexive in a more complex (composite, mutually dependent) manner.
For the maximal scheme, the following would hold: everything, which appears informationally as an argument can appear as a function too, and vice versa. The maximal scheme seema to be possible only as an informationally asymptotical concept. Thua, generally, i(i) and J - .,.(... i) are informationally reasonable recursive (implicitly and explicitly aùtoref-lexive) formulae.
It has to be mentioned that in a scheme of the form
i; u
the upper and the lower i are not equal at all, because they appear separately as labels for different informational forms-processes.
6/5
The symbolic notation i(i), which characterizes the arising of information from Information by information, is mathematically trivial, inconsistent, because in this case, information i is simultaneously an independent and from itself dependent variable. In the rationalistic logic this form of expression is called tautology (the formal agreement of subject and predicate). Within this understanding, information is a tautological principle and tautology itself the fundamental (arising, generative) principle of information. The exclusion of tautology from the classical logical discourse represents a digression from the essential informational research, therefore also from the essential research of logic, mathematics, or any other exact discipline, respectively.
The symbolic notation l(i) is tautological, recurrent, and/or recursive. Recursion is a legal mathematica! principle where, for instance,	the left side of a rule (of
substitution or inference) is expressed by its right side. In this case, it would be possible to denote a mie by j i. In fact, this rule is recursively trivial, because from a given i it generates i again, consequently, circulates in itself without arising reproduction. The formula i := i illustrates the waiting, nonproductive cycling, the non-variability of i, A more adequate formula would be i i(i), which already illustrates the previous meaning of the functional tautology i(i).
At this point of discourse, the question can be raised, which information, in fact, arises with the usage of formula i(i). What is the mechanism hidden behind i(i)? The principles of arising of information are circularly spontaneous. The mechanism i(i) is spontaneous, however, also circular [4], as it proceeds from the previously described schemata of informational arising. The mechanism i(i) remains	unforeseeable	(generally
unpredictable), although its unforeaeeability is informationally rhythmic (oscillatory) and harmonic (embeddingty adequate, informationally connective,	and	in	this	respect,
informationally predictable) [51. That is to say that the rationalistic unforeseeability of the mechanism i(i) has just the meaning of informational foreseeing, this is of the arising and embedding of arisen informational products (counter-information) into the existing (valid, original) information. The informational foreseeability (rhy thmicalness, harmonicalness) is in no way arbitrarily determinable in advance, but arises also as foreseeability, as the information of foreseeing. The position of the rationalistic tradition is symptomatic exactly in the way it refuses the mechanisms of the type id), because by their use each and every rationaiistic (informationally closed, hard-defined) disc ipline would be disintegrated and decomposed.
Common sense and everyday experience approve that the principle i(i) is generally valid and that this principle cannot be captured adequately through the standard symbolic, notational, abstract, ideological, logical, mathematical, and even philosophical course of rationalistic thinking. However, this is not true of the philosophy as a whole. Soft philosophies permit circular hermeneutic (a being's semantic) schemata. For these philosophies, statements of the form "language is on the way to language", "information produces information", "a living being is a being's self-production (autopoiesis)", etc. are permissive, Martin Heidegger's philosophy ia an unsurpassable mastery of tautology. The main reproaches of rationalistic philosophy (materialism, hard dialectic) concern precisely the philosophical tautology. However, in the informational sense, circular inquiring has an essential value, because it does not represent a static (rationalistic) tautology, but can generate intelligence. A soft philosophy enables a spontaneous, free Informing of a being, while a hard philosophy performs as ideology {5], as artificially limiting
informational spontaneity.
7. Differentiation and Counter-Information
cou nter-inf or matio n
the derivative; a result of informational differentiation
7/0
In this section we shall show only one of the several aspects of informational differentiation, namely, the countar-informational one. To understand the mathematical differentiation, let. us remember its basic sense and properties.
7/1
Every differentiation has to do with a kind of difference occurring between the source object and that from which this object ie derived, i.e., in some way changed one. Differentiation is a procedure by which the derivative product is processed from the original one. This notion coincides with the mathematical concept of differentiation and will, in some manner, coincide also with the informational differentiation, As information, differentiation will be a process of Informing by which the original information will arise (change) into a new form, where the product of differentiation will appear as counter-information. This ■ way of thinking concerning informational differentiation is only a simplified and general one.
Symbolically, If y is a function of x, the instantaneous (non prompt or immediate) rate of change of y with respect to x can be expressed in the form i)jy or dy/dx. This form is called the derivative of y with respect to x. We understand that x in this relation functions as a reference information, as a reason loriginator) of differentiation.
7/2
Now, let us discuss how counter-information can be brought into the context of differentiation. Counter-information can be understood (within other . possibilities) as a product of informational differentiation, i.e., the result which	explicates	an	informationally
instantaneous rate of informational arising, changing, and vanishing of information with respect to some specific information. What does-informational	differentiation	aa	the
information which operates along some source-specific information represent in this context? We can choose counter~infor mation as the information-characteristic term for informatio~ nal differentiation. To continue the course of our discussion, the so-called counter-Informing can be the component distributed (arieingly) in Informing, which produces counter-information. In this sense, we can adopt the following scheme:
source information
counter-Informing within Informing
Informational function to be differentiated
process of informational differentiation
In the following step, the arisen counter-information has to be {hermeneutically, semantically, meaningly) integrated into the source and into the new information as a whole, otherwise it performs as informational noise. This noise is a process of ignoring or vanishing (dying) of counter-information and can occur already during the cycle of the arising-embedding in which counter-information arose, but the embedding of it was "noisy" by itself. In the next arising-embedding cycle the process of ignoring or vanishing of counter-information continues and so, the effect of informational noise is progreeeing, causing the losing of substantial counter-informational items.
7/3
On the basis of the previous discussion it is possible to rewrite informational systems (the first approximation) presented in Section 6/4. Let us introduce c for counter-information
which yields
c - d idJ/di
đi<i)/di = Ki, C)
where C represents counter-Informing within the Informing I. According to this, one can
introduce
C = d 1(0/d i
In this manner, according to Section 6/4, the first approximative informational differentiation system becomes
i;
I r id):
c - dS(i)/di, where
diY/Vdi = I(i, C) and C = dl(i)/di;
E = uu c); ,
i, = E(i, c): Jjj'
Similarly, the second approximative system from Section 6/4 can be rewritten into
j;
I = id),
c - djYWdj * dJ(j)/dI, where djY/>/d/ = I a, c), C = AKWAi,
dim/di - la. c,).
C, r dl<l)/dl, and labels a complex informational connective; E r i(i, I, c); il = E(i, c, E}; ii:
and the third approximative system from Section 6/4 has, for instance, the form
jV J
C
E U h:
ria. I); i'{
AHÌUAÌ *J diw/d/ dj'OV/dc
d7/dy d//dc *i dc/d/ *t dc/d/ h
where the expression in parentheses ( and ) after the sign "' represents c(i, I, c) or the taking into account of informationally differentiated (differential) details; di(i)/dl = 1(1, Ci), Ci - d7C/;/d/; dSW/dc = Cid, ci). cl = dCi/dl; *i, *t, *»,	*s are different
forma of complex informational connectives; : i'EU, I, c, E): ■ E~ii(i, c, E, il);
Similarl7, it is not necessary to expose the fact that informational derivatives are always holistically particular (e.g., partial within the mathematical terminology) and that it is, at least informationally, against common sense to construct the so-called total derivation which can exist in mathematics, for in a mathematical function f(x), the variable x belongs to a well-defined (definitionally constant) set of objects.
A searching of the so-called maximal general scheme of Informing can be, speaking informationally, only asymptotically approximative. In this context, expressions like i(i) or even di(i)/di have to be understood informationally holistically, in the sense that in the cases of i(i) or di(i)/di. in fact, these expreasional meanings represent a holistic function or holistic derivative respectively, with ij(ù) or dii(ii)/dh, where il,	il,	and	ij	are	informational
particularities (and thus, mutually divergent, different intentional or chaotic information).
8. Integration and Informational Embedding
8/0
The notion of integral represents one of the central notions of mathematical analysis and also of mathematica as a whole. The emerging of this notion concerns two basic tasks: the determination of a function on the basis of its derivative and the calculation of a distance, surface, or volume (generally a multidimensional volume) bounded by the function (its "curve" and coordinates' interval). Two forms arise from this fundamental concept of integral: the indefinite and the definite ones.
The notion of integral (or integration) in the informational realm ie the embedding (incorporation	or,	mathematically,	the
operation of finding a function whose derivative ia known ) of arisen information into the existing information or also a particular form of embedding, which is called reinterpretation (of already embedded information). However, the changing of interpretation can be understood also as
informational arising causing a new embedding of an (arisen) informational case. Integration ie a very general informational phenomenon taking place during informational arising. Integration is the most general principle of informational embedding.
8/1
Every integration has to do with a kind of incorporation of the source object into its broader environment. Integration is a procedure by which the integrated product has been processed from the original, informationally narrower one. Integration has the function of (for instance, hermeneutical or semantical) connection of the arisen (informational) original to a broader (informational) realm. Mathematically, integration produces the function from its derivative and, in this respect, It represents (to some degree) the reversible (or even inversive) operation to mathematical differentiation.
Symbolically, if	is a function of x, the
integral of y with respect to x can be expressed in the form INTi^iT.nmiit r(x) djr or in a more compact form as INTt,/ y(x). In this expression there are two cases of reference: d is the integration domain (similar to the lower and higher limit) and * is the integration variable.
8/2
Now, we can discuss the bringing of counter-information into the context of integration. Counter-information as a product of informational differentiation has to be integrated into the already existing, informationally valid information. Otherwise, counter-information is on the way to be vanished, forgotten, and lost as informational noise, which arose, but was not embedded sufficiently into an existing informational realm.
By informational integration, the arisen counter-information is embedded into a closed informational realm and, in this way, the arising of information ia, in fact, closed into the cycle of Informing, in which information is coming into existence in an informationally senseful way. The scheme of Informing in 7/2 can now be advanced into the following cycle:
source information
counter-Informing within Informing
counter-information
cou nte r-infor mation
informational function to be differentiated
process of informational differentiation
the derivative: a result of informational differentiation
the reference information for integration into a given informational domain within informa-
embedding of counter-information
new source information
nal realm
process of informational integration
informational function to be differentiated in the next cycle
8/3
On the ground of examples given in 6/4 and 7/3, it is possible to rewrite these systems of Informing considering the function of embedding £ as a form of informational integration. We have introduced some more detailed, also parallel components of information i. Informing I, counter-information c, counter-Informing C, embedding E, and information ii at the end of arising-embedding cycle into the context of these entities. In this respect, Informing I was expressed as
I = Ui); I = rui, I);
counter-information c as
c - Id):
c = Id, I);
c = red, I,, c);
c = difV/dÙ
c = dj'fiV/di * di(i)/dl;
c r r(di(i)/di t, diW/dl *3 d2(i)/dc
d//dj *4 dl/dc t, dc/di dc/dl);
counter-Informing C as
C = dm/di;
and counter-Informing of Informing Ci, which is a component of counter-information c, as
C, = dfa>/d7;
embedding E as
E = id, c); E = id, I, c): E = J'Ed, I, c, E);
and information i, at the end of the arising-embedding cycle as	.
U - Ed, C); U = E(i, c, E); Ü - E'Ud, c, E, i,);
u;
Yet, we still have to analyze some possible ■effects of informational embedding E. We can conclude that through embedding some information of embedding e and counter-embedding ee will also arise. The upper set of expressions for embedding E is now coming into consideration in more detail.
8/4
Formally, information of embedding e is a sort of co/inectjve information, by which the arisen
counter-information c through the process of embedding E will be embedded into information i obtaining a new informational case ii. Evidently, information i. Informing I, counter-information c, embedding E, Information of embedding e, and information j'j are connected information. Information of embedding e is the answer to the question how counter-information c is connected in h. In a realm of information, it is always possible to state the question how information is connected with information or, how one informational case is embedded into another one.
For the information of embedding following cases would be possible:
e.
the
e = E(c, i); e r INTi c di; e = E(c, I. Ih
e = mil C di 4 INTi / de # JWTj c dl;
where '#' is a complex informational connective.
From these cases we can see that embedding E is an operation of integration. In this respect, it is possible to introduce even more complex integrational aggregates, for instance, E'i, E^I, E'c, E'I'c, etc. to point out the integrative connectivenesB (.") of different informational processes (operations).
8/5
On the basis of the previous discussion, we can rewrite the ao-called first , approximation of the informational system presented in Section 6/4 and 7/3 in the following way:
j;
I = id);
c = did)/di; e - INT, c di!
il = i'e(i, c) : INTi e dc # INT, c de; il;
In this system we have introduced informational differentiation and informational integration, where counter-information c is the derivative of source information ;' (for instance, of a being's metaphysics, state-of-mind of a being), information of embedding e is the integral of counter-information c in ' informational domain j by i, and new information it a complex composite # of integrals, the first of which is the integral of information of embedding 9 in the domain j by i and the second of which ia the integral of information i in the domain e by e </-e mutually dependent embedding).
Similarly, the second approximative system from
Section 6/4 and 7/3 becomes
/
/
/ - id): !
C r djYjV/di * diViV/d/;	,
e =■ INJi cd; # INTi I dc;
il = i'ed, c, e) = INT, e dc il INT. c de iž !
INTc e di;
il:
The third approximative system from Section 6/4 and 7/3 has, for instance, the form
i; I ;
C .
e ; /i
i-i;
rm, I):
r(dUi)/di t, Ai(i)/àl dim/dc dl/di
dl/dc *5 dc/di *( dc/d/; INTi C di fi INT, I de i, INT, c dl INT, i de; : 1 e 't i, a, I, C, h) =
INTi e do il INT, c de #j INTt e di i, INT, C di, §, INTii C d«; where 'J and are composite informational connectives;
8/6
In general, instead of INTu it dji, the denotation (instruction) of the form
integrate it in i, by it
would be more appropriate, since the INT operation has to do with the mathematical integration merely in the cases of the regular mathematical integration. The integrate instruction represents informational operation by which the information of embedding (informational connection) is obtained.
generative set.
(2) In a class of InformingB, to each pair A, B of objects belonging to the object class, a generative set HomMy -BJ will be assigned which elementa will be called morphiama. A morphism is an Informing I/ which concerns the object pair A, B, thus that B = 1(A), In this way, object B includes the embedded counter-information which arose during Informing I of information A, The generative set Hom S} is a collection of arisen inorphiBms concerning A and B.
(3> Abstractly, binary operations of the type
Horn ('S, C) # Horo M/ B) ~> Horn M, C)
can be performed. According to auch operations, the composite morphism, belonging to the set Bom (A, C), denoted by Iti, arises to each pair (Iif II) of iDorphisms, where h belongs to HoroC^ B) and Jj to HorofB, C).
For binary operations, the associative law could be introduced; if Informing I, belongs to HoroM, B}, It to HomrS, C), and It to Hom('C, D), then
(Iti,) I, - hdth)
9. Category as Information
Also, for each object A, the so-called waiting (or unit) morphiam eji belonging to HomfA/ A J can be introduced, for which
What is knowledge? A traditional answer is that knowledge ia a form of justified belief. ... Beliefs can be false, and the truth may not be believed, ... It is of no help to be told that knowledge depends on having an adequate justification if, as ia ao often the case, one is not toid what constitutes an adequate justification.
Fred I. Dre take [1] 85
In the mathematical sense, a category can be understood as the most universal mathematical notion. In such a universe there are certain kinds of objects (these can be represented as informational cases, i.e., informational forme and informational processes) and certain kinds of transformations or morphisms (these can be meant as Informings, i.e., informational processes) between objects. Categories can be conceived as a form of the most general mathematical discourse, the summit of today's mathematical knowledge.
heA - ejih = I,
holds. However, it is worth mentioning that the associative and the waiting case are idealistic, because, as information, Informings i"), It, and Is arise while being coropoaed, and also the waiting before or after Informing may deliver different information.
10. Development of Informational Mathematics: the Mathemation
... Tako Je Jezik odtujevanje od konkretnega carstva reči, oziroma njihove simbolizacije (gledanje na stvar kot na hkratno pojavljanje enega ali več njenih znakov z drugimi), to odtujevanje pa se godi samo za to, da Je "snidenje" s konkretnim bolj sijajno
Valter Motaln [8] 176
Category K as informational entity can be defined abstractly in the following way:
(1) In a class (a given field of relevance, a generalized set), informational objects (information or, more precisely, informational cases in the usual way) will be denoted by capitals A, B, C, ... . These objects are understood to be initial information of the categorical class. During the Informing of K, the object class will be generated in the usual informational sense when new objects of the class will arise (counter-information and ita embedding), some of them will be changed (counter-informationally and embeddingly), and some of them will disappear (as informationally non-relevant entities). In principle, during the Informing of K, the object class will perform similarly aa an informationally
Mathematics is only one of the possible scientific disciplines concerning abstract symbolic formalization. This way of informational abstraction (rationalism) is called mathematization. In this respect, mathematizaCion is the action of specific
> ... Thus, language is alienation from the concrete realm of things or from their symbolization (looking to the subject qua to the simultaneous appearing of one or more Its symbols among the others), where this alienation takes place only with the aim that the meeting-again with the concrete is shining more ...
abeiractneBB vhich has the meaning of putting coinmon-senee processes into ' mathematicalty legal expresBional and semantic (symbolic) forms. In the same way as mathematization concerns mathematics, mathemation (or mathemating) as Informing will concern mathemation (itself).
or logics can be derived on the basis of this generalized concept. In this context, questions of informational truths and falsity, informational derivation and integration, inforraatio-naJ modality, inference, particularity, etc. as forma and processes of informational arising have to be stated.
The nevr term matAemstion has, similarly as mathematics, its roots in the Greek mathema, which means: to be learned, learning, knowledge, science, Mathemation will be a particular, specific field of information representing a formal (or formalistic) generalization of the field known as mathematics. Mathemation will include mathematics as a specific rationalistic informational component of mathemation.
If mathematics is explicitly rationalistic, deductive, inductive, mathematical-ty pically (methodologically) inferential and, in these senses, provable theory, then mathemation is also irrationalistic and has typical properties of Informing, counter-Informing (spontaneous and circular informational generation), and informational embedding (circularly or recurrently spontaneous connectiveness' in the realm of meaning). Mathematical notions can always be generalized in some mathemational way, e.g., the notions of logic, relation, set, function, differential, integral, category, etc. can be generalized (notionally broadened) mathemationally.
It is evident that mathemation is in no way a generalization of information. It remains only a part of cultural or a being's informational realm with the purpose of specific, mathemationally characteristic formalization. In the previous sections of this article, it has been shown in which manner certain informational formalizations (in the course of mathematical tradition) are possible, However, this does not mean that completely new ways of formalization, different from mathematical concepts, cannot be sought.
In summarizing the dealt with mathemational notions in the previous sections the following is to be pointed out: informational relation, generative set, functional type i(i), derivative as inforinational difference, in'tegral as informational embedding, category as informational functional complexity, and last but not least, mathemation as an informationally free principle of abstraction and aymboUsm are generalized, non-mathematical notions. They exceed mathematically true and legal conventions, however, can be reduced into mathematical objects according to their mathematical conventions as particular, reduced, and simplified (static) cases.
11. Conclusion
Some problems of the rational understanding of information, discussed in this article, represent a set of arguments for the development of a logical system which could be called the informational logic. This system has to be constructed in such a manner that arbitrary cases of particular logical systems
In this respect, informational logic deals with the phenomenology of informational arising. In fact, today's logics, from mathematical to modal ones, are studies of ideas concerning linguistic items and dealing exclusively with informationally static, steady, and well-determined informational objects, irrespective of the field of relevance where they are used and of the purposes for which they are constructed. Becauae of their incapability to express logically arising objects and operations, today's logics are staying . outside the possibilities to express dynamic activities appearing in social and man-machine interaction. So far, no logic up to now has tried to capture the most general property of living beings, which appears to be the arising of information as the substantial phenomenology of living self-production and surviving of organisms.
The developing philosophy of information cannot accept uncritically and simplistically the limited and eesentially narrowed natures and principles of information as they are proposed under the protection of today's information theory, information Bcien,ce, and even artificial intelligence. It has to search for new logically formalized mechanisms for the expression of structures, whose natures are in the procese of coining into existence, changing, vanishing, or disappearing. Informational logic by itself must be conceptualized as a formal system of arising in which new, unforeseeable objects, relations, and operations can come into existence, i.e. can be produced from the existing informational regularity and from the informational chaos.
12. References
[1] F. I. Dretske: Knowledge and the Flow of Information. Basil Blackwell Publisher, Oxford (1981).
[21 A. P. Zeleznikar: On the Way to Information. Informatica 11 (1987), No. 1, 418.
[3] A. P. Zeleznikar: Information Determinations I. Informatica 11 (1987), No. 2, 3-17.
[41 A. P, Zeleznikar: Principles of Information. Informatica 11 (1987), No. 3, 917.
[5]	A. P. Zeleznikar: Information Determinations II. Informatica 11 (1987), No. 4, 8-25.
[6]	1988 IEEE International Symposium on Information Theory, Kobe, Japan (June 19-24, 1988) (Call for Papers).
[7]	A. P. 2eJeznikar: Uvod v formalne informacijske sisteme (An Introduction to Formal Information Systeme). Automatika 16 (1975) 3-7.
[8]	V, Motala: Jezik in mieel (Language and Thought). Antbropos (1985) 166-176.
[9]	G. W. F. Begei: The Science of Logic. Hegel's Logic, Translated by W. Wallace with foreword by J. N. Pindlay, Oxford, At the Clarendon Press (1985).
[101 C. V. Segoita: Expert Systems and Fuzzy Systems. The Ben jamin/Cummings Publishing Company, Inc., Menlo Park, Ca (1985).
[11] 5. Kripka: Semantical Analysis of Modal Logic. Zeitschrift f. math. Logik u. Gründl, d. Mathematik 9 (1963) 67-96.
Problemi racionalnega razumevanja informacije
Pri racionalnem, zlasti matematiö nem izražanju informacije kot nastajajočem procesu se pojavljajo znaäiini problemi matematične nezadostnosti in neustreznosti. Matematični formalizem, ki je bil povsem aadcaten za opisovanje dobro definiranih jezikovnih konstrukcij, odpove v tej svoji tradicionalni obliki, ko je potrebno izražati tisto realnost, ki je nastajalno spremenljiva, nepredvidljiva in razvijajoča. Zaradi take (dinamične) informacijske pojavnosti je potreben kritični pogled tudi v' same temelje klasične simbolne oziroma formalne logike in opustitev preveö togih in za današnje razmere mišljenja pretrdih, nespremenljivih definicij matematike, kot so npr. sama logika, relacija, množica, funkcija, diferencial, integral, kategorija itd. To seveda ne pomeni, de matematika zgublja svojo dosedanjo vlogo, očitno pa je, da s svojimi nespremenjenimi in nerazdirJenimi temelji ne more več konstruktivno in znanstvenorasvojno posegati na nova področja kognitivnega ražvoja.
Prispevek v tem Članku je seveda Se vedno bistveno oddaljen od neke nove, bolj ustrezne formalizacije) kot je klasično matematična, Nekaj stvari pa je vendarle 2e bolje osvetljenih. Matematična logika, ki izhaja ia mišljenja J prejšnjega in iz prve polovice tega stoletja, je zgodovinakorazvojno sicer koristna, vendar je potrebno njene temelje tako preoblikovati, da je iz teh temeljev mogoče brez velikih težav izpeljati različne sploäne in konkretnejše logike. Dosedanja formalna logika matematike in njeni različni logični sistemi BO v tem konceptu lahko kvečjemu posebni in v svojem bistvu celo bistveno konceptualno reducirani primeri. Iz teksta tega članka je razvidno, kako so posamezni logični, relacijski, množični, funkcijski, diferencialni, Integralni, kategorjski primeri pojmov omejeni in neustrezni in tudi, kako bi bila ta njihova dinamična preformulacija mogoča. Ta članek je predvsem mehek uvod v to problematiko in raziskava določenih matematičnih neustreznosti. Cilj te predpriprave pa je seveda izgradnja poti, ki vodi v nekaj, kar bomo v doglednem času lahko imenovali, informacijska logika in v okviru katere bo mogoče razviti tudi bistveno drugačne koncepte množice, relacije, funkcije, kategorije itd, kot je v danaänji oziroma včerajšnji matematiki to mogoče.
Problematika v tem članku je seveda tudi na poti neke sploSne kritike tiste vrste racionalnega razumevanja, ki se največkrat izrojeva v racionalizem in ki skupaj s tradicionalizmom postavlja bistvene blokade napredujočemu mišljenju. Tako tudi za to vrsto racionalnosti velja, da po določenem razvojnem obdobju preide v svojo protiinfor macijo, ki postane iracionalnost in za nadaljnji razvoj nesprejemljiva dogmatic nost.
A CRITIQUE ON PARALLEL COMPUTER ARCHITECTURES
INFORMATICA 2/88
UDK 681.3.001.519.6
L. M. Patnaik R. Govindarajan M. Špegel" and J. Silc**
Indian Institute of Science BANGALORE and Jožef Stefan Institute**
The need for parallel processing is felt in many disciplines. In this paper we discuss the essential Issues involved in these systems. Various architectural proposals reported in the literature are cla&sified based on the execution order and-parallelism exploited. Design and salient features of some architectures which have gained importance are highlighted. Architectual features of non-von Neumann systems such as data-driven, demand-driven, and neural computers, which open the horizont for research in the new models of computations, are presented in this paper. Principles and requirements of programming languages and operating systems for parallel architectures are also reviewed.
Contenta
I. Introduction
2.	Issues in Parallel Processing Systems
2.1.	Models of Computation
Control Flow, Data-Driven Model, Demand-Driven Model
2.2.	Concurrency
Temporal, Spatial and Asynchronous Parallelism, Granularity of Parallelism
2.3.	Scheduling
2.4.	Synchronization
3.	Von Neumann Parallel Processing Systems
3.1.	Pipeline and Vector Processing
Pipeline	Architecture,	Vector
Processing
3.2.	SIMD Machines
3.3.	MIMD Architecture
Interconnection Networks,	Tightly
Coupled Multiprocessors, Loosely Coupled Multiprocessors
3.4.	Reconfigurable Architecture
Connection Machine
3.5.	VLSI Systems
Siatolio Arrays, Wavefront Array Processors
3.6.	Critique on von Neumann Systems
4.	Non-von Neumann Architectures
4.1,	Data-Driven Model
4.2,	Demand-Driven Systems
4.3,	Neural Computers
5.	Software Issues Related to Multiprocessing Systems
5.1.	Parallel Programming Languages
Conventional Parallel Languages, Non-von Neuman Languages
5.2.	Multiprocessor Operating Systems
6.	Conclusions
7.	References
1. Introduction
There has been an ever-increasing need for more and more computing power. In a real-time enviroment, the demands are much more. While the computers built around a single processor cannot stretch their processing speed beyond a few millions of floating point operations per second (FLOPS), an operating speed of a few giga FLOPS is required in many applications. Parallel processing techniques offer a ■promising scope for these applications,
Several applications such as computer graphics, computer aided design (CAD) of electronic and mechanical systems, wheather modeling and robotics have a high greed for high computing power. To quote an example, ray tracing techniques [32] used to display 3-dimensional objects in computer graphics have to process 5x10» rays, each ray intersecting 10 to 20 surfaces, and each intersection computation requiring 20 to 30 floating, point operations on an average. In order to provide a fast, interactive and flicker-free display, each frame has to be computed 30 times per second. Thus, the total computation power required 60 giga FLOPS. Comparing this with the execution speed of the present day supercomputers — whose expected/measured performance is about 100 millions FLOPS [53] — we observe that the demand is more by an order of magnitude of two.
The need for parallel processing is conspicuous from the above illustration. The recent advances witnessed in VLSI technology and the consequent decline in the hardware cost further encourage the construction of massively parallel processing systems.
The paper is organized in the following manner. In Section 2, we point out the major issues associated with multiprocessing systems, with a brief discussion on each of these items.
We classify parallel processing architectures broadly into two classes. The first of them, the von Neumann type, includes all multiprocessing systems that adopt the principles of the von Neumann computation model. Pipeline and vector processing, SIMD tnfichines, MIMD machines and VLSI systems which fall under this category are surveyed in Section 3. Supercomputers employ one or more of these techniques for achiving high performance. More emphasis is given to current and recent developments in these areas'. In non-von Neumann architecture, we include data-driven, demand-driven and neural computers. Their methodology of computation is functionally different from the ones discussed earlier. An overview of these architectural configurations is presented in Section 4.
The programt&ing language and operating system support required for working with these complex parallel processing systems are reviewed in detail in the subsequent section. Parallel programming languages are classified into two categories: the conventional von Neumann type (that adop the Imperative style of programming) and the non-von Neumann languages. Case studies of some of the popular languages and their salient features are also reported.
We remark that an exhaustive coverage of all contributions and research work reported in this fascin&nting area of parallel processing systems is beyond the scope of this paper and we do not attempt that. Bather, we will concentrate on the basic principles behind the design of the various architectures. We will pay more attention to some specific architectures and modela of computation which have gained importance.
2. Issues in Pa^llel Procesaing Systems
In the this section we discuss the various associated with multiprocessing systems. First, it is interesting to look at the models of computation, the way one has evolved from the other, leading to myrial architectural configurations and machines.
2.1. Models of Computation
Depending on the instruction execution order, computer systems can be classified as control-driven, data-driven or demand-driven machines.
Control Flow Conventional von Neumann uniprocessing and multiprocessing systems have an explicit execution order specified by the programmer. This hinders the extent of parallelism that can be exploited. However, these control models perform well for structured data items such as arrays [34]. Also the three decades of programming experience we had with these models still makes it a proponent candidate for future generation computers.
Data-Driven Model In this model, • the execution order is specified by the data dependency alone [26]. Instructions are executed as soon as all their input arguments become available to them. Data flow model is suitable for expression evaluation [34]. Since, only data dependency determines the execution order, and the granualarity of parallelism exploited is as low as a single instruction, this model of computation exploits maximum parallelism. However, because of its eagerness in evaluating functions, it executes an instruction, if its operands are available, irrespective of whether or not it is required for the computation of the final result.
Demand-Driven Model In demand-driven (also called reduction) execution, pioneered tly Berkling [15] and Backus [9], the demand for result triggers the execution which is turn triggers the evaluation of its arguments and so on. This demand propagation continues until constants are encountered; then a value is returned to the demanding node and execution proceeds in the opposite direction. Since a demand-driven computer performs only those computations required to obtain the results, it will perform less computations, in general, than a data-driven computer. As the computation model is 'lazy" in evaluating expressions, instructions accept partially filled data structures». This makes demand-driven model to support infinite data structures. Such a facility is not available in the other models of computation. However, this model suffers overheads in terms of execution time due to the additional propagation of the demand (for arguments). The control mechanism and the management of program memory add further to the inefficiency [91].
2.2, Concurrency
The granularity and nature of parallelism exploited, significantly influence the performance of the parallel processing systems.
Temporal,
-§.Eft.tlal_
and Asynchronous
Parallelism	By, performing	overlapped
computation one can exploit temporal parallelism. Pipeline computers [53,77] execute instructions in an overlapped manner as in the assembly line of manufacturing to achieve parallelism. Examples of pipeline processing are the execution of the different phases of an instruction namely, instruction fetch, decode, opcode fetch and execution; execution of the various steps involved in floating point arithmetic operations at various stages of the pipeline. These machines are ideally suited for processing vector instructions (53] namely, vector add, vector multiply and dot product.
Spatial parallelism is the parallelism inherent in performing an operation over the elements of structured data, such as arrays. Spatial parallelism is easy to detect and exploit [34]. The synchronization and scheduling overheads involved in exploiting spatial parallelism are much less compared to those experienced in the other two models.
Multiple instruction multiple data (MIMD) [53] machines achieve asynchronous parallelism through a set of interactive processors with shared resources. Several independent processes running asynchronously on different processors work to accomplish a common goal by exchanging messages or by sharing common variables.
While spatial parallelism is easy to detectet and exploit, the range of applications on which it is inherent is limited. MIMD machines	which	exploit	asynchronous
parallelism, on the other hand, offer flexibility in programming. Hence it can be used for a wide range of problems.
Granularity of Parallelism For high performance, we want to exploit as much
' The nature of the model in executing an instruction only on is termed 'lazy',
' The computation of a recursively defined data structure, for example SEQ(N) = CONS{N,SEQ(N+1)) never terminates, and hence is Infinite. Such a. data structure can partially be filled by evaluating only those terms which are required for further computation.
parallelism as possible. This means that the granularity of the tasks should be low. We observe that as the level of granularity goes down, the paralleliam that can be exploited increases. However, loner the. level of parallelism more will be the synchronization overheads. A tradeoff between the paralleliara exploited and the synchroniiation. overheads is required to achieve high performance.
Another major issue involved is parallelism detection, Paralleliso can either be explicitly specified by the user, or can be detected from a program written in a sequential language using an intelligent compiler, For expressing parallelism explicitly, language such as CSP tSOJ, Occajn [55] and Concurrent Pascal 145] can be used. The user need to have a knowledge of the architecture of the machine on which the program is executed, the application program and its runtime characteristics. The second approach which uses program restructuring techniques [71] to transform a sequential program into a parallel form, does not required a knowledged-user. However, these techniques are inefficient and cannot detect parallelism to the fullest extent. Active research to develop efficient parallel programs using these paradigms is underway.
In a multiprocessor system, scheduling algorithms assign each task to one or more processors with the goal of achieving high performance. Scheduling can again be static or dynamic (refer 16] for a comparison). In static scheduling, the tasks are allocated to processors either during the algorithm design by the user or at compile time by an intelligent compiler. In both these approaches, the scheduling costs are paid only once even if the program is run many times with different data, Morover, there is no runtime overhead. The disadvantage of static scheduling is possible inefficiency in guessing the runtime pirofile of each task.
Dynamic scheduling at runtime offers better utilization of processors, but at the cost of additional scheduling time. The dynamic scheduling algorithm can be distributed or centralized.
(iii) NYU Ultracomputer's 'fetch & add' [42J.
The subsequent sections highlight the architectural features of parallel processing systems. The whole spectrum of architectures proposed in the literature can broadly be classified as conventional von Neumann, parallel processing systems and the' non-von Neumann systems.
3. Von Neumann Parallel Processing Systems
Von Neumann parallel processing systems can be divided into three major architectural configurations: pipeline, SIMD, and MIMD architectures. These three architectural models exploit the three kinds of parallelism, namely temporal, spatial and asynchronous parallelism respectively. Each of the above mentioned architectures is discussed in detail and the design and salient features of certain parallel processing machines are highlighted in this section. Reconfigurable architectures and VLSI systems, though not entirely distinct from the above discussed models, attract the attention of researchers due to their many interesting features. In particular VLSI systems offer new scope in computing for designing dedicated architectures for a variety of applications. Constituent elements of the VLSI systems are systolic [63] and wavefront [64] architectures.
3.1. Pipeline and Vector Processing
Pipeline Architecture Pipelining [77] offers an economical way to realize temporal parallelism. The concept of pipeline processing in a computer is similar to manufacturing in assembly lines in an industrial plant. To achieve pipelining one must subdivide the input task into a sequence of subtasks, each of which can be executed by a specialized hardware stage that operates concurrently with other stages in the pipeline. Successive tasks are pumped into the pipe and get executed in an overlapped fashion at the subtask level. Pipeline processing leads to a tremendous improvement in system throughput. A k-stage linear pipeline could be at most k times faster. However, due to memory conflicts, data dependency, branch and interrupts this ideal speedup cannot be achieved.
2.4. Synchronization
Synchronization methods are required to coordinate parallel execution of tasks in a multiprocessor system. Architectures with shared storage achive synchronization by semaphores [27] and monitors [49]. In a massage passing multiprocessing system, synchronization of processes is achived using remote procedure calls [46].
In order to update the shared variables in a multiprocessor in a consistent manner (avoiding read-read, read-write races [53]), the updating of the variables is done in a region called critical region. Only one process is allowed to enter the critical region at a time. Preventing access by other processes when the critical region is being accessed by a process is known as mutual exclusion. Synchronization primitives are used to achieve mutual exclusion.
Of the many synchronization primitives proposed (refer [35] for a survey), a few worth mentioning are
(i) the 'test 4 set' primitive of IBM 360/370 machines .[17] ; (ii) 'lock' instruction used in C.mmp [101];
One way of classifying a pipeline is based on the function performed by the pipeline. There are two classes of pipelines based on this classification, namely the instruction pipeline and the arithmetic pipelinei In the instruction pipeline, the various phases of instruction execution such as instruction fetch, instruction decode, operand fetch, and instruction execution, are identified and are executed in the successive stages of the linear pipeline. Thus, from pipeline, after an initial delay, instructions are executed once every cloclc cycle.
Arithmetic pipelines subdivide the arithmetic operations such as floating-point add or floating-point multiply into subtasks and execute them on specialized arithmetic and logic units. In an instruction pipeline, the instruction execution unit can itself be an arithmetic pipeline to further improve the performance. IBM 360/91 machine employs both these pipelines. Much research has already been done on scheduling of pipelines; buffering and delaying techniques are used to improve the execution speed. In Cray 1 [82], there are 12 functional pipeline units to perform both scalar and floating point arithmetic operations. Cyber 205 supports four vector pipelines in addition to a scalar arithmetic
so
unit.
Vector Processing In this section, we explaint the basic concepts of vector processing [53]. Vector processors operate on vector data and execute vector Instructions. Vector pipelines, unlike scalar pipelines, are assured of continuous stream of data. The overheads involved in initializing the vector pipeline is compensated by the speedup improvement gained, as the number of tasks executed is lar^e. Loop termination conditions are performed by specialized hardware in various stage of the vector pipelines. These features make vector processing more efficient than the scalar pipelines.
Vectorizing compilers [6) transform programs written in conventional imperative languages into vector instructions suitable for execution on vector processors. The fact that present day supercomputers have vectorising compilers and vector processors as their major components demonstrates that pipeline processing is an easy and efficient way of realizing high speed computation. However, not all programs can be vectorized. Pipeline processors perform poorly in such cases.
3.2. SIMP Machines
A synchronous array of parallel processors executing in a lock-step fashion, consisting of multiple processing elements under the supervision of one control unit is called an array processor. The system and user programs are executed under the control of control unit. The array processor can handle single instruction and multiple data (SIMD) stream. SIMD machines exploit spatial parallelism. The propesaing cells are interconnected by a datarouting network. The interconnection pattern to be established for specific computation is under program control. Vector Instructions are broadcast to the processing cells for distributed	execution over	different
processors. The cells are passive devices without instruction decoding capabilities.
Array processors became well publicized with development of Illiac IV [12), Clip 4 [291, and Massively Parallel Processors (MPP) [14J. Future research in SIMD machines is towards deaignig and implementing multiple-SIMD (MSIMD) machines [53], consisting of more than one control unit. Each control unit in this class of machines shared a pool of dynamically allocatable processors.
Array processors are characterized by their ability to support parallelism at a low-level. They are ideally suited for specific applications in the areas of image and signal processing [81]. For example, cellular array [81] is a two dimensional array of processors, each of which communicate directly with its neighbors. The structure of the array ia very appropriate in terms of layout on a chip. Work in this direction has led us to the design of SIMD architectures for CAD applications [78],
However, as mentioned in section 2,Z, the range of applications over which SIMD machines can be put into efficient use is restricted and hence they are not candidate architectures for general purpose parallel processing machines.
3.3. MIMD Architecture
MIMD machines can be grossly characterized by two attributes: first, a multiprocessor system Is a single computer that includes multiple processors and second a multicomputer that has several autonomous computers which are
geographically distributed and connected through a communication network. There exists an important distinction between two systems. In the first case, the multiple processors work concurrently in order to achieve a single -goal. Interaction between two processors is essentially in terms of intermediate results and synchronization messages.	Whereas
multiComputers oomniunioate among themselves basically to share expensive resources. Each autonomous computer works on an independent task. In this section we will concentrate more on multiprocessors. Discussion on distributed computing systems using computer network is beyond the scope of this paper.
Multiprocessing systems can be classified into two groups, based on how the processing elements interact among themselves J 53]. When several processors communicate among themselves through a shared global memory, we classify them as tightly coupled systems (refer Fig. 1)t-Hence the rate at which the processors cah communicate is of the order of the bandwidth of the memory. On the other hand, we have loosly_ coupled systems where processors do not share \a common memory, but communicate using messa® passing primitives.
Interrupt signal
■■"'sms®!
Processors
Unmapped
••		1.0/p inter connection network (tOPIN)
) ■9	9	
Input-output^ dwnnets Disks
J-r^
Memory map (MM)

Shared memorymodules
P/m interconnections network ( pmisi.
.lil . . . [T^
(a) WITMOUT PRIVATE CACHE
Lo
l-O »^'-i.-"-'
Li-i
lb) WITH PRIVATE CACHE Fig. 1 Model of a Tightly Coupled System
Ca) Linear array
(bi Ring
(0 Star
W) Tree
(e) Near-neighbor mesh


43
(f) Systol'ic array
(g) Comoletely connected
(h) Chorctal ring
(i ) Cube,
Fig. 2 Static Interconnection Topologies
The interconnection network plays an Important role in multiprocessor systems, and significantly influence the performance of the system. A quick overview of the interconnection network is presented in thia section, which-will be followed by a aiscussion on the two configurations of multiprocessing systems.
Interconnection Networks The increasing popularity of many proposed multiprocessing systems with 10< to 10' processing elements makes the concept, design and implementation of the interconnection network a crucial factor in the design of such systems. A typical interconnection network consists of a set of switching elements. The network can be classified based on the following four design factors; operation mode, control strategy, switching method and network topology [301, A network can send and receive messages in a synchronous	or	asynchronous	mode.
Classification on the control strategy is based on whether the switching elements are controlled in a centralized or distributed manner.
Circuit switching and packet switching are two switching methods adopted in a network. In circuit switching, a physical path is aictually established between the source and destination nodes. In contrast, in packet switching, data is put in a packet which is routed through the network without establishing a physical path. Fig. 2 and Fig, 3 depict some of the static and dynamic topologies of computer network.
The bus interconnection spheme connects the various processing cells through a common shared bus. The bandwidth of the bus is very low, and contention la a serious consequence when a large number of cells are connected to the bus. Various schemes [11] such as daisy chain, parallel priority and time-sliced schemes, have been proposed to resolve bus contention. Fig. 4 shows the organization of a shared, bus multiprocessing system with daisy chain scheme for priority resolving. Despite its shortcomings, shared bus still attracts system .designers because of its low cost and complexity and easy upgradability of the system,
Cross bar switch [53] on the other hand is very expensive, but provides high memory bandwidth. The cost of the network is of the
order of n', where n is the processing cells in the system.
number of the
Multistage interconnection network (MIN) strikes a balance between cost and performance. It is dynamic network using a number of switching elements, unlike a static network where dedicated links are used, A MIM of n x n size establishes' a maximum of n links at any instance, at a coat of n log n. This attractive feature makes MIN a proponet in many multiprocessing systems such as the New York university Lfltracomputer (NYU) [42),
(a) 8 X 8 baseiine network
(b) 8x6 Benes network nxm rxr m xn
»					
«	r		m		r
					
(c) Clos network
Fig. 3 Dynamic Interconnection Topologies
52
Fig. 4 Daisy Chain Implementation of Shared Bua
Tightly Coupled Multiprocessors Tightly coupled systems are ideally suited if high speed or real-time processing is desired. In a tightly coupled system a set of processing elements is connected to a set of memory modules through an interconnection network.
If two or roore processors attempt to access the same memory module, a conflict occurs. Hence the memory modules are either low-order interleaved or high-order interleaved to reduce the number of conflicts. In a variant of tightly coupled systems, each processor is allowed to have a private memory, called the cache for that processor. This will greatly reduce the traffic through the network as local data can be stored in and accessed from the cache.
Processors communicate the intermediate results through the shared memory modules. The set of processors may be homogeneous or heterogeneous. It is homogeneous if the processors are functionally identical. Also, a processor may differ from others in Its capability to access I/O systems, performance and reliability.
Various multiprocessing systems have been proposed using a shared memory architecture. Examples of these are, the C.mmp system [101], the Heterogenous Element Processor (HEP) [25], the New Vork university Ultracomputer (NYU) [42], Honeywell 60/66, Univac 1100/80 and IBM 3084 AP, A dedicated multiprocessing architecture with time shared bus has been designed and experimented by use for computer graphics applications [36,37,38].
Tightly coupled systems can tolerant a higher degree of interaction without much deterioration in performance. However, one of tlje limiting factor of tightly coupled is the performance degradation due to conflicts when the number of processors system is increased.
systems memory in the
Loosely Coupled Multiprocessors Loosely coupled multiprocessor systems do not generally encounter the degree of memory conflicts experienced by tightly coupled systems. In such systems, each processor has a set of input/output devices and large local memory where it accesses most of the instructions and data. Processes which execute on different computer modules communicate by exchanging messages through a message transfer system. The degree of coupling in such a system is very loose (and hence the name). The determining factor in the degree of coupling is the communication topology of the associated message transfer system. Loosely coupled systems are- efficient when the interactions among the tasks are minimal.
Fig. 5 illustrates the model of a loosely coupled system. The channel and arbiter switch in each of the computer modules may have a high speed communication memory which is used for buffering block transfers of messages. The communication memory is accessible by all processors through the communication interface. The message transfer system for non-hierarchical system could be a simple time shared bus. The performance of such configuration is limited by the message arrival rate on the bus, the mesage length, and the bus capacity (in bits per second),
Local [ memory) (LM)
1/0
Local bus
Chonnel and . arbiter switch
(a) COMPUTER MODULE
_ C OrryDut e r_iTiodul_e_0 ^

_Cornpuler module N- 1
......--q"--™'
CA 5
I____

Message transfer system tMTS)
Fig.
(b> LOO se COUPLING OF COMPUTEH MODULE 5 Model of a Loosely Coupled System
Loosely coupled systems where processors are connected using dedicated links, are also popular. In such a network of processors, locality (nearness between a pair of processors) can be exploited by scheduling tasks which interact among themselves more to a group of processors which are closer to each other. Failure of a node or link will not catastrophically affect the system, as alternate paths can be established and the system can perform with graceful degradation.
The Cm» architecture [54] developed at the Carnegie Mellon University is one example of a loosely coupled system. It consists of a set of clusters connected by an intercluster bus (Fig. 6). Each cluster has a set of processing elements connected over a map bus. The system forms a good example for hierarchical structure.
'mop

^mop
InterCluster buses



^mapl-
c^-lc^
Fig. 6 Cm» Architecture
A loosely coupled system designed for artificial intelligence and image processing application is 2MOB [80], In this architecture, a set of 256 Zilog microprocessors are connected by a pipeline system.
Another loosely coupled architecture which has attracted the attention of many researchers due to its high performance and relatively low coat, is the hypercube architecture 147,83]. Because of the increased attention it has received, we devote the following paragraphs to this architecture.
The concept of a hypercube computer can be traced back to the work done in the early 1960's by Squire and Palais [93] at yhe University of Michigan. They carried out a detailed paper design of a 4096-node (12-diraensional) hypercube.
The hypercube topology is an n-dimensional generaTization of the simple cube. The hypercube of dimension n has 2" cells. Each cell is connected to n neighboring cells which are at a Hamming distance of one. The connection between adjacent nodes is by point-to-point link. Fig, 7 shows the topology of hypercube architecture for dimensions three and four. The cube manager provides a high level
interface for system users. It serves as a local host for the cube, supporting the programming environment, compilation, program loading, input/output and error handling.
Hypercube architecture has many attractive properties. The hypercube topology yields a regular array in which the node are close to one another: no more than n steps apart. At the same time, the number of connections from each node to its neighbors is quite low (also equal to n). It thus strikes a balance between a two-dimensional array in which the internode connection costs id low, but the nodes are far apart 0(n'/3) gteps on an average, and a completely connected array in which the internode connection costs is high, but the nodes are only one step apart.
The hypercube architecture is homogeneous in the sense that all the nodes are identical. Further, the hypercube architecture is hierarchical and eminently partitionable. For example, a hypercube of dimension n+1 can be pertltioned into two hypercubes of dimension n. This means that it is quite easy to allocate aubcubes to subtasks, especially for problems which adopt a divide-and-conquer strategy. Lastly, the hypercube architecture can itself embed other regular topologies such as tree, mesh, pyramid or hexagonal structure.
000

(a) Dimension = 3
(b) Dimension sii
Fig 7. Hypercubes
3.4. Reconfigurable Architecture
Another	interesting	aspect	oT
multiprocessing systems in which active research is under progress is reconfigurabiiity 185]. Often, full potentials of multiprocessing systems are not realised in many applications. The reason for this is the mismatch between the application and the architecture. In order to alleviate this problem, we build dedicated systems where the architecture matches the application. Another approach which is actively pursued by researchers is	building
multiprocessing systems with programmable switches; using these switches it is possible to reconfigure the system depending on the application. Such a reconfigurable system, by nature, is flexible and hence can be used for a variety of applications. Configurable Highly Parallel computer (CHiP) [85] is a reconfigurable system designed to suit the topologies of various applications. In this section we will highlight the salient features of another reconfigurable architecture, the Connection Machine [48], Its organization is similar to an SIMD machine, but it functions as a reconfigurable architecture executing multiple instructions on multiple data streams, and hence it is hard to classify this as SIMD or MIHD.
Connection Machine The desire to build a machine that will be able to perform the functions of a human mind, the thinking machine, is the motivating force behind, the design of Connection Machine. Specifically, retrieving commonsense knowledge from a
semantio network was the application in the designer's mind.
The Connection Machine computes through the interaction of many, aay a million, simple identical processing/aemory cella. The two requirements for the connection machine are;
(i) each processing element must be as small as possible so that we can afford to have as an many of them as we need;
(ii) the processing elements should be connected by software.
The Connection Machine architecture follows directly from these two requirements. It provides a large number of tiny processor/memory cells connected by a programmable network. Each cell is sufficiently small so that it le incapable of performing meaningful computations on its own. Instead, multiple calls are connected together into data-dependent patterns, called 'active data structures' that both represent and process the activities of these active data are directed from outside the Machine by a conventional host This host computer stores data on the Connection Machine in much the same way that a conventional machine stores them in a memory. Unlike a conventional memory, though, the Connection Machine has no processor/memory bottleneck. The memory cells themselves do the processing. More precisely, the computation- takes place through the coordinated interaction of the cells in the active data structure. Because thousands or even millions of processing cells work on the problem simultaneously, the computation proceeds much more rapidly than would be possible on a conventional machine.
A Connection Machine is connected to a conventional computer much like a conventional memory. Its internal state can be read and written a word at a time from the conventional memory. It differs from a conventional memory in three aspects. First, associated with each cell of storage is a processing ceil that can perform local computations based ori the information stored in that cell. Second, there exits a general intercommunication network that can connect all the cells in an arbitrary
data. The structures Connection computer. structures
500 M bits /sec
Fig. 8 Connection Machine
pattern. Third, there is a high-bandwidth input/output channel that can transfer data between the Connection Machine and peripheral devices at a much higher rate than tfould be possible through the host.
A connection is formed between two processing memory cells by storing a pointer in the memory. These connections can be set up by the host, loaded through the input/output channel, or determined dynamically by the Connection Machine itself. In this prototype system, there are 65,536 (2'»| processor/memory cells each with 4096 bits of memory. The block diagram of the Connection Machine with host, processor/memory cells, communication network, and input/otput ia shown in Pig. 8.
The	control of the individual
processor/memory cells is oroheastrated by the host of the computer. For example, the host may ask each cell that la in a certain state to add two of its (cell's) memory locations locally and pass the resulting sum to a connected cell through the communication network. Thus a single command from the host can result in tens of thousands of additions and a permutation of data that depends on the pattern of connections. Bach processor/memory cell is so small that it is essentially incapable of computing or even storing the results of any significant computation on its own. Instead, the computation takes place in the orchestrated interaction of thousands of cells through the communication network.
The ability to configure the topology of the machine to match the topology of the problem turns out to be one of the moat important features of the Connection Machine. The Connection Machine can also be used as a content addressable or associative memory, but it is also able to perform non-local computations through its communication network.
While the previous subsection discusses the features and requirements of reconfigurable architectures, this section ia devoted to the architecture of dedicated systems. Nowdaya, mature VLSI/WSI (Very Large Scale Integration / Wafer Scale Integration) technology permits the manufacture of circuits whose layouts have minimum feature size of 1 to 3 microns (69]. The effective yields of VLSI/WSI fabrication processes make possible the implementation of circuits with upto half a million transistors at reasonable cost -- even for relatively small production quantit Ì6S * This opens ^hs horizont, for building systems with thousands of processors in a cost-effective and compact manner,
The key attributes of VLSI computing structures [33,63] are
Ut simplicity and regularity, (ii) concurrency and communication and (iii) computation-intensivenesB, A VLSI structure should be such that its basic building block is simple and regular, and is used repetitively with simple interfaces. This helps us to cope with high complexity. The algorithm designed for these structures should support a high degree of concurrency, and employ only, simple, regular communication and control to allow efficient implementation. VLSI processors are suitable for implementing compute-bound algorithms. In VLSI processors, we discuss the two classes of architectures namely, systolic and wavefront processors.
Systolic Arrays Systolic arrays are admired for their elegance and potential for

a LINEAR SYSTOLIC ARRAY
b MESH CONNEaED SYSTOLIC ARRAY
C he;<agonal systolic array
triangular systolic array
i
□ □ □ □
e TREE STRUCTURED SYSTOUC ARRAY
Fig. 9 Systolic Arrays
high performance. Systolic arrays belong to the generation of VLSI/WSI architectures for which regularity and modularity are important to achieve area-efficient layouts. The concept of systolic architecture is a general methodology rather than being an ad hoc approach — for mapping high-level computations into hardware structures. In systolic system data flows from the computer memory in a rhythmic fashion,
passing through many processing elements before it returns to memory, much as blood circulates to and from the heart. Systolic arrays derive their	computational	efficiency	from
multiprocessing and pipelining. The data items pumped into the systolic processors are reused many times as the move through the pipelines in the array, This results in balancing the processing and input/output bandwidths, a requirement for any parallel processing system for alleviating the von Neumann bottleneck.
The topologies for interconnecting the processing elements of a systolic array are many. Some of the most commonly used topologies, namely linear, mesh, triangular, hexagonal and tree structures are shown in Fig.
9-
Essentially, the whole operation of the systolic system is synchronized with a global clock and it may be visualized as a sequence of computation and data transfer cycles. Incidentally, the clock ^s the only global signal allowed in-systolic architecture apart from the power and ground lines [93]. During the data transfer cycle, all the processing elements pump data into the existing data channels, to be accepted by the neighboring processing elements connected to the data channels. After this cycle is over, all the processing cells enter the computation cycle where each of the cells computes concurrently till the end of the computation cycle. This sequence goes on rhythmicall and perpetually in striot synchronism with the olook beats. Systolic architectures thus capture the concepts of parallel processing, pipelining and regular interconnection structure in a unified framework (63 J.
Having all the desirable properties of an efficient special purpose system, the systolic architecture is an interesting area of research for a variety of applications, namely digital and image processing [62], linear algebra [28,69], database systems [63], computer graphics [67,96,97], computer-aided design, and solid modeling [60]. The systolic algorithms are characterized by repeated computations of a few types of relatively simpleoperations that are common to many input data items. Often, the algorithms can be described with nested loops or by recurrence equations that describe computations performed on indexed data.
The systolic architectures were originally conceived as devices capable of performing a specialized task.	Essentially,	these
architectures consist of a large number of identical processors, each having only a single arithmetic or logic operation build into its hardware. This greatly limits the applicability of a system to many areas. The latest trend in research in this direction is towards the development of systolic cells which are versatile enough to implement the compute-intensive functions. The design of programmable systolic cells (31] has been suggested as an effective way towards achieving high performance systolic systems. Each systolic cell now possesses a rich instruction set along with some amount of local storage, which were completely absent in the original versions of the systolic architecture. By suitably programming these systolic cells, a variety of operations can be performed. The programmable nature of the systolic cells offers a high degree of flexibility in operation and high performance.
Wavefront Array Processors The data movements in a systolic array are controlled by global timing-reference 'beats'. The burden of synchronizing the operations of the entire systolic, computing network becomes heavy for
very large arrays. A simple solution is to take advantage of the data flow computing principle t911 (which will be discussed in detail in Section 4.1), which leads the designer to wavefront array [64] processing.
The wavefront array combines systolic pipelining principle with the data flow computing concept. In fact, the wavefront array can be viewed as a static data flow array that supports the direct hardware implementation of regular data flow graphs. Exploitation of the data flow principle makes theextraction of paralXeliara and programming for wovnfront arrays relatively simpler.
Synchronizing with the global. cJock and consequently the large surge of current ?due to the simultaneous energizing or changing of states of the components) are two major problems in systolic arrays. These can be alliviated in wavefront arrays, because of their asynchronous nature. When the processiiig times of the individual cells are not uniform, a synchronous array may have to accommodate the slowest cell by using a. slower clock. In contrast, wavefront arrays, because of their data-driven nature, do not have to hold back faster cells in order to accommodate the slower one. Wavefront arrays also yield higher speed when the computing tiroes are data-dependent. Lastly, programming of wavefront arrays is easier than that of systolic arrays because wavefront arrays require only the assignment of computatione to processing elements, whereafi systolic arrays require both assignment and scheduling of computations.
3.6. Critique on von Heumann Systems
The most serious problem with the multiprocessors which use the von Neumann model, as discussed in [lO], is the presence of globally updatable shared memory. Special mechanisms are required to ensure correctness of results while updating a memory cell. The explicit execution order to be specified by the programmer is another bottleneck of von Neumann systems. This has led to research in non-von Neumann architectures.
4, Non-von Neumann Architectures
f
The principal stimuli for developing the non-von Neumann machines have come from the
pioneering work on data flow machines by	Jack
IJennis [26], and on reduction languages	and machines by John Backus [9] and Klaus Berkling
[15]. In data-driven model, the availability of operands triggers the execution of the operation to be performed on them, whereas in demand-driven model, the requirement for a result triggers the operation that will generate the result. Of late, the realization of the suitability of biological nervous systems for many applications and the use of artificial nets have triggered the design of a new system, the neural computer. In this section, we present the details of these three non-von Neumann approaches.
4.1, Data-Driven Model
In a data flow computer, an instruction is executed as soon as all its operands are available. Since the data availability solely dictates the execution order of instructions, there is no need for having a program counter. Also, the data are passed as values between instructions; this eliminates shared memory and makes the synchronization mechanism simpler.
Thus data fLow model alleviates the shortcoming of the von Neumann model. Also, parallelisa is exploited at instruction level. Hence, a very high Bpeed of computing Is possible.
The machine language for a data flow computer is the data flow graph [24]. The data flotj graph consista of nodes, to represent
operators, and arcs, to carry data between the nodes. Tokens carry data values along the arcs to the nodes. When all the required operands are available, the node is 'fired'. As a result, the input tokens are removed and wutput tokens are produced.
Data flow architectures are classified as static and dynamic architecture. In static model, an additional constraint — no token should be present on any of the output arcs of node — is required to enable the execution of •an instruction. The implioatioji of this is that the static data flow model oaTinot. support execution of reentrant routines. This severely restricts the extent of parallelism and asynchrony that can be exploited in the static model. In a dynamic oiodel, additional tags, called environment tags <eolor), are associated with the token to distinguish the various instantiations of a rentrant routine. Thus, the dynemio model supports fine grain parallelism with full asynchrony. In this paper we describe the architecture of a data flow coaputer, using the Manchester data flow computer [98} as an illustrative example.
In Fig. 10, the switch unit serves as an interface between the host computer and the data flow machine. It routes the intermediate results to the token queue and the final results to the host computer. The token queue unit is a FIFO buffer which smoothes the flow of tokens in the ring. Tokens in the token queue are checked for their operand type. All tokens directed to single input nodes are directly routed to the node store. Other tokens are sent to the matching unit.
Tokens that arrive in the matching unit search for their matching partner. If the search fails, the token is stored in the matching store; it awaits its partner. On the other hand, if the matching partner is found, it (the matching partner) is removed from the matching store; the incoming token is merged with its partner to form a group token. The group token which contains the information about the operand values, address (of the node to which the token is destined to) and environment tag is sent to the node store.
The node store unit stor&s the program graph; it stores the opcode for the operator, destination and operand type of result tokens. Tokens entering the node store get the above information to form an executable token. The executable packets are sent to the distributor unit which distributes the tokens to one of the free processing elementa, The processing element performs the operation specified by the token to produce result tokens. The result tokens are collected by the arbitrator and are sent to the switch unit. Fig. 10 depicts the block schematic of the Manchester data flow computer.
Research in the area of data flow computation is a rapidly expanding area in United States, Japan and Europe. There are a number of data flow projects that are underway in many universities. Some of them worth mentioning here are M.I.T. data flow computer [26] developed by Dennis, Irvine data flow machine [7,39) by Arvlnd, Manchester data flow system [98], its extended version (Extended MANchester architecture) proposed by the
Input
Input (oken , Result tokea.
FC
Host

I
Switch
►— Output Output token
Arbitrati
■ ■
Token
Tok«n queue
		network \
1		...11
Processing unit		
		...
\	Distribution/	
network
/
Executable packet
Group packet
Fig, 10 Block Schematic of Manchester Data Flow Computer
authors [74], Texas data processor [22].
Instruments distributed Utah data driven machine [23}, Toulouse LAU system [20,76], Newcastle data-control flow computer [89], the efficient static dataflow architecture for specialized computation proposed.by the authors f87], and the high speed data flow system developed by Nippon Telegraph and Telephone Systems [3],
Much work needs to be carried out in the design of languages for data flow machines, and implementation of compilers for converting the programs" into data flow graphs, (Refer [88] for the initial work on these issues.) Efficient methods to overcome the inherent overheads associated with exploiting . fine-grain parallelism have to be developed. Although many data flow machines have been proposed in the literature, no effort is made to prototype them. Demonstrating the feasibility of the data flow model of computation is thus a positive step towards the commercialization of such systems.
i.2. Demand-Driven Systems
In contrast to control flow and' data flow
programs which are built from fixed-size instructions, demand-driven (reduction) [91] programs are built from nested expressions,. The need for result triggers the execution' of a particular instruction.
An important point to note reduction machine, a program is equivalent to its value. Demandi of definition a, defined as x = means that the embedded reference rewritten in a simple form. This only one definition of x may program, and all references to it value, a property known as transparency [91],
is that, in a mathematically ng the result y+1) * (y-z),
to X is to be requires that occur in a give the same referential
There are two form of reduction, called string reduction and graph reduction. The basis for string reduction is that' each instruction that accesses a particular definition will take and manipulate a seperate copy of the function definition. Whereas, in graph reduction each instruction that accesses a particular definition will manipulate references to that definition. That is, graph reduction is based on sharing of arguments using pointers. In string reduction each access for a definition
Backing Store
DEQ
I* DEQ
h
F	>U		PU
DEQ
buffer il direc. :	npUt itate
Reduction Table	Action unit
	
	CP store
	
r* DEQ
h
PU
Memory unit ( Definitions)
Fig. 11 The Newcastle Reduction Machine
will result In the evaluation of the definition. Reduced definitions or data values are accessed when the demanded definition has already been evaluated. While graph reduction machine takes advantage of the shared definition (in term of the number of definitions evaluated), it is more complex than string reduction.
There are two basic problems in supporting reduction approach on a machine organization (90]: first, managing dynanicalljr the memory of the program structure being transformed and, second) keeping control information about information about the state of the transformation. The basic organization of a reduction machine, the Newcastle reduction machine (90], is presented below (refer Fig.
The reduction machine organisation discussed here supports reduction by expression evaluation. To find work, each processing element traverses the subexpression in its memory looking for a reducible expression. When a processing element locates a reference to be replaced by the corresponding definition, it sends a request to the communication unit via its communication element. The communication unit in such a computer frequently organized as a tree-structured network on the assumption that the majority of communications will exhibit properties of locality of reference. Concurrency in such reduction computers is related to the number of reducible subexpressions at any instant and also to the number of processing elements that traverse these expressions.
Apart from the pioneering work of Klaus Berkling £15], the stage of development of reduction computers somewhat lags behind that of data flow computers (91]. However, researchers have demonstrated the principle and feasibility of reduction machine organization by designing many prototype system such as the GMD reduction machine [58], Newcastle reduction machine (90J, Mago's cellular tree machine [66], Applicative Multiprocessing Systems (populary known as AMPS) (57], and Combridge University SKIM reduction machine (921.
4,3. Neural Computerà
In the fields of image processing and speech recognition, the ability to adapt and continue learning is essential. Traditional techniques used in these applications are not adaptive. It has been realized that the biological nervous system is more suitable for
applications involving pattern recognition and learning [2]. Artificial neural nets have been studied during the last few years in the hope of achieving human like performance in the fields of image processing and speech recognition. The neural net models [651 attempt to achieve good performance via dense interconnection of simple computational elements. The interest in this type of non-von Neumann computing techniques in recent years is due to the development of new net topologies and algorithms, new analog VLSI implementation techniques and the growing fasclnantion for the understanding of the functioning of the human brain as well as the realization that humanlike performance is required for applications involving enormous amount of processing [51,65]. Several mathematical models have been proposed to exhibit some of the essential qualities of human mind: the ability to recognize patterns and relationships, to store and use knowledge, to reason and plan, to learn from experience and to undrstand what is observed,
Neural net models are specified by the net topology, node characteristics and training or learning rules. The computational elements or nodes used in neural net models have nonlinear characteristics, typically analog, and are specified by the type of nonlinearity. Most net algorithms also adapt in time to inprove performance based on current results. Any artificial neural model must necessarily be a speculation: definitive experimental evidence about the structure and function of the neurological circuitry in the brain is extremely difficult to obtain since it is hard to measure the neural activity without interfering with the flow of information In the neural circuit. Further, the neurone are intricately interconnected and the flow of information is complicated by the presence of multiple feedback loops.
Nevertheless enough is known about some parts of the brain to fuel the desire for constructing mathematical models of the neural circuit. In general, the models propose to generate a sensory-activated goal-directed behavior and control a multilevel hierarchy of computing modules. At each level of hierarchy, input commands are decomposed into strings of output subcommands, that form input commands to the next lower level. Feedback from external environment or from internal sources drives the decomposition process, and steers selection of subcommands to achieve the goal successfully (refer Fig. 12),
Intermediale commands
Output commands
Pig. 12 Neural Net Modules
The benefits of neural net include high computation rates, provided by massive parallelism and a greater degree, of fault-tolerant since there are many processing nodes each with primarily local connections. Designing artificial neural nets to solve problems and studjrlns real biological nets may also change the way we think about problems and lead us to new Insights and algorithmic improvements.
5 Software Issues „„Related tQ_„MultÌBroces8Ìng
Having	discussed the	various
multiprocessing systems, it is worth probing further into two aspects of parallel processing: the language to program and the operating system support to handle these complex systems.
5.1. Parallel Programming Languages
One of the motivations behind the development of concurrent languages has been the structuring of software — in particular, operating system — by means of high-level language constructs. The need for liberating the production of real-time applications from assembly language has been another driving force. In the following discussion, we classify the concurrent high-level languages into two groups: 'traditional' languages of the von Neumann type (based on imperative style of programming) and the unconventional languages, such as data flow, functional and logic-based languages. A quick- review of some of these ■ languages Is presented below.
Conventional Parallel Languages Based on the imperative style of programming, these languages are just the extensions of their sequential counterparts. A concurrent language should allow programmers to define a set of sequential activities to be executed in parallel, to initiate their evolution and to specify their interaction (refer [4] for an excellent survey on the concepts and notations for parallel programming). One important point regards the 'granularity of parallelism', i.e. the kinds of granules that can be processed in parallel. Some languages specify concurrency at statement level, and certain others at task level. Constructs for specifying inter-activlty interaction are probably the most critical linguistic aspects of concurrency. Language constructs ensuring mutual exclusion are called synchronization primitives. Some of the best-known and landmark solutions that have been adopted to solve these problems are the semaphores [27], mailboxes, monitors [4d] and remote procedure calla [46]. Research in this direction is towards designing new machanisms for interprocessor communication, such as ordered ports. [13J, In the following discussion we restrict ourselves to a few parallel programming languages and their salient features.
Communicating Sequential Processes (CSP) [50] is a language designed especially for distributed architectures. In CSP, activities communicate via input/output commands. Communication requires both the participating processes to issue their commands. Also CSF achieves process synchronization using the input/output commands. Another interesting feature of., this language is its ability to express non-determinism using guarded commands. An implementation of a subset of CSP [73] has been has been successfully attempted by the
authors' research group. Their work on the design of an architecture to execute CSP is reported in [79].
Distributed Proc^ses (DP) [46], developed by Hansen, is proposed for real-time applications controlled by microcomputer networks with distributed storage. In DP, a task consists of a fixed number of subtasks that are started simultaneously and exist forever. A process can call common procedures defined within other processes. These procedures are executed when the other processes are waiting for some conditions to become true. This is the only means of communication among the processes. Processes
are synchronized by means of guarded commands.
non-deterministic
- /
Occam language [55], which is based on CSP has also been designed to support concurrent applications by using concurrent processes running in a distributed architecture. These processes communicate through channels. The Transputer [56), also developed by Inmos Corporation, supports the direct execution of this language.
Languages which adopt monitor-based solution for synchronization are oriented towards architectures with shared memory. Examples of these languages are Concurrent Pascal [45J, Ada 15!, and Modula [991, These languages, in general, support strong type checking and seperate compilation, and express concurrent actions using explicit constructs.
Concurrent Pascal [45] extends the sequential programming language Pascal using the concurrent programming tools, processes and monitors.. The main contribution of this language is extending the concept of the monitor using an explicit hierarchy of access rights to shared data structures that can be started in the program text and checked by a compiler.
The programming language Modula [99] is primarily intended for programming dedicated computer systems. This language borrows many ideas from Pascal, , but in addition to conventional block structure, it introduces a so-called module structure. A module is a set of procedures, data types and variables where the programmer has precise control over the names that are imported from and exported to the environment f99]. Modula includes general multiprocessing facilities such as processes interface modules and signals.
In Ada [5], we only have active components, the tasks. Information may be exchanged among tasks via entries. An entry is very similar to a procedure. The call to an entry is like a procedure call; parameters should be passed if the called entry requires them. A randezvous is said to occur when the caller is in the call state and the called task is in the accept state. After executing the entry subprogram both the tasks resume their parallel execution. Ada provides specific and elaborate protocols for task termination. Ada is designed to support reliable and efficient real-time programming.
Non-von Neumann Languages Conventional languages imitate the von Neumann computer. The dependency of these languages on the basic organization of the von Neumann machine is essentially a limitation to. express and exploit parallelism [10], These imperative languages perform a task by changing the state of a system rather than modifying the data directly. In parallel processing applications, it makes more sense to use a language with a
nonsequential semantic base. Various paradigms have been adopted and new programming languages based on these approaches have evolved. We will restrict ourselves in this paper to two suoh paradigms, namely the applicative and the nonprocedural style of progratnining I and the resulting parallel versions of the languages that adopt these approaches.
Applicative languages (also referred to as functional languages) avoid side-effects, such as those caused by an assignment statement. The lack of side-effects accounts, at least partially, for the well-known Church-Roaaer property, which essentially states that no matter what order of computation is chosen in executing a program, the program is guaranteed to give the same result (assuming termination). This marvellous deterininacy property is invaluable in parallel systems. Another key point is that in functional languages the parallelism is implicit and supported by their underlying semantics.
A system of languages known as Functinal Programming languages (FP) [10] and Lisp [68] are two major outcomes of the applicative style of programming. Languages for data flow architectures, which avoid side-effects and encourage single assignment, are also included in the set of applicative languages. Dennis' Value oriented Algorithmic Language (VAL) [1], Arvind's Irvine Data flow language {Id> {8], and Keller's Flow Graph Language (FGL) [57] are candidate examples in this category.
Considerable work has been done by us in the area of aplicative programming languages. A high level language for data flow computers, called Data Flow Language (DFL) [72], has been proposed by us, and a compiler to convert the programs written in this language into data flow graphs has been implemented. The concepts borrowed from CSP and DP when embedded into data flow systems results in two new languages for	distributed	processing,	namely
Communicating Data Flow Channels (CDFC) and Distributed Data Flow (DDF) respectively [75], Communication and non-determinism features have been added to FP by us (40,41] to strengthen its power as a programming language. We have also proposed that FP can be used as a language for program specification [41],
Although parallelism in a program is expressed by the functional languages in a natural way, their automatic detection and mapping to processors do not result in optimal performance. It is desirable to provide the user with the ability to explicitly express parallelism and mapping, retaining the functional style of programming. Languages which allow the programmers to annotate the parallelism and mapping scheme for the target architecture lead to optimal performance on a particular machine. Two languages developed with this motivation are ParAlfl (the Para-Functional language) [52] and Multilisp [44]. Efforts have been taken to exploit the advantages offered by the functional languages to the maximum extent by developing new machines based on non-von Neumann architecture (refer [95] for a recent survey).
Applications such as the design of knowledge base systems and natural language processing revealed the inadequacies of the conventional programming languages to offer elegant solutions. The use of predicate logic, which is a high-level human oriented language for describing problem and problem solving methods for computers, promised great scope for these applications. Logic programming languages combine simplicity with a lot of powerful features. They separate the logic and control
[59], the two major components of an algorithm, and thus enable the programmer to write more correct, more easily improved and more readily adapted programs. The powerful features of these languages also include the declarative style, the unification mechanism for parameter passing and execution strategy offered by non-determiniatic computation rule. The powerful execution mechanism provided by these languages is due to the non-procedural paradigm. An outcome of the research carried out in this area with these motivations is the design of the language Prolog [19].
Logic programming languages offer three kinds of parallelism, namely the 'AND', 'OR' and 'argument' parallelism [21,43), The inability of the von Neumann architecture to efficiently execute logic programming language (in essence supporting non-procedural paradigm) has led to the design of many parallel logic programming machines (16,43,70,94,100). Further research on these languages has led to the design of three parallel logic programming languages, the PARLOG (PARallel LOCic programming) [18], P-Prolog [103] and Tonourrftnt. Prolo« [84),
With the increasing size and complexity of parallel processing systems, it becomes essental to design efficient operating systems, without which handling of such systems would be impossible. The general principles and requirements of multiprocessor operating systems are discussed in the following section.
5.2. Multiprocessor Operating Systems
The basic goals for an operating system are to provide programmer-interface to the machine, manage resources, provide mechanisms to implement policies, and facilitate matching applications to the machine. There is conceptually little difference between the operating	system requirements of a
multiprocessor and those of a large computer system with multiprogramming. The operating system for a multiprocessor should be able to support multiple asynchronous tasks which execute concurrently, and hence is more complex.
The functional capabilities of a multiprocessor operating system include resource allocation and management schemes, memory and data set protection, prevention of system deadlock and abnormal process termination or exception handling and processor load-balancing. Also, the operating system should be capable of providing system reconfiguration schemes to support graceful degradation of performance in the event of a failure. In the following discussion, we introduce	brifly the	three	basic
configurations, namely master-slave, separate supervision and floating-supervision systems [53] .
In a 'master-slave' configuration, one processor, called the master, maintains the status of all processors in the system and apportions the work to all the slave processors. Service calls from the slave processors are sent to the master for executive service. Only one processor (the master) uses the supervisor and its associated procedures. The merit of this configuration is its simplicity. However, parallel processing system which has the master-slave configuration is susceptible to catastrophic failures, and a low utilization of the slave processors may result if the master cannot despatch processes fast enough. Cyber 170 and DEC System 10 use this mode of operation. The master-slave
configuration is most effective for special applications where the work load is well-defined.
In a 'seperate supervisor system,', each processor contains a copy of a basic kernel. Each processor services its own needs. However, since there is some interaction among the processors, it is necessary for some of the supervisory code to be reentrant, unlike in the master-slave mode. ■ Separate supervisor mode is more reliable then master-slave mode. But the replication of the kernel in all the processors causes an under-utization of memory,
'The 'floating supervisor' scheme treats all the processors as well as other resources symmetrically or as an anonymous pool of resources. In this mode, the supervisor routine floats from one processor to another, although several of the processors may he executing service routines simultaneously. Conflicts in service requests are resolved by priorities. Table access should be carefully controlled to maintain the system integrity. The floating supervisor mode of operation has the advantages of providing graceful degradation and better availability of reduced capacity systems. Furthermore, it is flexible and it provides true redundancy and makes the most efficient use of available resources. Examples of the operating systems that execute in this mode are the MVS and VM in the IBM 3081 and the Hydra [102] on the C .lump.
6. Conclusions
In this survey, we have identified the various issues involved in parallel processing systems. Approaches followed to solved the associated problems have also been discussed and their relative merit are put forth. The principles and the requirements of language and operating system support for complex multiprocessing systems are elaborately described, For the wide spectrum of architectures proposed in the litarature, their design principles and salient features are brought out in a comparative menner.
While the envisaged potentials offer a promising scope for parallel processing systems for many applications, hardly a few systems are commercialized. The reasons fot this is the lack of good software support for these systems. Design of intelligent compilers which can identify parallel subtasks in a program (written in a sequential language), schedule the subtasks to the processing elements and manage communication among the scheduled tasks, is a step toward this end. Although there are many existing proposals in this line, none of them seems to achive all the three goals in an integrated manner, relieving the burden from the user completely.
Another question that remains unanswered is whether or not to continue with von Neumann approach for building complex parallel processing machines. While familiarity and the past experience with control flow model make it a proponent candidate, ' its inherent inefficiencies, .such as the explicit specification of control and global updatable memory, limit its capabilities. Although data-driven and demand-driven computers exploit maximum parallelism in a program, their complex structure and inadequate software support force the designer to have a second throught on these approaches.
With the advent of VLSI technology and RISC design, dedicated architectures are becoming more and more popular. However, the
inapplicability of these systems to a variety of applications causes a serious concern. At the other end of the spectrum, we have general purpose parallel processing systems which give degrade performance due to the mismatch of the architecture and algorithm, and the reconfigurable machines. Considerable and on design efficient algorithms (for g,eneral purpose computing systems) which will bridge the gap between the application program and architecture.
Finally, the research on neural computer and molecular machines is at its infancy. Modeling the neural circuits and understanding the functioning of human brain have to be considerable refined before one could make use them for building high speed computing systems.
The vastness of this fascinanting area in which active research is underway, and the innumerable problems that remain to be solved are themselves standing evidences for the promising future of parallel processing. With the ever-growing greed for very high speed of computing, and with the inability of the switching devices to cope up with the need, parallel processing techniques seem to be the only alternative.
7. References
1.	W.B. Ackerman and J.B. Dennis, "VAL - A Value Oriented Algorithmic Language, Preliminary Reference Manual", Tech. Rep. TR-218, Lab. for Computer Science, M.I.T. Cambridge, MA, June 1979.
2.	J.S, Albus, Brains, Behavior and Robotics, Byte Books, McGraw-Hill, New York, NV, 1981.
3.	H. Ammamiya, R. Hasegawa and H. Mikami, "List Processing and Data Flow Machines", Lecture Note Series, No.436, Research Institute for Mathematical Sciences, Kyoto University, September 1980.
4.	G.R. Andrews and F.B. Schneider, "Concepts and Notations for Concurrent Programming", Computing Surveys, Vol.15, No . 1, pp.3-43, March 1983.
5.	A(JSI/MIL-STD 1815A, Reference Manual for Ada Programming Language, 1983.
6.	C.N. Arnold, "Performance Evaluation of Three Automatic Vectorizer Packages" , Proc. Int'l Conf. Parallel Processing, pp.235-243, 1982,
7.	Arvind and K.P. Gostelow, "A Computer Capable of Exchanging Processors for Time", Proc, IFIP Congress, pp.849-854, 1977 ,
8.	Arvind, K.P. Gostelow and W. Plouffe, "An Asynchronous Programming Language and Computing Machine", Tech. Rep, TR-ll4a, Dept. Information and Computer Science, University of California, Irvine, CA, December 1978.
9.	J. Backus, "Reduction Languages and Variable Free Progratnming" , Rep. RJ.lOlO, IBM T.J. Watson Research Center, Yorktown Heights, NY, April 1972.
10. J. Backus, "Can Programming be Liberated from von Neumann Style? A Functional Style and its Algebra of	Programs",
Communications of the ASM, Vol.21, No.8, pp.613-641, August 1978.
11.	W.L, Bain Jr. and S.R. Ahuja, "Performance Analysis of High-Speed Digital Busses for Multiprocessing Systems", Proc. 8'" Ann. Symp. Computer Architecture, pp.107-131, May 1981.
12.	O.H. Barnes, R.M. Brown, M, Kato, D.J. Kuck, D.L. Slotnick and R.A. Stokes, "The ILLIAC IV Computer", IEEE Trans. on Computers, Vol.C-17, No.8, pp.746-757, August 1968.
13.	J. Basu, L.M. Patnaik and A.K. Goswarai, "Ordered Ports - A Language Construct for High Level Distributed Programming", The Computer Journal, Vol.30, 1987.
14.	K.E. Batcher, "Design of	a Massively Parallel Processor", IEEE	Trans. on Computers, Vol.C-29, No.9,	pp.836-840, September 1980.
15.	K. Berkling, "Reduction Languages for Reduction Machines", Proc. Z»« Int'l Sytnp. Computer Architecture, Januar 1975.
16.	L. Eie, "A Data-Driven Model for Parallel Implementation of Logic Programs", Dept. Information and Computer Science, University of California, Irvine, CA, January 1984.
17.	R.P. Case and A. Padegs, "Architecture of the IBM System 370", Communication of the ACM, Vol.21, No.l, pp.73-96, January 1978.
18.	K. Clark and S. Gregory, "PARLOG: PARallel Programming in LOOic", ACM Trana. on Programming Languages and Systems, Vol.8, No.l, pp.1-49, January 1986.
19.	W.F. Clocksim and C.S. Hellish, Programming in Prolog. Springer-Verlag, New York, NY, 1984.
20.	D. Comte, A. Durrieu, 0. Oelly, A. Pias and J.C. Syre, "TEAU 9/7 SYSTEME LAU -Summary in English", CERT Tech. Rep. #1/3059, Centre d'Etudes et de Recherches de Toulouse, October 1976.
21.	J.S. Conery and D.F. Kibler, "AND Parallelism and Nondeterminism in Logic Programs", New Generation Computing, Vol.3, No.l, pp.43-70, 1985.
22.	M. Cornish, "The TI Data Flow Architectures: The Power of Concurrency for Avionics", Proc. 3^" Conf. Digital Avionics Systems, pp.19-25, November 1979.
23.	A.L. Davis, "The Architecture and System Method of DDMl; A Recursive Structured Data Driven Machine", Proc. 5'" Ann. Symp. Computer Architecture, pp.210-215, April 1978.
24.	A.L. Davis and B.M. Keller, "Data Flow Graphs", Computer, Vol,15, No.2, pp.26-41, February 1982.
25.
'26.
Deneclor, Processor : 1981.
Ine. Heterogeneous Element Principles of Operation, April
J.B. Dennis and D.P. Misunas, "A Preliminary Architecture for a Basic Data Flow Processor", Proc. 2" Int'l Symp. Computer Architecture, pp.126-132, January 1975.
27.-E.W.
Dijkstra,
Discipline
of
Programming. Prentice-Hall Inc., Englewood Cliffs, NJ, 1976.
28.	B.L. Drake, F.L. Luk, J.M. Speiser and J.J. Symguski, "SLAPP; A Systolic Linear Algebra Parallel Processor", Computer, Vol.20, No.7, pp.45-49, July 1987.
29.	M.J. Duff, 'CLIP-4", in Special Computer Architecture for Pattern Recognition, K.S. Fu and T. lohigawa (Eds.) CRC Press, 1982.
30.	T.V. Feng, "A Survey of Interconnection Networks"Computer, Vol.14, No.12, pp.i2-27, December 1981.
31.	A.L. Fisher, H.T. Kung, L.M. Monier, H. Walker and y. Dohi, "Architecture of the PSC: A Progrnmmable Systolic Chip", Proc. 10<» Ann. Symp. Computer Architecture,
1983.
32.	J.D. Foley and A. van Dam, Fundamentals of
Interactive_Computer_Graphics. Addieon-
Wesley, Reading, MA, 1982.
a;). J. A, 11. Fortns rind H.W. Wnh, "Systolic-Arrays -- From Concept to Implementation", Oueät Editor's Introduction, Computer, Vo!.20, No.7, pp.12-17, July 1987.
34.	D.D. Gajski and J. Peir, "Essential Issues in Multiprocessor Systems", Computer, Vol.18, No.6, pp.9-27, June 1985.
35.	C. Ghezzi, "Concurrency in Programming Languages: A Survey", Parallel Computing, Vol.2, pp.229-241, 1985.
36.	D. Ghosal and L.M. Patnaik, "Performance Analysis of Hidden Surface Removal Algorithms on MIMD Computers", Proc. IEEE Int'l Conf. on Computers, Systems and Signal Processing, pp.1666-1675, December
1984.
37.	D. Ghosal and L.M. Patnaik, "Parallel Polygon Scan Conversion Algorithms: Performance Evaluation on a Shared Bus Architecture", Computers and Graphics, Vol.10, No.l, pp.7-25, 1986.
38.	D. Ghosal and L.M. Patnaik, "SHAMP: An Experimental Multiprocessor System for Performance Evaluation of Parallel Algorithms",	Multiprocessing	and Microprogramming, Vol.19, No.3, pp. 179192, 1987,
39.	K.P. Gostelow and R.E. Thomas, "Performance of a Data Flow Computer", Tech. Rep. 127a, Dept. Information and Computer Science,	University	of Califirnia, Irvine, CA, October 1979.
40.	A.K. Goswami and L.M. Patnaik, "An Algebraic Approach Towards Formal Development of Functional Programs", accepted for publication. New Generation Computing.
41.	A.K. Goswami, "Non-Determinism and Communication in Functional Programming Systems: A Study in Formal Program Development", Ph.D. Dissertation, Dept. Computer Science and Automation, Indian Institute of Science, Bangalore, India, October 1985.
42.	A. Gottlieb, R. Grishman, C.P. Krushkal, K.P. McAaliffe, L. Randolph and M. Snir, "The NYU Ultracomputer — Designing an MIMD Shared Memory Parallel Computer", IEEE Trans. on Computers, Vol.C-32, No.2, pp.175-189, February 1983.
43.	Z. Halim, "A Data-Driven Machine for OR Parallel Evaluation of Logic Programs",
New Generation pp.5-33, 1986.
Computing, Vol.4, No.l,
45.
46.
47,
48.
49.
50.
51.
52.
53.
54.
55.
56.
R.H. Halstead Jr., "Multilisp: A Language for Concurrent Symbolic Computation", ACM Tr&ns. on Programming Languages and Systems, October 1985.
P.B. Hansen, "The Programming Language Concurrent Pascal", IEEE Trans. on Software Engineering, Vol,SE-1, Nò.Z, pp.199-209, June 1975.
P.B. Hansen, "Distributed Concurrent	Programming
Communications of the ACM, pp.934-941, November 1978.
Processes: A Concept", Vol.21, No.11,
J.P. Hayes, R. Jain, W.R. Martin, T.N. Mudge, L.S. Scott, K.G. Shin and Q.F. Stout, "Hypercube Computer Research at the University of Michigan", Proc. Second Conf.	Hypercube	Multiprocessors,
September/October 1986.
W.D. Hillis, The Connection Machine. M.I.T. Press, Cambridge, MA, 1986.
C.A.R. Koare, "Monitors: An Operating System Structuring Concept", Communication of the ACM, Vol.17, No.10, pp.549-557, October 1974.
C.A.R. Hoare, "Communicating Sequential Processes", Communication of the ACM, Vol.21, No.8, pp.666-677, August 1978.
J.J. Hopfieed and D.W. Tank, "Computing with Neural Circuits", Science, pp.625633, August 1986.
P. Hudak, "Para-Functional Programming", Computer, Vol.19, No.8, pp.60-70, August 1986.
K. Hwang and F.A. Briggs, Computer Architecture and Parallel Processing. McGraw-Hill Book Company, New York, NY, 1984.
A.K. Jones and E.F. Gehringer, "Cm* Multiprocessor Project: A Research Review", Tech. Rep. CMU-CS-80-131, Carnegie-Mellon University, July 1980.
Inmos Ltd., Occam Reference Manual. 1986.
Inmos Ltd., transputer Reference Manual. Ref. No. 72TRN 04802, 1986.
57.	R.M. Keller, S. Patii and G. Lindstrom, "A Loosely.	Coupled	Applicative Multiprocessing System", Proc. Nat. Computer Conf., AFIPS, pp.861-870, 1979.
58.	W.E. Kluge and H. Schlutter, "An Architecture for the Direct Execution of Reduction Languages", Proc. Int'l Workshop High-Level Language Computer Architecture, pp.174-180, May 1980.
59.	R.A. Kowalski, "Algorithm = Logic + Control", Communication of . the ACM, Vol.22, No.7, pp.424-435, July 1979.
60.	D. Krishnan and L.M. Patnaik, "Systolic Architecture for Boolean Operations oh Polygons and Polyhedria", Eurographics, The Guropian Association for Computer Graphics, Vol.6, No.3, July 1987.
61.	C.P. Kruskal and A, Weiss, "Allocating Independent Subtasks on	Parallel Processors", Proc. Int'l Conf. Parallel Processing, pp.236-240, August 1984.
62.	H.T. Rung and S.W. Song, "A Systolic 2D Convolution Chip", Dept. Computer Science, Carnegie Mellon University, Tech. Rep. CMU-CS-81-110, March 1981.
63.	H.T. Kung, "Why Systolic Architectures?",. Computer, Vol.15, No.l, pp.37-46, January 1982.
64.	S.Y. Kung, S.C. Lo, S.N. Jean and J.N. Hwang, "Wavefront Array Processors — Concept to Implementation", Computer, Vol.20, No.7, pp.lÖ-33, July 1987.
65.	K.P. Lippmann, "An Introduction to Computing with Neural Nets", IEEE ASSP, pp.4-20, April 1987.
66.	G.A. Mago, "A Network of Multiprocessors to Execute Reduction Languages", Int'l J. Computer and Information Science, Part 1: Vol.8, No.5, pp.349-385, Part 2: Vol.8. No.6, pp.435-571, 1979.
67.	P.C. Mathias and L.M. Patnaik, "A Systolic Evaluation of Linear, Quadratic and Cubic Expressions", accepted for publication, Journal of Parallel and Distributed' Computing.
68.	J. McCarthy, et.al., LISP 1.5 Programmer's Reference Manual. M.I.T. Press, Cambridge, MA, 1965.
69.	C.A. Mead and L.A. Conway, Introduction to VLSI Systems, Addison Wesley, Reading, MA, 1980.
70.	R. Onai, M. Aso, H. Shimizu, K. Masuda and A. Matsumoto, "Architecture of a Reduction Based Parallel Inference Machine: PIM-P", New Generation Computing, Vol.3, No.2, pp.197-228, 1985.
71.	D.A. Padua, D.J. Kuck and D.L. Lawrrie, "High Speed Multiprocessor and Compilation Techniques", IEEE Trans, on Computers, Vol,C-29, No.9, pp.763-776, September 1980.
72.	L.M. Patnaik, P. Bhattacharya and R. Ganesh, "DFL: A Data Flow Language", Computer Languages, Vol.9, No.2, pp.97106, 1984.
73.	L.M. Patnaik and B.R. Badrinath, "Implementation of CSP-S for Description of Distributed Algorithms", Computer Languages, Vol.9, No.3/4, pp.193-202, 1984.
74.	L.M. Patnaik, R. Govindarajan and N.S, Ramadoss, "Design and Performance Evaluation of EXMAN: An Extended MANchester Data Flow Computer", IEEE Trans. on Computers, Vol.C-35, No.3, pp.229-244, March 1986.
75.	L.M. Patnaik and J. Basu, "Two Tools for Interprocess Communication in Distributed Data Flow Systems", The Computer Journal, Vol.29, No.6, pp.506-521, December 1986.
76.	A. Pias, et.al., "LAU System Architecture: A Parallel Data Driven Processor Based on Single Assignment", Proc. 197 6 Int'l Conf. Parallel Processing, pp.293-302, August 197B.
77.	e.V. Ramamoorthy and H.F. Li, "Pipeline Architectures", Computing Surveys, Vol.9, No.l, pp.61-102, March 1977.
78.	C.P. Ravikumar and L.M. Patnaik, "Parallel Placement Based on Simulated Annealing",
IEEE Infi Conf. Computer Design: VLSI in Computers and Processors, October 1987,
79.	C.P. Ravihutnar and L.H. Patnaik, "An Architecture for CSP and its Simulation", Int'l Conf, Parallel Processing, 1987,
80.	C. Rieger, "ZMOB: Doing it in Parallel", Proc. Workshop Computer Architecture for PAIDM, pp.119-125, 1981.
81.	A. Rosenfield, "Parallel Image Processing Using Cellular Arrays", Computer, Vol.16, No.l, pp.14-20, January 1983.
82.	R.H. Russel, "The Cray-1 Computer System", Communications of the ACM, Vol.21, No.l, pp.63-72, January 1978.
83.	C.L. Seitz, "The Cosmic Cube", Communication of the ACM, Vol.28, No.l, pp.22-33, January 1985.
84.	E.Y. Shapiro, "Concurrent Prolog: A Progress Report", Computer, Vol.19, No, 8, pp.44-58, August 1986.
85.	L. Snyder, "Introduction to Configurable Highly Parallel Computer", Computer, Vol.15, No.l, pp.47-64, January 1982,
86.	J.S. Squire and S.M. Palais, "Proflrammin« and Design Considerations Cor a Highly Parallel Computer", Proc. AFIP Conf., Vol.Z3, pp.395-400, 1963.
87. J. Silc and B. RobiC, Dataflow Architecture Computatins"p Proc. Congress on Scientific France, July 1988.
"Efficient Static for Specialized 12"' IMACS World Computing, Paris,
8S. K.H. Traub, "A Compiler for M.I.T. Tagged-Token Data Flow Architecture", M.S. Thesis, M.I.T., Cambridge, MA, 1986.
89.	P.C. Treleaven, "Principle Components of a Data Flow Computer", Proc. 1978 Euromicro Symp., pp.366-374, October 1978.
90.	P.C. Treleaven and G.F. Mole, "A Multiprocessor Reduction Machine for User-Defined Reduction Languages", Proc. 7*" Int'l Symp. Computer Architecture, pp.121130, 1980.
91.	P.C. Treleaven, D.R, Brownbridge and R.P. Hopkins, "Data-Driven and Demand-Driven Computer Architecture", Computing Surveys, Vol.14, No.l, pp.93-143, March 1982.
92.	D.A. Turner, "A New Implementation Technique for Applicative Languages", Software Practice and Experience, Vol.9, No.5, pp.31-49, September 1979.
93.	J.D. Ullman, Computational Aspect of VLSI, Computer Science Press, 1984,
94.	S. Umeyama and K, Tamura, "Parallel Execution of Logic Programs", Proo, 10« "i Ann. Symp. Computer Architecture, pp.349355, 1983.
95.	S.R. Vehdahl, "A Survey of Proposed Architectures for the Execution of Functional Languages", IEEE Trans. on Computers, Vol.C-33, No.12, pp.1050-107 1 , December 1984.
96.	A.G. Venkataramana and L.M. Patnaik, "Design and Performance Evaluation of a Systolic Architecture for Hidden Surface Removal", accepted for publication. Computer and Graphics.
97.	A.G. Venkataramana and L.M. Patnaik, "Systolic Architecture for B-Spline Surfaces", accepted for publication, International Journal of Parallel Programming,
98.	I. Watson and J. Curd, "A Prototype Data Flow Computer with Token Labelling", Proc. Nat. Computer Conf. New York, AFIPS Press, pp.623-628, 1979.
99.	N. Wirth, "Modula; A Programming Language for Modular Multiprogramming", Software Practice and Experience, Vol.7, No.l,
■ pp.3-35, January 1977.
100. M.J. Wise,	"A Parallel	PROLOG: The
Construction	of Data-Driven Model", ACM
Conf. LISP	and Functional	Programming, 1982.
101. W.A. Wulf and C.G. Multi-miniprocessor", Joint Computer Conf., 1972.
Bell, "C.ramp — A Proc. AFIPS Fall Vol,41, pp.765-777,
102.	W.A. Wulf, et.al., "Overview	of the Hydra Operating System", Proc.	5'" Symp. Operating System Principles,	pp.122-131, November 1975.
103.	R. Yang and H. Aiso, "P-Prolog: A Parallel
Logic Language Relation", Vol.5, No.l
Based on New Generation pp.79-95, 1987.,
Exclusive Computing,
THE RELATION BETWEEN A POINT AND A SIMPLE POLYHEDRON
INFORMATICA 2/88
UDK 519.7
Dušan Surla Prirodno-matematički fakultet Novi Sad
ABSTRACT; The notion Is introduced of the piercintr point of ä straight line i through the hull of a simple polyhedrofi P. An Intersection point of i and the hull of P is considered to be the piercing point if the straight lino at the Intersection point passes fi-oin the Interior to the eKterioi* domain of P> or vice versa. It is proved that the point Z belongs to the interior domain of P if an arbitrary straiir^t line i, passing through Z, contali^ an odd number of piercing points on the located on the one side of Z. On the basis . of this statement, an effective procedure is formed for determination whether a given point belongs to the interior domain of P. An algorithm based on this procedure, enables determination of those intervals on the straight line which belong to the interior domain of P.
■Key wards: Computational geometry. Point-Location. Polygon, Polyhedron.
ODNOS TACKE I PROSTOG POLIEDRA. Uveden Je pojam prodorne taCke prave t kroz omotaC prostog poliedra. Presečna taCka izmedju i i omotača od P Je prodorna ako prava u presefinoj tački prelazi iz spolJaSnje u unutraSnJu iii iz unutrašnje u spoljažnju oblast od P. Dokazano je tvrdjenje da tačka Z pripada unutrašnjoj oblasti od P, ako proizvoljna prava t koja prolazi kroz Z sadr'i sa iste strane od Z neparan broj prodornih taCaka. Na bazi ovog tvrdjenja formirana je efektivna procedura' za ispitivanja da li data tačka pripada unutrašnjoj oblasti od P, Koristeći ovu proceduru opisan je algoritam za izdvajanju intervala na pravoj koji pripada unutraSnjoj oblasti od P.
1. Introduction
One of tto most fundamental problems in computational geometry is the so-called fvrirU-Ì9C,<Uion. ivutSi&n.. To the present, the attention has been mainly paid to location of a point in a planar subdivision. The problem can be stated as follows: Given a straight-line graph Ö having n vertices, and a point 2, determine what is the' region of the planar subdivision induced by 0 in which Z is found. Hain results on the study of this problem have been presented in [1-8K Much less attention has been paid to solving the problem in three- and multi-dimensional space. In [93, the problem has been solved by generalization of the binary search, whereas in 110] the data structure ha«s been described by a sequence of similar lists formed in such a way to ensure each list to be binary searched.
Ve consider first the following problem: Oiven a point 2 and a simple polyhedron P, determine whether the point Z belongs to the interior domain of P. First,, we shall describe the data structure to represent the
polyhedron, and adopt a doubly-connected list. In addition, we shall present a procedure for separation of those edges which define a corresponding facet of P. Further, we describe in detail a new algorithm and the appropriate procedures by which it is possible to determine whether Z belongs, to the interior domain of P.	.
2. Data Structure
The data structure adapted for representation
of a polyhedron is of the form of a planar
graph G«I<V,E>, where V and E are the sets of
vertices and edges, respectively, of a
polyhedron embeded in a plane. The graph O is
represented as a doubly-connected edge list
CDCEL5 C11]. The main component of this graph is an edge node which contains 6 fields: VI,
V2, Fl, P2, PI, and P2. The fields VI and V2
define the edges which are oriented from VI
to V2. The fields Fl and P2 contain the names
of facets of P lying on the left, i.e. on the
right, with respect to the edge orientation
from VI to V2. The field PI (resp. P2> paints
out to the edge node which is encountered
first if the edge <V1,V2> is rotated
cóut«rcloclcwise around VI Cresp. V2>.
Suppose that FACE <JrA,n > Is the procedui-e for separation of the edges detenni ni ng the J-th facet of P, where Aj, contains the addresses of the ordered edges which deternine the J-th facet of P.
3. Description of the Alsorithm
Denote the vertices, edges and facets of P by V^. i-n,...,(Vj; Ej, i-l,...,)E|; Pj.
respectively,. I>enote with t a half-straight line having the original In the point Z. The Intersection point of t and the hull of P is a fUe'vUng point If at this point t passes from the interior to the exterior domain of P, or vice versa. Since i is unlimited in one directloni there Is a point on t belonging to the exterior -domain, such that all piercing points of Z through the hul 1 of P and the query po 1 nt Z are on the sane side of that point. Starting from this point and going toward Z, at each odd piercing point t enters the interior domain ' of P and each even point i comes out of it. Therefore, the following statement holds: If the total nuaiber of piercing points of I through the hull of P Is odd, the query point Z belongs to the Interior domain of P.
If t lies in the plane of a facet, then that facet does not qontain a piercing point.'Let R be a crossing point of I and a facet of P, If R belongs to the interior region of that facet, then R is the piercing point. If,
however, R belongs to an edge, then it may
/
be, or nay be not a piercing point. In this case, an additional analysis would be required to deternine whether R Is a piercing point. Since t Is chosen In an arbitrary way, let it be such that the Intersection of t and E^ for ial,...,|E| is an enpty set.
Consider the set of all planes that are uniquely determined by the point Z and the edges Ej, i>l,...,|Ej. Denote these planes by Oj, J'l, . . . ,k, where k :£ |E|. Determine a new plane ß which Is different from^ al-t -planes" cij. JbI, . . . ,k, and passes through Z. Further, let for« a set of half-straight lines H which lie In the plane p and all of then have, their origin in Z, in the following way. First, if E^ lies In p then the half-straight line is deternined by E^ and Z. This is an inplication of the nanner in which the planes
eij, J»i,...,k have been determined. Namely, if Z and E J do not lie on the sane straight line, then E^ and Z deternine the plane a^, Je<l,...,k>, which is in contradiction to the
statement that a. and
p are different
planes. Second, if the intersection of p and Ej Is a pont, then the half-straight line is determined by that point and the point Z. In this way, a finite set of half-straight lines H in the plane p is formed. Since the set of half-straight lines <havlng their origin In Z> in the plane p is unlimited. It comes out that it Is always possible to choose a half-straight line / in p, originated in Z such that Zen. Therefore, the procedure for choosing t can be formed in the following way.
I
procedure CHOOSE < ^ J; begin
k:=0;
for i::*! to |E{ do if < E^ and Z deternine a plane > then besln k:-k+i;
oij^;>plane <Ej,Z>; end;
PLANE
<• This procedure determines plane p, so that Z « ^ and ^ * a^, i-l,...k
for 1:»1 to |E| do begin
if < E^ lies on p ■> then begin
h -«half-straight line <E ,Z>; ^ / ond;
else if C ^ Ej 0 > then '	begin
R:-/5 n Ej. '
hj:=.halC-straight line <R,Z>; end; / end ; <»fore5 HALFLINE < / 3
Cm This procedure determines half-straight line / on with the Initial
'point Z and t ^ h^, 1-1, ...,J *>
end.
The half-straight line t determined on the basis of this procedure is such that it has no comnon points with any edge of P. If the Intersection of t and the plane which is determined by a polyhedron facet is not an
eBpt.y set, tti*n tlwir Intersactlon is a point. Now, It, Is necessary to determine whether this point belongs to the corresponding facet. Thus the task is reduced to determination of the relation between a point and simple polygon. The sane statenent holds as for the relation between a point and a polyhedron. The algorithm for datemi nati on whether a point belongs to the interior region of slnple polygon consists in the following.
Let be given a simple polygon with ordered
vertices V^, 1*1,... ,n, and the point R.
Denote with r an arbitrary half-straight line
having the origin in R. The piercing poitit of
r through the sinple polygon can be
determined in the following way. If an edge
lies on r, then there are no piercing points
on it. Suppose that r passes through the
vertex V^. If the neighbouring vertices
and V, _ are on different sides of r, then V. 1+1 1
is a piercing point. If the condition
is satisfied, then V^ is a piercing point, otherwise it is not. If r and an edge have one common point and that point is not a vertex, then It is a piercing point. In this way. the total nunber of piercing points can be determined. If this number is odd, then R belongs to the interior region of a simple polygon.
On the basis of the algorithm described, it is possible to form a procedure for determination whether there Is a piercing point of { through a facet of P. This procedure is of the form:
procedure PAGEOON < J,f,A,n,test J; begin
irCplaneCj> f^ f ■> 0 > then test: =f al se else beeln Rj:-planeCJ> p i; Rgi^VaCACl]]; AlrrH] :=Atl] ;
for i : to n do ' becln J:-VIC AC i]] ; k:-V2EACin;

prCk);-edge<Vj,Vj^> p
straight-line CR.,R_>;
1 *
end; else if<E,<.Ej^-0>
®k- -«V'^l^**^'V'
f CEj.l
beein
if CEj.Ej^<0> then
J
then
ifCEj=0> then ifCEj^fOl then begin m:-VlCAti-1]] ;
if CE. .E <0> then K m
begin fc:-k+l; prC kJ:»Vj; end; end;
else If CE^-0> then begin
i»=V2CACi+in;
ir	then
begin
k:=k+l; prCk3:-V^.; end; end; end ; C*for*> SORT <pr,k,npr>; <• npr is the number of different piercing points on the same side of R *> test:sfalse;
<» i is not piercing the
J-th facet of P *> ifCnpr odd> then test:Ktrue
C« e is piercing the J-th facet od P end ; end.
On the basis of the statement given above and the introduced procedures1t is possible to form a routine for determining whether a given point belongs to the Interior domain of a simple polyhedron. This procedure is of the form :
procedure TESTPZ C test >; begin
CHOOSE CO i for i:=-l tò |F| -do _ begin PAGE <i,A,nJ;
FACEGON Ci,A,n,test);	^
if test then pri-pr+l end ;
test; "faIs« C» Z does not. belong to tho
interior domain of P ♦> ir C pr odd > then
tost.: "true <» "Z belongs to the interior domain of P
end.
By this procedure, the following task can be solved. Determine the intervals on a strai^rht lino which belong to the interior donain of a simple polyhedron. To solve this problem, the followini; alterations of the above procedure should be nade. First, if* the point R is a cross in« point of this straight line and of an edge of a simple polyhedron, then an additional analysis should be carried out to deternine whether this point is a piercing point. This can be achieved in the following way. Let choose two points which lie on the given straight line in an arbitrarily small vicinity of the point R, and that they are
found on different sides of the point R. If one of these two points belongs to the interior and the other one to the exterior domain of the simple polyhedron, then R is a piercing point, otherwise it is not. Second, determine the piercing points and sort them out along the given straight line. These points determine the sequence of consecutive finite intervals, each odd of which belongs to the interior domain of P.
Multiple Points in a Planar Subdlvision" Infor. Processing Lett., Vol. 9, No. 4, <1977> 190-193,
13] Preparata P. F., "A Note on Locating a Set of Points in a Planar Subdivision", SIAM J. Gomput. Vol. 8, No. 4, C1979J 542-343.
£4] Preparata P. F., "A New Approach to Planar Point Location", SIAM J. Gomput. Vol. 10, No. 3, <1981> 473-482.
[SJ Klrkpatrick D., "Optimal Search in Planar Subdivision", SIAM Comput. Vol. 12, No. 1, <1983> 28-33.
[6]	Edahiro M., Kokubo I., Asano. T,. "A New Point-Location Algorithm and its Practical Ef f ic iency-Gontparison With Existing Algorithms", ACN Trans. on Graphics, Vol. 3, No. 2, <1984> 86-109.
[7]	Edelsbrunner H., Oulbas L. J., Stolfl J. , "Optimal Point Location In a Monotone Subdivision", SIAM J. Comput, Vol. IS, No. 2, C19a6> 317-340.
t8] Sarnak N. , Tarjan R. , "Planar Point Location Using Persistent Search Trees", Com. ACM Vol. 29, No. 7, <19803 609-679.
C9] Dopkin D., Llpton R. , "Kul11dimens!on Searching Problems", SIAM J. Comput. Vol. S, No. 2, C197fii 181-186.
[10] Cole R., "Searching and Storing Similar Lists", J. of Algorithms 7, C1986> 202-220.
C11] Preparata F., Shamos M., Computational Qeometry, an Introduction, Springer -- Verlag 1985.
4. Conclusion
The procedures described enable the following: the choice of a half-straight line I originated in the query point Z such that the intersection of t and all the edges of simple polyhedron P is an empty set; detern1nation whether t pierces a facet of P, On the basis of these procedures a routine is formed to determine whether Z belongs to the interior donaln of P, In addition, the other algorithm presented enables determination of the intervals on a given straight line which belong to the interior domain of P. Furthermore, the described procedures nay be used for solving other problems in computational geometry (identification of hidden lines, determination and detection of intersections, etc.>.
5. Raferences
CI] Lee T. D., Preparata P. F., "Location of Point in a Planar Subdivision and its Applications", SIAM J. Comput. Vol. 6. No. 3, <1977> 394-606. [2] Lee T D., Yang C. C,, "Location, of
ODNOS TAČKE I PROSTOG POLIEDRA
dr Du£an Surla Prlrodno-natenatlCkl fakultet Institut sa matematiku dr Ilije »uričiiia 4 Novi Sad
Sadržaj Uveden je pojam prodorne tafike prave t kroz omotae prostog poliedra. Presefina ta£ka iZmedJu t i omotata od P Je prodorna ako prava u presefinoj taCki prelazi iz spoljaSnJe o unutrašnju ili iz unutrašnje u spolJaSnJu oblast od P. Dokazano je tvrdjenjo da taCka 2 pripada unutražnjoj oblasti od P, ako proizvoljna prava t koja prolazi kroz Z sadrži sa iste strane od Z neparan broj prodornih taCaka, Na bazi ' ovog tvrdjenja formirana Je efektivna procedura za ispitivanja da li data tacka pripada unutrašnjoj oblasti od P. Koristetìi ovu proceduru opisan Je algoritam za lasdvajanju intervala na pravoj koji pripada unutrašnjoj oblasti od P.
DERIVING PROTOCOLS FROM SERVICES IN THE FINITE STATE MACHINE REPRESENTATION
UDK 519.713
Monika Kapus-Kolar Inst. Jožef Stefan, Ljubljana
A method is suggested for derivation of protocols from services, basèd entirely on . the finite state machine representation. The method provides several suggestions for human intervention in the design process and thereby for a great variety of solutions. Other benefits are parametrization, transformations for data-flow.optimization, uniform treatment of synchronous and asynchronous channels, a uniform approach to composition and decomposition of entities and thereby a uniform approach to design of services and protocols.
Izpeljav* protokolov iz servisov ob uporabi predstavitve s končnimi avtomati.
Predstavljena je metoda za avtomatsko konstrukcijo komunikacijskih protokolov za realizacijo podanega globalnega servisa, ki v celoti temelji na predstavitvi s končnimi avtomati. Metoda je zelo primerna za interaktivno delo, ki vodi v Široko paleto rešitev. Druge dobre lastnosti metode so parametrizacij a, transformacije za optimizacijo pretoka podatkov, enotna obravnava sinhronih in asinhronih kanalov in enoten pristop h kompoziciji in dekompoziciji osebkov, ki omogoća poenotenje načrtovanja servisov in protokolov.
O. Introduction
Derivation of a consmunication protocol from a given service specification is one of the most challenging problems in the field of computer networks. Two methods have been proposed so far, which we find particularly interesting, because they provide algorithms for totally automatic construction of a' suitable protocol. The method, proposed in [Prin], constructs a Petri-net type protocol specification from a finite-state machine service specification, while the method of IBochGotz] is based on ■attribute grammars. In our paper, we are proposing a method, based entirely on finite state machines, which follows the selection / resolution principle and is therefore also suitable for construction of protocols in a man-machine dialogue.
We assume that a distributed system consists of a set of entities, communicating with each other and the environment through a set of reliable two-point channels. Some of the channels are unbounded FIFOs with unknown delays (asynchronous), while the others are synchronous (the "rendez-vous" type of communication).
A global service, which a system should provide to the environment, is specified by a finite state machine G, with edges representing an asynchronous transmission or reception of a particular message by the system on a particular external channel or a synchronous external event. Paths, leading from the starting state of G, represent the characteristic sequences of system actions.
A message is a tuple of parameters, posessing explicitely or implicitely stated identifiers and values. The crucial observation about parameters is that each parameter identifier, occurring in a specification, represents a
global system variable, which is concurrently updated or read by the entities. Parameters, exchanged in an action, are its output parane-tars, while parameters, generating the values of the output parameters, are the input parameters of the action. If an action is not an asynchronous transmission, the values of its parameters can also be obtained from the environment, 5 must possess the following properties:
Property 0.1s It must not contain two nonterminal states Si and S=, such that for every outgoing edge a in Si^ leading to a state S. there is also the same outgoing edge in Ss, and vice-versa (equivalent statea) .
Property 0.2; If in a state Si, there are two paths Pi and Ps, such that the action sequence of P= is a permutation of the action sequence of Pi, respecting all causality relations of Pi (Pa is equivalent to Pi), they must both lead to the same state S=. Path equivalence is formalized in the section 1.
Property 0.3: If there is an edge, representing an action, requiring a value of a particular parameter as an input, then the edge must be preceded from any direction by some edge, generating the value.
The first step in our protocol design method is to convert G into another finite state machine Bp, which mirrors particular design decisions about parallel execution of external actions of a system, while respecting the causality relations of G. Then the states of 6f., ' requiring communication between' entities, are identified. For each such state, a procedure for exchanging messages on internal channels must be provided by a designer. Such procedures explicitate externally invisible transitions of a system, which are initially hidden in the states of G. as indicated in Gc,
an extended version of 6f>. Note also that it is execution of the internal procedures, that entities without access to external channels are used for. At that point it may turn out that the task can not be solved with the existing channels. By integrating the internal procedures into Gc, another global system behaviour specification Gi is obtained, which is data-flow optimized into 5o and subsequently used for generation of finite state machines for individual entities. ^
Two algorithms will be used extensively throughout thè paper; Algorithm 0.1, which introduces to a state machine a new path, and Algarlthffl 0.2, which deletes a particular path.
Algartthn 0.11
{create a new path P)
b«ain<Algorithm 0.1} represent P by a finite set of finite segments
(not all types of the representation might be suitable for'a particular purpose)i for every segment, which is a concatenation of an action sequence ■ and an action « and should lead from St to 8j do
bagin
i4 there is no state 6k, different from si, with a single outgoing edge, namely a, leading to 8j. or with a single incoming path, namely b, with the initial state 8i than create Bk as a new state: if in Sk, there is no outgoing edge a than create an edge a from Sk to Sa ■Im
if the edge ■ leads to a state, different
from Sj, than axit Mlth arrar; if in 8i, there is no outgoing path • than create a path s from Sx to 8k; alaa
if the path a leads to a state, different
from Sk, than aMlt with arror; merge the equivalent states and
and(Algorithm 0.1>. Algorithm 0.2i
(delete a particular path, leading from 8i to
bagin{Algorithm 0.2} for every state 8 of the path do bagin
if in 8. there are some incoming edges, not lying on the path, and also some, lying on the path, than bagin
create a state 8a, equivalent to 8; redirect the incoming edges of 8, not lying on the path, to Sa and:
for every edge e of the path do If all outgoing edges of the destination
state of a are lying on the path than delete a; delete the unconnected states; merge the equivalent states andfAlgorithm 0.2>..
1. Converting « Global Service Specification into an Equivalent Form Mith a Desired Degree of Paralialism
Service specifications in (BochGotzl use three types of composition (parallel, sequential and alternative!. This versatility makes it diffi-
cult to identify actions, which could be enabled concurrently, as in a specification, they might lie far apart.
In a finite state machine specification, parallel composition of actions is represented by various permutations of the actions, connecting the same pair of states, with parameters inducing no causality relationship between the actions. The desired degree of parallelism in a global service specification can be achieved by repeated application of Transformation l.l, which increases parallelism, and Tranaformation 1.2, which decreases it.
The idea behind Tranaformation 1.1 is that if there is a path P from a state Si to a state Sa, the two states may also be connected by all paths, equivalent to P. Px is equivalent to Pa, iff there is a path P, such that the action sequer.oes of P» and Pa can be generated from P by zero or more applications of Tran«formation 1.1. Tr«n»formation 1.1 generates equivalent paths by repeatedly selecting an action as of the current action sequence and moving it towards the start of the sequence. If in that process «a meets an action at, such that ai is a potential necessary condition for «a (Predicate 1.1). «a may not move any further. We use the word "potential", because G might negate the causality relationship between twa actions by providing an alternative path with the two actions in the reverse order.
Predicate l.li
(ai is a potential nscassary condition for as)
begin(Predicate 1.1} Predicata 1.1;-
(ai is a synchronous action or a reception and
aa is a synchronous action or a transmission)
or
ai and as are two actions on the same channe1
or
ai generates a parameter value, which is read or redefined by aa end(Predicate 1,1).
Transformation l.lt If there is a path atOa, connecting Si and 8a, and at is not a potential necessary condition for aa (Predicate 1.1), then it is possible to create (by Algorithm 0.1) a path asAi from 8i to Sa.
To achieve the highest possible degree of parallelism, Tranaformatlon 1.1 should be applied as long as possible. On the other hand, we want Gs. to be a finite state machine, but if there is a cycle C and an action a, such that it can move through the cycle for ever (as no action of the cycle is a potential necessary condition for it) and C contains at least two different actions, the set of paths, equivalent to the cycle, is infinite and can not be described by a finite state machine. Therefore Transformation 1.1 must be applied under designer's control.
If ai is not a potential necessary condition for aa, a system is free to execute the actions in the reverse order, because the environment can not observe it. Transformation 1.1 adds a path, which represents execution of the actions in the reverse order, but as the environment can not observe the existence of such a path, a designer is also free to delete it from G by Transformation 1,2.
Transformation 1.2i If there is a path a.aa from Si to Sa and a path aaai from St to Sa and as is not a potential necessary condition fora» (Predicate l.l), then it is possible (by
Algorithm 0.2) to delete the path
2. Idantl-fyin« St«tM, Which Raquir* Int«rn«l Coamunic«tion
Some states in B^ might require communication between entities. In this section we introduce Algorithm 2.2. which generates another global system specification 6c by extending G» with internal communication requirements.
Observing the actions, possible in a given state B, some of them may be enabled simultaneously and some not. Simply speaking, a set of actions may be enabled simultaneously, if they are in parallel or in exclusive composition. The next Resign step is to identify in each state 8 a set of exclusive compositions of parallel compositions of multisets of actions, possible in 8, which might be selected by a system for simultaneous enabling. Algorithm
2.1.	if not effected by designer's decisions, generates a solution with the highest possible degree of parallelism and minimal amount of internal communication, securing complete implementation of a. service. The algorithm should be called systematically from Algorithm
2.2.
Algorithm 2.11
<A1 the set of all altarnatives of a given state 8}
{Ala : the set of groupm o-t «Iternativss, which may be selected for simultaneous enabling in
bsgln{Algorithm 2.1> find A, the set of all actions possible in 8; find Al, the set of all non-empty multisets of actions in A, such that the members of each multiset are in parallel composition and lead to a final state or a state with an outgoing edge, labeled by an action a, which is not in parallel composition with _ the members^of the multiset or must not be a'dded to the multiset because of a designer's decision
{Hvembers of a multiset are in parallel composition, iff they may access their parameters simultaneously, each permutation of them is represented by an outgoing path in 8 and any two prefixes of the paths with the same multiset of actions lead to the same state}; find Ala, the set of all subsets of Al, which are maximal in respect to the following property P (for special control purposes, a designer may also decide to cover Al with ^ subsets, which do poses the property P. but are not maximal): {a subset X of a set V is maximal in respect to a property P. iff it has the property P, but can not be extended by any other members of Y without loosing the property> if all members (alternatives) of a member X of Al are enabled simultaneously, the entities, participating in their execution, are always able to select one of the alternatives without any internal communication
(the global decision is equivalent to a set of local decisions) •nd{Algorithm 2.1).
Al in Algorithm 2.1 answers the question, which actions may be enabled simultaneously, because they are in parallel composition, but one has to be . careful. First, although we wish to enable simultaneously as many actions as possible. strict application of that rule , might lead to an incomplete implementation i of a service, Second, if in a state 8, therb is a loop with all edges labeled with the same
label, it is possible to define an infinite Al, which requires careful definition of Ala and careful construction of paths in Algorithm 2.2.
The idea behind grouping of alternatives is that a global decision procedure for selecting an alternative for actual execution might to some extent be performed as a set of local decision procedures. Respecting the property minimizes the amount of internal communication and at the same time provides a solution to the problem that actions for further execution can only be discussed among entities in terms of their a priori properties (as the only a priori property of a reception is its channel, it might not be possible to distinguish between two alternatives).
If only a partial implementation of a service is required. Algorithm 2.1 is the most suitable point for human intervention. Partial implementations can be generated by definition of incomplete sets of alternatives or groups of alternatives,
Algorithm 2.2« '
begin<Algorithm 2,2) ■ ,
Qpsn;- (st'irting state of Gp];
Clo«ad:-
Gc is just the starting state of G^;
whil« not Opan-d do bagln	^
move a state Bo from Open to ClOBad; find (by Algorithm 2.1) Al(So) and Ala(SD): for each member Aa of Ale(8D) do bag in
if Aa is not the only member of Ala(So),
or special guarding is required then add to Gc a Td edge from 8» to a new state 8»
(a state is new. iff there is no state with the same name neither in Gp nor In Sc) else 8>: - 8d;
find I(Aa). the set of Input {parameters
of Ah;	i '
if I(Ao) is not empty then begin
for each meinber In of I (Ab) do	\
begin
find U(In), the set of entities, using the value of In In execution of Ao;	, -
find K(In). the set of entities, knowing the value of In and i
create an edge from Si tp a new state 8a and
else 8«:- Bi;
(create in Sa a graph G«, representing execution of Ah:} for each outgoing path, of Bb in Gi>, representing execution of one of the members of Ao, do create the same outgoing path ^in 8» in Gc (Add as few new edges as possible (Algorithm Oil), but keep paths, belonging to different groups ofj alternatives, disjoint. Do not use in G« any old state names.); find Pr{Ao), the set of all entities. ' participating In execution of Ao; for each member E of Pr(As) do ' select T(E). the set of all action sequences with a length>-0, executed by E as part of execution of Aa,! after which E might decide to abandon execution of Aa and enter a synchronization procedure	I '
{although T(E) ^s selected by a designer, it has some mandatory members: the sequences, after which E has no asynchronous transmission to execute in Aa);
find T. the set of all synchronization states of Ga
(S is a synchronization state of Ga, iff for every entity E, the projection of a path from Sa to S on the actions of E is in T(E)
find T«, a version of T, in which every member is replaced by its old name (the name of the equivalent state in Gf.) ;
■for each member Sm of Tn do begin
if Sn is not yet in Gc then add Sm to Seif not Sn in Closed then add 8« to Open end;
for each member Ss of T do begin
find its old name Sm; create in Gc a r« edge from Se to S« end •net end ;
terminal states of Ec;- terminal states of ond{Älgorithin 2.2>.
Each edge tp requires execution of a parameter distribution procedure.
Each state So, coming onto Open in Algorithm 2,2, requires a global decision, what to do next, and is therefore called a decision stats. If in Sd, there are several groups of alternatives or special guarding is necessary, then So requires execution of a decision procedure. In Gi, decision procedures are represented as trees of to edges in decision states (8i>) .
After a group of alternatives Ac has been selected and enabled, it starts executing. After executing Ab for some time, control of the participating entities is gradually transferred to a synchronization procedure. States of Sn, in which «11 the entities might enter a synchronization procedure, are called aynchro-nizstion states. In 6=, synchronization procedures are represented as Tb edges in synchronization states (Ss). With the help of a synchronization procedure, a system synchronizes to a state Sm of Gp, which corresponds to the currently active synchronization state.
The aim of firing a synchronization procedure after successful execution of one of the enabled alternatives is distribution of the knowledge that the actions, guarded by the alternative, are now enabled. The aim of firing a synchronization procedure before successful execution of any of the enabled alternatives is resynchronization of a system, after which another group of alternatives may be selected. This might be necessary, „if the environment is not forcing the same group of alternatives as the system and does not cooperate promptly.
To minimize the amount of internal communication, an entity should fire a synchronization procedure only when it has no other action to execute without cooperation of the environment, but in principle, a designer might also define some additional synchronization states. When entering a synchronization procedure, the entity does not know, which of the enabled actions have already been executed by other entities. Therefore definition of synchronization states should be consistent, as stated in Algorithm 2.2.
If in a decision state, there are several groups of alternatives and a system is trying to execute one of them by repeatedly selecting a group, trying for some time to execute it and (if not successful) resynchronizing, some actions are enabled infinitely often, but not all the time. If the pending actions are synchronous, this is a degradation of fairness
of the system, which is due to a particular distribution of external channels among the entities.
3. A General Design Method -for Internal Procedures
The next task is to construct a finite state machine Qo by integrating into Gc the necessary internal procedures. Go should represent the total behaviour of a system in a concise style, similar to that of Gp.
In Gc, all actions are on external channels, which have two end-points, but are observed only from the side of the system, while the actions, constituting internal procedures, are on internal channels with both end-points within the system. An action on an asynchronous internal channel actually consists of two events; transmission of a message and reception of the message, To retain the specification style of Gp, all actions should be represented in So as single events and their granularity should not become apparent before projecting Go onto individual entities.
Let's ignore for a moment the external actions of a system and concentrate on its internal actions - the protocol. We argue that a general purpose protocol should be spaci-fied by a single deterministic finite State machine P, representing the characteristic sequences of message transmissions and synchronous events. In that way, a designer is forced to concentrate entirely on inter-entity causality relations of the protocol and not to rely upon intra-entity causality relations, which should be treated as implementation details. The approach is a direct application of the "empty medium abstraction" heuristic, which has proved to be useful for protocol verification, to protocol synthesis.
Specifications of individual entities can be genei'ated from a global protocol specification P by Algorithm 3.1 and Trans+ormations 1.1 and 1.2. Algorithm 3.1 projects P on one of the entities E. so that all actions on its incoming channels become receptions. Then Transformations 1.1 and 1.2 are applied to specifications of individual entities to obtain the desired degree of intra-entity parallelism. In the two transformations, E represents a system, and the entities, cooperating with it, represent its environment.
Algorithm 3.1:
{projecting a global protocol specification P onto an individual entity E>
begin(Algorithm 3.1} while applicable do begin
i-f there is an edge from Bi to Sa, labeled by an action on a channel, which is not connected to E or
there is an edge a from 8 to Si and an edge a from S to Sa then merge Si and Ss into a single state: i-f there are two or more a edges from Bi to
than replace them by a single a edge end
and{Algorithm 3.IK
Application of Transformations 1.1 and 1.2
might result in several different sets of individual entity specifications. But this ambiguity of a global protocol specification P is not a deficiency of the specification method; As delays of all asynchronous channels are totally ' fiknown. the sets can not be
distinguished by observing the entities for a finite period of time, hence the ambiguity is immaterial and any attempt to remove it (by explicitly mentioning asynchronous receptions in the global state machine or by specifying the protoco^l by a set of local state machines) is an QverspBCification and should be avoided.
The basic problem in protocol synthesis is to avoid deadlocks, unspecified receptions and unspecified parameters. When designing a global protocol specification of our type, those design errors can be avoided by respecting five simple common sense Rules 3.1 to 3.5.
Considering only the basic semantics of a state machine, each node represents an exclusive composition of the outgoing paths, but in protocol specification, there is also another, equally important type of composition - the parallel composition of actions. Parallel composition of actions can be described by exclusive composition of their permutations, but this mental task is not trivial enough to be carried out subconsciously. A potential deadlock or an unspecified reception occurs whenever some actions are in parallel composition by the' nature of the system architecture, but that fact is not properly described by a state machine, usually because a designer is not aware of the existence of the parallel composition.
Rules '3.1 and 3.2 define paths, which must mandatory be specified, while Rules 3.3 to 3.5 define some mandatory properties of the specified paths.
Rule 3.li If A is a subset of actions, which are labels of the outgoing edges of a state S, such that every entity participates in execution of at most one memtoer of A (an asynchronous transmission has one participant, the sender, and a synchronous action has two participants) - the actions are in parallel composition^ then every permutation of the members of A must fee represented by an outgoing path of B, as no entity is allowed to make any assumptions about execution of the actions of other entities, which it is not guarding. In the case of parametrization, any two actions, possible in a state 8, on different channels, which are not both synchronous, must also be considered as in parallel composition and obey RuIb 3.1, although the actions share a participant. This is to guarantee the soundness of Rule 3.3.
Rule 3.2i If in a state 3, there is an outgoing path AiAa and, by Rule 3.1, an outgoing edge a^ must not be created in S without creating an outgoing path Ssai, then the path must actually exist.
Rule 3.3i If in a- state S, there are two outgoing paths with the same multiset of actions M, such that no two different members of H belong to the same channel and no two different synchronous members of M share both participants, then the two paths must lead to the same state, as no entity can communicate to the rest of the system any information about the order, in which it has executed the actions of M,
Rule 3.4« Projection onto any entity must have Property 0.3.
Rule 3.5» If two actions are in parallel composition and one of them is generating a value of a parameter, then the other must neither- read nor redefine the value.
Formal proof of the rules if outside the scope of the paper. Intuitively, they prevent unspecified receptions, because receptions are hidden in transmissions, they prevent deadlocks.
because there is no state without' transmissions and they guarantee coordinated progress oi an participating entities, because any assumptions about non-existing information exchanges are avoided.
Returning to our original task, we point out that the initial service specification 6 for a system a under design should be obtained by the same method. S should be considered as an entity of a wider closed system W, consisting of S and the relevant entities, external to S. A designer should first specify a "protocol" for the system M, so that he is forced to think about implications of communication on the channels. connecting entities, external to S, on the service requirements for S. Then G can be generated by fllgorithm 3.1.
As suggested in the section 4. the method should also be used for design of internal procedures, introduced by Gc.
4. Design o-f Parameter Distribution, Decision And Synchronižation Procedures
In the section 2, we have defined three types of internal procedures: pa'rameter distribution procedures, decision procedures and synchronization procedures. The nature bf a prbtocol Is mainly determined by decision procedures, while procedures of the other two types only play an auxiliary role. In our method, design of internal procedures and their integration is guided by eight basic heuristics:
Heuristic 4.1i Initially, each internal procedure should appear m the specification separated from the others. Message merging is subject to the final optimization (section 5).
Heuristic 4.2: An internal procedure should initially be scheduled just before its results are necessary. Earlier scheduling is subject to the final optimization.
In particular, .parameter distribution procedures are inserted in Be instead of Xp edges. Decision procedures are inserted in Gc instead of Td trees, so that the starting state of a procedure is located at the root and its terminal states at the leaves of a tree. For synchronization procedures, the simplest kind of their integration into Bt= is a bit more complicated and will be discussed later. The place for their integration is indicated by Ta edges.
Heuristic 4.3; To prevent harmful re-ordering of messages, belonging to various internal procedures, during their transport, all participants of an internal procedure must agree on its termination, so that the internal procedures can be treated as atomic. Note that this 13 a general solution to the problem, described in the section 3.3 of (BochGotz] . If some of the messages are redundant, they can be deleted in the final optimization, which might sometimes result in the solution from [BochGotzl.
Heuristic 4.4: The main point in^design of an internal procedure is to determine for each of its terminal states T the synchronization set Sy(T), the set of all entities, which must know that the system will progress through T. As at that point of design, internal procedures' are scheduled just in time, the members of a synchronization set Sy(T) are exactly the entities, executing the actions, possible in T. When the participants of an internal procedure have reached an agreement on its termination (which is in a terminal state T), the members of Sv(T) must know, that the execution has terminated in T.
74
Heuristic 4.5i As the basic aim of an internal procedure is to lead a aystem to a particular state, it should be designed as an exchange of proposals about the terminal state, which the procedure should reach, and sets of terminal states, suggested by various participants, should be explicitly visible in the messages, so that the terminal state, which a path is leading to, can be calculated as an intersection of the sets, exchanged along the path. Beside that, terminal states must appear, in the messages with the same names as in Gc. If the requirements are too rigorous, they can be overcome in the final optimization.
Heuristic 4.6i If an Internal procedure is a parameter distribution procedure, it must communicate the necessary parameter values from the members of the relevant K sets to the members of the relevant U sets (see Algorithm 2.2} .
Heuristic 4.7i We-require that internal procedures are provided by a designer (in the spirit of the section 3). but this is not a serious drawback for the automatization of the protocol design process, as in practice, decision procedures, and even more procedures of the other two types, are drawn from a small set of types, which can be pre-constructed and used with suitable parameters, whenever necessary. An internal procedure must respect Rules 3.1 to 3.3, where Rule 3.4 must be checked in regard to the rest of the system specification.
Internal procedures can not be designed in an optional order. The algorithm is the following:
1.	Determine synchronization sets of parameter distribution procedures and design the procedures .
2.	Determine synchronization sets of decision procedures and design the procedures,
3.	Determine synchronization sets of synchronization procedures and design the procedures.
Now we are ready to define an algorithm for integrating into Gc a synchronization procedure. Observing a graph Ga, generated by Algorithm 2.2, it is not sufficient to replace by some procedures the ta edges in 'its synchronization states. The whole G«, together with its ts edges, must be replaced by a graph Gs (the starting state of Gs is the starting state of Ga, the terminal states of Ga are those, pointed to by Ts edges), concisely representing the action sequences of the expression:

(».P(»)) )
The expression has the following meaning: For each member » of a T(E), design an internal procedure P(«), put s and P(b) into sequential composition, put the expressions, belonging to various members of T(E), into or composition, then put the expressions, belonging to various member of Pr(AiB), into parallel composition.
With other words: each entity E, participating in execution of an As, executes an action sequence s, mandatory followed by a procedure P(»), which distributes the knowledge of E about N, the set of the possible terminal states of Go, as known by E after execution of s, to the members of the union of the synchronization sets of those states, M is a member of N, iff in 6a, there is a synchronization state 8, connected with M by a Ts edge, reachable from the starting state of Ga by a path, whose projection onto E is a.
The terminal state T, to which a path of Ga should lead, can be determined from the path by Heuristic 4.S. The requirements of Heuristics 4.3 and 4.4 must be fulfilled on Gs as a whole. It turns out. that it is sufficient to fulfil
Heuristic 4.4 for each P(s), but for Heuristic 4,3 that might not be true. Hence, it as necessary to -"blow" each terminal state T of Gs into a termination agreement procedure for all entities, participating in Gb. Procedures in all terminal states of Gs must be the same.
The principles, used in the design of Gs, lead to another heuristic for construction of internal procedures;
Heuristic 4.8e The first step in design of an internal procedure is to identify the knowledge, which is to be communicated. For each piece of knowledge (which might be a parameter value or a set of suggested terminal states), construct a procedure, which conveys the knowledge from its source to its destination. Put all such procedures into parallel composition and finally put the resulting procedure into sequential composition with a termination agreement procedure for all potential participants .
The result of the integration of internal procedures into Gc is a finite state machine, which might have some equivalent states, that have to be merged. Beside that. it might be necessary to introduce some new paths, required by Rules 3.1 and 3.2, As shown in the section 5, the resulting machine Gi is further optimized into Go.
5. Final Optimisation o-f a Global Service Provider Specificatlon
Final optimization is performed by application of Trans-formations 3.1 to 5.4. The transformations address Rules 3.1 to 3.3, which use the notion of an action participant. The external actions of a system (those from the initial service specification) must be treated as internal actions of particular entities, which are their only participants. The transformations may only be applied, if they do not change the order of external actions.
Trans-formation 5.ii If Rules 3.3 to 3.S are	not
violated, then it is possible to introduce	(by
Algorithm 0.1) a particular path and	all paths, required by Rules 3.1 and 3.2.
The transformation could be used for increasing parallelism or for moving scheduling points of internal procedures.
Transformation 5.2: Let 0 be the set of outgoing edges of a state S, Identify P{0), the set of all paths, mandatory in S by Rule 3.1. Suppose that a member a of 0 is removed from 3. Identify P(G\a). If Rules 3.4 and 3.5 are not violated, then it is possible to delete (by ftlqarithm 0.2) the members of P(0) and introduce to S (by Algorithm 0.1) the members of P(Q\a) and all paths, required by Rules 3.1 and 3.2.
The transformation could be used for decreasing parallelism or for deleting redundant actions.
Transformation S.3> If Rules 3.1 to 3.5 are not
violated. it is possible to apply a particular change of edge labels and merge the resulting equivalent states and edges.
The transformation could be used for decreasing the number of message types or for the final naming of messages.
Transformation S.4| If Rules 3.3 to 3.S are not
violated, then it is possible to replace (by Algorithms 0.1 and 0.2) a path between a pair of states by another path between the same pair of states and then add (by Algorithm 0.1) all
Pisths, required by Rules 3.1 and 3.2.
The transformation could be used for changing the order of actions or for merging of actions (messages).
Whenever possible, the transformations should be applied to such parts of a finite state machine, that Rules 3.1 and 3.2 do not induce any new paths or their destination states are determined by Rul» 3.3, For instance, if an action is executed without knowing, if it will be necessary at all roptimistic scheduling, introduced e.g. by Tr«n«farmation 3.1). the destination state of a new path p t. which includes an unnecessary execution of the action, must be provided by a designer. The suggested heuristic is to direct pt to the same state as p3. which consists of the same sequence of actions as pi. except that the unnecessary action is deleted.
Ò. Conclusions
riety of solutions. Other benefits are para-metrization. transformations for data-flow optimization, uniform treatment of synchronous and asynchronous channels, a uniform approach to composition and decomposition of entities and thereby a uniform approach to design of services and protocols. Similar conclusions can be drawn when comparing our method to (Prinl.
If necessary, the design process can be fully automatized. The only condition is existence of parametrized transport procedures and termination agreement procedures and of some rules, which prevent the process from construction of infinite machines.
Re-ferences
[Prinl Prinoth.R.: "An Algorithm to Construct Distributed Systems from State-Machines". in Sunshine.C-fed.): "Protocol Specification. Testing. and Verification", pp.261-282. North-Holland. 1982
In comparison to [BochGotzi . which generates an unique solution, our method provides several suggestions for human intervention in the design process and thereby for a greater va-
(BochGotzl Bochman.G.v.. Gotzhein.R.: "Deriving Protocol Specifications from Service Specifications", Proceedings of the ACM SIGCÓM Symposium. po. 148-156. 1986
CONTROL FLOWCHARTING FOR DATA DRIVEN SYSTEMS
INFORMATICA 2/88
UDK 681.326
A. Milton Jenkins Graduate School of Business Indiana University
John V. Carlis Department of Computer Science University of Minnesota
Keywords and Phrases:
Control Flowcharting, System Flowcharting, Dual charts. Data Driven, Control Framework.
CR categories: 1.3, 2.2, 2.43, 4.0
ABSTRACT
This paper describes an alternative approach to system flowcharting based on the use of dual charts — one illustrating data flows, the other controls. The use of both system flowcharts and control flowcharts enhance the communicative power of systems documentation.
A s^cific system flowcharting technique is described. This type of system flowchart is the datum for the construction of a control flowchart. A control framework is developed and a conceptual model presented to provide a perspective for the novice system designer. Guidelines are given for the development of a control flowchart.
INTRODUCTION
A control flowchart is a document generated during the design phase of thè system development life cycle. The control flowcharting technique described in this paper is predicated on two other system analysis and design concepts. The first, a system flowcharting technique, described in section two of this paper, is Chapin's "sandwich" concept utilizing ANSI Standards [4]. The second, a two dimensional system control framework, described in section three, is based on Gordon B. Davis's Data Flow Model (Computer Data Processing [63). These two concepts are mutually supportive, and each has been utilized (in various forms) both in practice and as an educational device.
controls are often specified on the system flowchart, producing a complex, cluttered, and sometimes confusing document (see Figure 1.) Yet the increasing, emphasis on controls and the increasing complexity of systems (including user-developed systems) makes the need for clear documentation increasingly important. Toward this end, in 1971, Robert I. Benjamin [3] suggested an alternative approach in flowcharting; using dual charts — one illustrating data flows and the other depicting controls, but nothing has since appeared in the literature developing this basic concept. Section four presents Benjamin's cbncept and develops a detailed technique for control flowcharting. (The
CPIElViTt IJITClt Com«. TiaVt unN I.TCH CSITPĐI.. IIA-^ill WAt rn PURI ni;hici, ^ kihiwciju.
mill.
YlSWUt CHECC m nteSlHC TWiTA tllM
U» un or PKDEF PKDCtSSIW; toe Itrro tETFUhBKINC
Figure 1
technique is easily integrated into life cycle methodologies and can be quickly used by both systems professionals and end-users. The basic objective of the ■ systems control flowchart is to show clearly what controls exist and where they exist in a given system.
systems flowcharting technique
There are two major and different approaches to system flowcharting. The first, and most widely used in the United States, is linked to program flowcharting both in the symboloOT utilized and in the logic-flow convention —■ top to bottom and left to right and is often referred to in the literature as "computer" flowcharting. The second has its origins in the systems and industrial and mechanical engineering fields, and illustrates work flows, document flows, and process and procedural flows [1 & 12]. Generally it utilizes a version of the A.S.M.E, (American Society of Mechanical Engineers) symbols and a horizontal (left to right) logic-flow convention.
The .technique presented here is a specific version of the "computer" flowcharting. As described by Chapin in his tutorial [4], it has two major facets; the sandwich concept and the use of American National Standard Flowchart Symbols [2].
The Sandwich Concept
Chapin describes the basic format of the sandwich rule as a flowchart composed of alternating layers of data identifications [the bread] and process identifications [the filling] as illustrated in Figure 2. The system flowchart is composed of a series of similar relationships where the output of one process may serve as the input to a subsequent process, thus representing the data-flow through the system. The system flowchart always begins and ends with symbols representing data.
ANSI symbols, definitions and usages are applicable to the system flowcharting technique presented here with two exceptions: the preparation symbol and the decision symbol. While essential for program flowcharting, these symbols are not used in system flowcharting.
It is important to note that the adoption of ANSI standards does not prohibit the use of symbols not described within that standard. The use of other specialized symbols may be extremely effective in enhancing the communicative power of the flowchart.
THE CONTROL FRAMEWORK
Numerous control frameworks have been suggested for EDP, MIS and DSS systems [5, 8, 9, 11 and 12]. Each is structured around a particular function or perspective, e.g., administration, auditing, designing, operations, organizational structure, or function.
/ Machint / Readablt fOata (Input)
FIGURE 2: THE SANDWICH CONCEPT
This technique enhances the communicativé power of the system flowchart. It is most effective In flowcharting a data-driven system — a system where data physically exists on media outside the process, e.g., on magnetic tape, disks, documents, etc. throughout. the system. Most traditional data processing systems and many user-developed systems fall into this category. The technique becomes more difficult to use in an event-driven (program-driven) system, e.g., where the data is temporarily stored in core within a process or between programs. In this situation the level (macro/micro) of system detail desired will influence the effectiveness of the sandwich concept.
Using ANSI Standards
The need for a standard set of symbols and a standard for their usage in system and program flowcharting has long been recognized by the industry. The ANSI Standards [2] were developed specifically to fill that need. The
Output
Figure 3: The Basic Data Processing Cycle-
CORPORATE ORfiANIZATION, MWWQEHENT AND POLICY
HARKETWG DIVISION PROCEDURES
•A" DEPT. HMAGEMENT
"X- DEPT. MUMGEMENT
HIS DIVISION PROCEDURES
OATA PROCESSING DEPARTMENT MANAGEMENT
KEYPUNCH SECTION SUPERVISION
COMPUTER		OUTPUT
ROOM		SECTION
SUPERVI-		SUPERVL
SION		SION
COHPUTEf		
SYSTEM		
O.S.		
■6" DEPT. MANAGEMENT
Figure 4: The Organizational Control Structure
Our design definition orientation leads to the selection of a control framework similar to that suggested by Gordon B. Davis [€]• It is structured around two major system aspects, the procedural and processing controls applied to the data flowing within the basic data processing cycle, and the organizational controls applied to the application system.
The Flow of Data
The flow of data through the data processing cycle is an important focus for the system professional and user designer (see Figure 3.) Each of the processes (manual or computer) depicted here provides a vehicle for implementing controls. The controls are specified within the procedures in a manual process and within the programs in a computerized process. The data representations may also function as control vehicles.
Organizational Controls
The system professional and user-designer must function within an organizational structure which imposes many control procedures. Figure 4 illustrates a typical organizational control structure. This structure, reflecting the organization, has a hierarchical character. At the lowest level, any computer data processing must be done within a computer system. The hardware and software which comprise that system have control elements, e.g., the operating system. Data will flow through several sections within the Data Processing Department. Typically each of these sections has its own control procedures and standards. These will operate within the controls for the data processing function established by the data processing department management.
In a similar fashion the systems designer must work within controls established by various departments, through which the flow of data will occur. These departmental controls operate within the framework established by division procedures, which in turn operate
Figure 5; The Contrai Framework for a Data Processing System
I
/ Machine / Readible (Data (Input)
Division •T" COHTMLS
I User / OEPARTMEIfT ^nfonw-/ COHTIlOtS tfon /
DIVISION •Z* COKTROLS
within the corporate policy. Since most application systems cross department and division boundaries, .the designer must be cognizant of the various organizational controls which influence the system design.
Integrated Control Model
The integration of the data flow and the organizational controls is depicted in Figure 5. This control framework provides the designer with one framework which is valid through all the phases of the system development life cycle — from information analysis to post audit. It illustrates which control environments are applicable to each process in the data processing flow. Consequently it is an essential perspective during the design phase, when the designer must specify what controls are to be used and where they are to be applied in the system. This framework underlies the flowcharting techniques described in this paper.
CONTROL FLOWCHARTING
The control flowcharting technique described in this section follows Benjamin's [3] suggestion "that two flow diagrams that present the same process for a system be developed during the design phase — one for the data flow and one for the control flow." It is predicated on the existence of a system flowchart. While this is a reasonable assumption, it is important to recognize that flowcharts are developed in an iterative fashion. The desired controls may necessitate changes in the original system flowchart. This section provides a description of how a typical control flowchart is developed.
The system flowchart from which the control flowchart will be developed is shown in Figure 6 (which illustrates the same application shown in Figure 1). It has been developed using the technique described in
	
	\rzLt
UFDATI	
KAsrn	
nit	
mol tcratt		dailt tetum
		

Figure 6
Figure 7; The Unnarrated Flowchart
section two. While constructing the flowchart, after drawing the symbols and flowlines but before narrating the symbols, a photocopy of the flowchart should be made (see Figure 7). This vacant chart provides the physical starting point for the control flowchart, Figure 8. The guidelines that follow implicitly refer to Figures 6 and 8.
Locations for Controls
Working with the system flowchart as a reference, the analyst can begin to specify on the unnarrated copy what controls are appropriate to the system and where they should be located. What set of controls is chosen depends upon the control framework of the organization as discussed in section three, the equipnent which will be used, and the particular system being designed. It is
XHEILAR IAT» qOimDU nCIEI VIrt WTOi T01U.É iAfH IPIH « T«IT iW» ie1, fihucul wt«»
OKiicl omoi. vnuj at, anali lluniMli-Misa caM* m ffJimlTT taait HUJI un yttl-
Uli owuctos !■
Figure 8
beyond the scope of this paper to prescribe the appropriate set of controls for a given system. We provide guidelines for the construction of a control flowchart.
Process Symbols
There are three major types of locations where controls can be supplied. First, a process symbol should be used to specify controls created or exercised during that process. For example, a batch control ticket is created during the order assembly procedure to identify the batch of transactions and also contain control data for later use by both the édit and update procedures. The operation but not the control data itself is described on the unnarrated flowchart in the symbol corresponding to the order assembly procedure of the systems flowchart. The annotation symbol is a convenient documentation aid when the description is too long to easily fit into the process (or medium) symbol.
If no controls are exercised over a process, as in the case of the sort by item number of the CSO daily file, that process symbol remains vacant on the control flowchart. This absence of controls, highlighted by dual charts, is obscured when using only one chart as in Figure 1.
There may be come redundancy between the dual charts. For example, verifying, which is considered both a significant manual (system) process and a control process, appears in both charts.
Control data should not be attached to a process symbol. The operations performed on the control data are described in the process symbol while the control data itself is appended to a medium symbol.
Media Symbols
The second type of location, a data or medium symbol, is used to specify control fields or records that accompany the data and any physical controls applied to the physical medium. To illustrate, no special control fields are implemented on the annotated customer sales orders themselves. As with a process symbol, a blank symbol indicates the absence of controls. However, a control record (the batch control ticket) is added and requires some minor redrawing of the unnarrated chart.
The control flowchart describes the flow and form of control data. The batch control ticket appears in three forms, paper, card, and tape record, and is retained in offline storage. Each change in form requires the intervening process to perform the transformation.
Media are passive, and therefore operations performed on the control data should not appear within the media symbols. It is improper to show a check digit test performed by a floppy disk. Physical controls of the medium itself, not the data it constrains, should appear on the media symbol, A common example is the removal of the write protect ring on a tape.
Flow Lines
The third type of location where controls may be specified is on a flowline. The annotation symbol is used'to specify procedural controls associated with the flow of data. The customer sales order deck, after verifying, is input to the edit process. However, it must be physically transported from the keypunch room to the computer room. A movement or change in responsibility may require some control. In this case it is a logging procedure.
A procedural control should be attached to the flowline leaving, not entering, a
medium. This assures the proper association of that procedural control with the medium not the preceding process. This convention is consistent with controlling the flow of data across organizational boundaries as discussed in section three.
Advantages
Keith R- London, in his text, "Documentation Standards," [10] presents the purpose of documentation in the context of four categories: 1) Inter-task/phase communication, 2) Quality control and project control, 3) Historical reference, and 4) Instruction reference. This context is used to structure the following discussion of the advantages o£ dual flowcharting.
The first category involves facilitating communication within the project team throughout all phases of the systems development life cycle. Given that most project teams have an interdisciplinary composition, some team members will not be familiar with systems techniques and concepts. Consequently, the use of both system and control flowcharts, which simplify the "picture" of flows and of controls, enhances communication. By factoring the control activities from system flow diagram either element may be examined unencumbered by the detail of the other. However, if both elements are of simultaneous interest, the examination and integration of both flowcharts is greatly enhanced by the symbology and flow common to both elements.
London's second category refers to two dimensions of control - control over the quality of the system being designed and control over the process of designing that system. Dual flowcharting aids both of these dimensions.
First, the dual charting technique provides a document exclusively devoted to the controls exercised within the system. Its function is to answer the what, where, and how questions concerning the system's controls. Further, the control flowchart clearly illustrates where controls are not applied in the system. The control flowchart further facilitates quality control by providing a vehicle for communication with interested parties outside of the project team. For example, the control flowchart can be used as a signoff document for internal auditors, the operations manager, and others within the firm with a responsibility for reviewing and approving controls. In the case of user-developed systems the document greatly facilitates communications between the user and system professionals.
The control flowchart enhances project control by enhancing the effectiveness of the documentation package. With better project documentation performance criteria can be better determined at each step in the development cycle. Since performance criteria are the basis of controls, project control is thus facilitated by the use of dual flowcharts .
The third category, historical reference, is used both for modifying existing systems and for developing similar systems. Again, the better the system documentation communicates the essential characteristics of the system the more valuable it becomes in both of these functions. Given the dynamic environment within which systems function, and the iterative nature of systems development, a clear and concise representation of both the flow of data and the application of controls is essential. The development of these charts can greatly reduce the effort required in designing new systems by providing a clear picture of acceptable levels of controls at various stages in the flow and in various
departments within the firm. Finally because of the greater simplicity of both the system and control flowcharts, the task of modifying this documentation is greatly reduced.
The final category used to discuss the purposes of documentation is instructional reference. The dual charting technique is especially valuable when serving a general instruction function of communication between the systems specialists and the nonspecial-ists - particularly the user. Improved user understanding of the flows and controls within the system not only supports better user relations but goes far in enabling the user to use the system intelligently.
Another instructional function enhanced by dual flowcharting is training within the systems group. Easily understood and correct examples of good system design are extremely valuable in training new employees. These flowcharts facilitate rapid understanding of control practices within the systems area and, of equal importance, illustrate the control required in many other functional areas within the organization. Finally, they clearly illustrate to the novice system builder the value and need for good systems documentation.
CONCLUSION
Both the system flowchart and the control flowchart provide important techniques for the systems professional and user-designer in preparing and documenting systems work. Carefully prepared, these flowcharts will enhance the rigor with which the designer can think through the system. This, in turn, typically reduces the errors and increases efficiency within the system. The major contribution of the dual charting technique, however, is its ability to improve the communication of essential aspects of a data-driven application system between interested parties both within and outside of the systems area.
This technique has been used for several years in an educational setting and has proven very valuable to both students and teachers. Little evidence exists on its value in a business data processing environment. However, with the increasing emphasis on good systems documentation and the expanding use of this documentation to communicate with people outside of the systems department, it appears that the advantages offered by the technique are powerful arguments for its use.
BIBLIOGRAPHY
1.	Allen, George R., et aj,., editors.
Business Systems, Cleveland, Ohio: Association for Systems Management, 1970.
2.	ANSI, American National Standards
Institute, American National Standards Flowchart Symbols and Their Usage In Information Processing, New York: 1971, (Docijment . No. X3.S-1970), Sponsored by the Business Equipment Manufacturers Association.
3.	Benjamin, Robert I., Control of the
Information System Development Cycle, New York: wileylnterscience, a Division of John Wiley and Sons, Inc., 1971, pages 52 and 53.
4.	Chapin, Ned, "Flowcharting With ANSI
Standard: A Tutorial," Computing Surveys (2:2), June 1970, pages 119-146.
5.	Condon, Robert J., Data Processing
Systems Analysis and Design. Reston, Virginia:	Reston Publishing
Company, 1973,
6.	Davis, Gordon B., Computer Data Pro-
cessina, second Edition, New York: McGraw-Hill Book Company, 1973.
7.	Haga, Clifford I., "Procedures Manuals,"
Ideas for Management, Systems and Procedures Association, Cleveland, Ohio, 1968, pages 127-1S4.
8.	Jancura, Elise G,, Audit and Control of
Computer Systems, New York: Petrooelli/Charter, 1974.
9.	Jancura, Elise G. and Arnold H. Berger,
editors, Computers ; Auditing and Control, New Yorks Auerbach Publishers, Inc., 1973.
10. Keith R. London, Doetimentation Standards, Revised Edition, Petrocelli Books, New York, 1974.
11.	Sharratt, John Richard, Data Control
Guidelines. Manchester, England: NCC Publications, 1974.
12.	Skinner, R.M. and R.J. Anderson,
Analytical Auditing, Toronto: Sir Isaac Pitman (Canada), Ltd., 1966.
PARALELNI VEKTORSKI PROCESORJI IN NJIHOVA UPORABA I
INFORMATICA 2/88
UDK 681.3.019
P. KoJbezen S. Mavrič B. Mihovilovič Institut »Jožef Stefan«
POVZETEK, Veil je raCunalniSke zmogljivosti, ki jih zahtevajo danaSnje aplikacije na podroCju raspoznavanja govor«, procooiranja slik, umetne inteligence in äe vrsta drugih, je mogoCe iskati le v novih paralelnih sistemih, pri katerih Je znatno večja utiinkovitost moina le e posebno organizacijo materialna opreme, predvsem v VLSI procesoroki tehnologiji, in takSni opremi prilagojeno programsko opremo.
Proiivajalci računalniških sistemov, ki načrtujejo zmogljive vektorske operacije, bo izdelali posebne FORTRAN - prevajalnike. Ti razpoznavajo, Ce se zanka DD lahko zamenja z eno ali veC vektorskimi inotrukcijami. Zanka DO jo tako ekalarna predstavitev množice vektorskih operacij in zato more biti izvedena mnogo bolj učinkovito s strojnimi instrukcijami, ki so v takäne namene posebej načrtovane in se zato njihove operacije učinkoviteje izvajajo. Arhitekturne odlike se kažejo z uporabo bodisi cevanih aritmetičnih enot, kot jih sreCujemo pri cevanih vektorskih računalnikih (npr. pri CYBER 205 ali CRAV-l), bodisi z večkratnimi procesnimi elementi, kot Jih sreCuJemo pri vektorskem aritmetičnem multiprocesorju (VAMP) ali pri matričnih procesorjih (npr. pri ICL DAP, BSP ali TRANSPUTER ARRAY).
Paralelne vektorske organizacije in podrobnejöi opis strukture računalnika ICL DAP so predmet obravnave prvega dela prispevka. Prikazan je tudi stil programiranja, ki ga narekuje takSen računalnik.	,
V Članku je prikazano, kako so lastnosti paralelnih implementacij sekvenCnih kodov na materialni opremi paralelnega sistema pomembne, da prirojeni paralelizem v problemu, ki ga refiujemo, ustreza uporabljeni arhitekturi oziroma organizaciji raCunalniSke materialne opreme,
ABSTRACT. PARALLEL ARRAY PROCESSORS AND ITS APPLICATION, I. With new applications such as robotics, speech recognition, arti-ficial intelligence, image processing, etc., the need for faster processing devices becomes more important. The only solution is to employ the new parallel systems. More efficient systems can be attained in first of all with the new VLSI technology and with new organizations of computer hardware and to these ones appropriate software.
All manufacturer of computers designed for efficient operations on vector« of numbers have produced FORTRAN compilers that recognise whan a DO loop can bo raplaced by one or eoveral vector instructions. This is the recognition by software that a particular DO loop is just the scalar representation of a set ra-f vector operation, and can therefore be ewecuted much more „offectively by machine instructions that are especially engineered to perform such operations efficiently. The architectural features used are then either the pipelined arithmetic units in the case o-f pipelined vector computers (e.g. CYBER 203 and ÉRAY-l) or the replicated prnceesing elements in the case of vector arithmetic multiprocesor (VAMP) or processor array (e.g. ICL DAP, SSP and TRANSPUTER ARRAY),
In the present first part of this work the basic categories of parallel computers will be described and the detailed structure of ICL DAP will be given in more detail, together with an overview of the programming style that is requires. In the second part the optimization algorithm based on matri>i computations as » tool for implementations o-f application algorithms will be continued.
In the paper is shown that the behaviour of parallel implementations of sequential codes on parlici hardware depends critically on a careful match between the innate parallelism of the problems, the algorithm and the hardware.
1. UVOD
izkariätati prednosti novih konceptov nih sistsmov.
paralel-
Zadnja leta so tehnolottke spremembe in fitevilne inovacije dramatiCno zmanjšale ceno raCunalnitì-kih zmogljivosti. Na vsakih nekaj let so se pojavili bistveno hitrejfii in cenejSi procesorji. Posebne prednosti teh procesorjev so »o priCele kazati pri sestavljanju le-teh v različne paralelne sisteme. Sistemski programerji so dobili spodbudo, da so modificirali obstoje-■Co sistemsko programsko-opremo enoprocesorskih sistemov tako, da Je z njo mogoCe kar najbolje
Poskuttali bomo kolikor mogoCe jasno predstaviti rezultate na novejfiem podroCju raCunalniSkih arhitektur oziroma organizacij, ki naj bi pokazali bistvene prednosti rabe paralelnih procesorskih sistemov. Prednosti se kaSeJo v novih aplikacijah, kot so robotika, razpoznavanje govora, umetna inteligenca, procesiranje slik itd., kjer so postale zahteve po hitrejflih procesorskih napravah vsa pomembnejtìe. Zaradi zaporednega izvajanja instrukcij, ki se izvaja-
jo druga za drugo, Je razvoj sekvenCnih arhitektur skoraj obstal, ftefiitve Je nadalje mogoCe iskati lo v novih paralelnih sistemih, pri katerih je snatno ved! j a učinkovitost mofrta le s posebno organizacijo materialne in njej ne programske opreme.
Polpretekla zgodovina kaSe, da je paralelizem mogoCe vpeljati na različnih nivojih, ki jih lahko opredelimo takale:
1.	Nivo posl'a
i.1., med posli ;
1.2., med fazami poBlaj
2.	Programski nivo
2.1., med deli programa}
2.2., znotraj DO sank;
3.	Instrukcijski nivo
3.1., me^ -fazami izvajanja instrukcije)
4.	Aritmetični in bitni nivo
4.1., med elementi vektorskih operaciji;
4.2., v sklopu aritmetičnih logiCnih vezij;
Proizvajalci raCunalniftkih sistemov, ki načrtujejo učinkovite vektorske operacije, bo izdelali posebne FORTRAN - prevajalnike. Ti razpoznavajo, Ce se zanka DO lahko zamenja z eno ali veC vektorskimi inetrukcijami. Zanka DO je tako «kalarna predstavitev mnoSice vektorskih operacij in zato more biti izvedena mnogo bolj učinkovito « strojnimi instrukciJami, ki so v taktì-no namene posebej načrtovane in se zato njihove operacije kar najučinkoviteje tudi izvajajo. Arhitekturne odlike se kafejo z uporabo bodisi cevanih aritmetičnih enot, kot Jih sreCuJemo pri cevanih vektorskih raCunalnikih (npr. pri CYBER 20S ali CRAY-i>, bodisi z večkratnimi procesnimi elementi, kot Jih sraCujemo pri vektorskem aritmetičnem mul tiprocesorJu (VAMP) ali pri matričnih procesorjih (npr. pri ICL DAP ali BBP).
Paralelne vektorske organisacije bodo predmet prvega dela naHe obravnave, medtem ko bo drugi del prispevka posveCen uporabi sistema s takfino organizacijo. Med drugim bomo obravnaval i opti-mizacijski postopek, ki Je zasnovan na matričnem raCunu in se lahko uporablja kot učinkovito orodje za implementacijo danega algoritma na paralelnem računalniku.
Videli bomo, da so lastnosti paralelnih implementacij »ekvenCnih kodov na materialni opremi paralelnega sistema moCno odvisne od prilagoditve prirojenega paralelizma v problemu, ki ga rettujemo, uporabljeni raCunalnifiki materialni opremi. To odvisnost lahko v doloCeni meri ugotavljamo na primeru, pri katerem uporabljamo paralelni računalnik za reSevanje problemov, ki so izrazito sekvanCni. Drug podoben primer je, Ce na speci-FiCnem paralelnem procesorskem sistemu izvajamo takfien algoritem, v katerem sicer obstaja nek paralelizem, je pa ta povsem drugaCne narave, kot paralelizem algoritmov, za izvajanje katerih je bil specifični paralelni sistem tudi zgrajen. Zato bomo v prispevku sledili ugotovitvam učinkovitosti rabe posameznih arhitektur pri raflevanju problemov določenega razreda.
2. PARALELNI SISTEMI
Klasifikacija procesorskih sistemov je dandanes ponovno vse manj zadovoljivo rešena. Znana avtorja na tem podroCju sta Flynn (1972) in Shore (1973), Problem ustrezne klasifikacije tiCi v 'te;m, da veC dobro vpeljanih sodobnejših arhitektur, predvsem učinkovitih cevanih računalnikov, ne sodi dovolj Jasno niti v en, niti v drug razred poznanih klasifikacij, oziroma, da tako kot raCunalnik ICL DAP, le-ta enako dobro ustreza hkrati dvema'razredoma. Alterna-
tivni pristop do primerne klasifikacije je, da dolodimo razrede glede na osnovne natine, na kakräne se paralelizem javlja v arhitekturi nekega računalnika. Tako lahkogovorimo o paralelizmu cevanja, kopiranja oziroma vektorske porazdeljenosti procesorjev, funkcionalnosti ali a multi procesorskem paralelizmu«
(1)	Paralelizem cevanja (pipelining) Je zasnovan na tehniki t.im. zbirne linije, ki poveCuje zmogljivost aritmetične ali krmilne enote;
(2)	Paralelisem funkcionalnosti Je zasnovan na veC med seboj neodvisnih enot, ki lahko izvajajo razliCne funkcije (kot so logiCne funkcije, funkcije setttevanja in množenja) in soCasno izvršujejo operacije nad različnimi podatki.
(3)	Vektorski paralelizem je zasnovan na polju identičnih procesnih elementov pod skupnim nadzorom ene same krmilne enote. Ti elementi eoCasno izvajajo enake operacije nad različnimi podatki, ki so shranjeni v njihovih zasebnih pomnilnikih. To so t.im. ko-raCne (lock-step) operacije z zaklepanjem;
(4)	Multiprocesorski paralelizam izkorifiCa veC-je «tevilo procesorjev, od katerih izvaja veak svoje inotrukci je, medtem ko navadno med seboj komunicirajo preko skupnega pomni Inika.
Flynn /1/ je razdelil paralelne računalnike, ki procesirajo veC podatkovnih tokov hkrati v dve kategoriji. To so veCinstrukcijski veCpodat-kovni stroji MIMO (Multipl-lnotruction Multiple-Data) in enoinstrukcijski veCpodatkovni stroji SIMD (Single-Instruction Multiple-Data).
Vsaka od obeh kategorij ima svoje značilnosti, □glejmo si jih na kratko.
Računalnik MIMD ima navadno majhno Število nepretirano učinkovitih in med seboj povezanih procesorjev, ki lahko sodelujejo med seboj ali pa delujejo neodvisno drug od drugega. Zato je potrebno, da so neodvisna programska opravila kodirana in dodeljevana posameznim procesorjem. Posebna skrb mara biti posveCena sinhronizaciji dodeljevanja informacij med procesorji, ko eden od procesorjev zahteva informacijo od drugega procesorja. Dokaj Jasno Je, da pri obseSnejßih opravilih in pogostejši sinhronizaciji nekateri procesorji Čakajo brez dela, dokler drugi procesorji no dokončajo svojega opravila. Primer stroja taktìnega tipa je Loughborough-ov računalnik Neptun, ki je zasnovan na atirih mini-raCunalnikih Texas Instrument 990/10. Ta računalnik in njegovo uporabo Je podrobneje opisal F!.H.Barlow /1
Računalniki, vodeni s tokom podatkov, ki pripadajo kategoriji strojev MIMD, predstavljajo lo-Ceno podkatergorijo. Vzrok loCitve tiCi v dru-gaCnem postopku vodenja, ki temelji na drugih in ne na klasiCnem von Neumannovem konceptu. Pri klasiCnem tj. sekvenCnem računalniku predstavlja program, ki se na njem izvaja, neko zaporedje (sekvenco) instrukcij. V računalniku MIMD pa se veC sekcij kode lahko izvaja paralelno na različnih procesorjih. V vsaki od sekcij lahko Öe nadalje obstaja vzporedje, ki pa je nevidno programerju, ker se le-to pojavlja' znotraj posameznih instrukcij.
Pristop vodenja s tokom podatkov se naslanja na direktno predstavitev z grafom. Vzemimo enaCbO!
y =	+ b.K^ + c.H + d
(sornjo enaCbo moremo izraziti na obiCaJen naCin v viskonivojskem programu. Moremo pa Je predstaviti tudi direktno z grafom, ki ga vidimo na sliki 1. V grafu na tej sliki predstavlja vsako vozlittCe eno od osnovnih operacij (tj. mnoSenJe ki je na sliki oznaCeno z ">i", in seftte-
Vdnje, ki Je označeno z "->-").
Vidimo, da CCS voiliaCa grafa povezana s pu&Ci-cami, ki so obrnjene v smeri toka informacij: vhod-iihod. Informacija potuje od izhoda voz-liäüa, ki je pravkar izvrSilo evojo funkcijo (tj. dolofienn operacijo), k vhodu' vozliéCa, ki bo ttvojo funkcijo Sele priäelo izvajati. Serijsko - paralalna odvisnost med operacijami postane oCitnejSa pri predpostavki, da lahko dano vozliace isvrSi svojo funkcijo samo tedaj, ko so prisotni vsi njeni vhodi. Ope?raciJe (funkcijo) so v tem primeru vodene le s tokont podatkov, Takten prietop izvajanja paralelizma ima prednost v tem, da programer uporablja pri implementaci Ji stroja, ki je voden s tokom podatkov, dokaj obiCajen jezik. Podrobnosti o taksnem stroju najdemo na primer v delih John-sona /3/ in Sauberja /4/ is firme Texas Instrumente in v de^h da-Silva in hloodsa /5/, ki sta delala na Manchester računalniku, pa tudi v seriji Člankov na temo podatkovno vodenih arhitektur v zadnjih dveh Istih v tem Časopisu (Informatica, letniki 85,86 in 97).
V	popolnem nasprotju s stroji MIMD imajo stroji SIMD navadna veliko tìtevilo (yCasih tudi veil tisoC) manj učinkovitih procescàrjev, ki oo prav tako, vendar bolj strogo povezani med seboj. OCitnejSa razlika teh strojev v primerjavi s stroji MIMD je, da so njih procesorji praviloma sposobni opravljati le enaka osnovna opravila.
V	nadaljevanju bomo uporabljali izraz vektorski procesor izklJuCno za raCunalnike razreda SIMD, ki uporabljajo kiasiCen (neaspciativen) pomnilnik z naklJuCnim dostopom (F!AM) , in izraz asociativni procesor za računalnike iz razreda SIMD, ki uporabljajo asociativen pomnilnik. Med vektorske procesorje sadijo navadno oboji: vektorsko cevani in matrični raCunalniki. Tipična primera taksnih računalnikov sta računalnika GRftVl in ICL DftP. Prvega uvrStamo med vektorsko cavane, drugega pa med matriCne računalnike.
V	pojasnilo naj posebej omenimo, da Sray-1 nima velika Število procesorjev. Kljub temu ga LivrS-Camo med stroje SIMD. Paralelni operandi so
vzemimo element raCunanja pomiCne vejice, tj. element mnoüenja dveh vektorjev x in y z rezultatom z, pri Čemer imajo vsi vektorji dolžino 64, Rezultat dobimo z izvajanjem naslednjih korakov:
(1) Izvršimo produkt posameznih komponent <2) Dodamo eksponente
(3) Normaliziramo in zaokroSimo rezultate
Pri preprostem sskvenCnem računalniku se vse tri stopnje izvajajo zaporedno za vsak par aperandov (vektorsV'.ih komponent). Pri tem se moramo zavedati, da med izvajanjem seštevanja eksponentov v 2,koraku, stopnja 1 izvajanja algoritma mlrujej podobno, ko se rezultati v 3.koraku normal i zirajo, mirujeta stopnja 1 in 2. V vektorsko cevanem računalniku, kot je npr. Bray-1, pa obstaja veC segmentiranih funkcionalnih enot, ki lahko delujejo paralelno. Na
<a)
Preprost sekvenCni raCunalni k
-- ►•'1-Vl
1	I
2	I
(b)
Vektorsko cevani računalnik
I
-V-:
1	I
— I
2	I
«2-y2
I
1 I
<- 3
-V-! 1 ;
2 !
Z2 3 I
<- 3
- «2-V2
Z2 <- 3 1
«s-vs i
-V-I
1	I
2	!
Z3 <- 3 I
«3-V3
I
V
1
V
i
v
) i
v.
I I
I I.
I b
_____v„
I
i
1 ;
v
I
v.
I d V
I i
_.v___
;	I
I + t
I_______ I
I
v v
Slika 1. Krmiljenje s tokom podatkov
1
23 <- s I
(C)	Paralelni vektorski	procesor
«1-Vl	i<2.y2 ----	«3-V3 1
V-i	-V-i	-V-t
1 1	1 ! ----	1 1 }
2 1	2 i ....	2 1
3 1 ■—!	3 1 * • • «	3 1
i V	y	1 y
	Z2 ----	Z3
namreC omejeni v takSnem smislu, da opravljajo enake operacije. Kot primer vektorskega cevanja
Slika 2. Cevanje pri vektorsko cevanem in paralelnem vektorskem procesorju
primer, tto se izvrtìi mnoSenje segmentov	in
Vj, ae priCne seštevanje eksponentov segmentov Hj in yj, toda istotasno starta tudi mnoSenjs naslednjih dveh segmentov «2	V2' itd.
Mimogrede povedano, tudi orednji in zelo hitri BBkvenüni stroji izkoriSCajo prednosti cevanj«. Obravnavani primer je prikazan na sliki 2. (za n=64).
Pri sakvenCnem stroju so potrebni trije urini impulzi za izraCun vsakega "delnega" rezultata, kar pomeni 192 impulzov, da ee izraCuna celoten vektor z. DrugaCe je pri vektorskem cevanju, kjer je po prvih dveh impulzih potreben le tie sn »am urin impulz na "delni" rezultat. To pomeni skupaj 66 impulzov, da se izraCuna celoten vektor z. V primeru matričnega procesiranja pa dobimo vseh n (delnih) rezultatov v treh urinih impulzih. Torej lahko zaključimo: Če primerjamo voe tri možne natine mmotenja n = 64 komponenttji h vektorjev in uporabimo)
a) eekvenCen raCunalnik, potrebujemo 3n=192 impulzov,
b> vektorsko cevani računalnik, potrebujemo 2+n
= 66 impuzov, c) paralelni vektorski procesor, potrebujemo 3 impulze.
ALBORITMI I
explicitni parftlelizem
i
SlhD
I ■ I __ I
CEVANJE
I
MIMD/SIMD I
)
MIMD
i i
implicitni
paralelizem
TOK PODATKOV (DATAFLOW) I
i
polje
(BlSTDLIeNO PROCESNIH
POLJE)
1_________
DODELJEN POMNILNIK
Tesno
povezan
sistem
ELEMENTOV
I
VEČKRATNI CEVANI SISTEM
_________I I
I
VEČKRATNI
PRDCEBNI
SISTEM
STRUKTURA SPOROČANJA PREKO VODILA
Rahlo povezan si stem
I
NOTRANJE
POVEZOVALNO
VEZJE
Rahlo povezan si stem
Slika 3. Klasifikacija paralelnih računalniških arhitektur.
naCin izvajanja paralelisma v algoritmu, lahko paralelne arhitekture klasificiramo /9/ po shemi na sliki 3.
2.1. Vektorski in asociativni procesorji
Slede na zgoraj omenjene vidike procesiranja pa lahko paralelne procesorske sisteme razdelimo v «est razredov, in sicer:
»PNSAR - bitno paralelni (BP), besedno serijski (MS>, vektorski (AR) procesor JiiUNGER, SOLOMON, VAMP, CDC 7600, GRAY-1, ILLIAC, BSP in NASFf WPBSAR - besedno paralelni (WP), bitno serijski (BS), vektorski (AR) procesorji! CLIP, DAP, STARAN in MPPj WBSPAS - besedno (W> in bitno' (B) serijski (S) in paralelni (P) asociativni (AS) procesorji ali tim. ortogonalni procesor-jiiOMENàO. Računalniki tega rasre-da se lahko zdrutujejo v ustrezne MSIMD <tj. večkratne SIMD raCunalnike); MAP, PM*;
BPWSAS - bitno paralelni (BP), besedno serijski (WS), asociativni (AS) procesorji (zasnova procesorja PEPE)j WPBSAS ~ besedno paralelni (WP), bitno serijski (BS), asociativni (AS) procesorji! STARAN, RELACSj UNCNAS - nepovezani (UNCN) asociativni <ftS) procesorji z besednimi rezinami (bitno serijski)! PEPE; CNAR - povezani (CN) vektorski (AR) procesorji z besednimi rezinami (bitno serijski): ILLIAC IV. Med pridruüene računalnike MSIMD tega razreda lahko prištevamo sistem PHOENIX.
Vsi vektorski procesorji podpirajo koncept ločenih podatkovnih pomnilnikov (DM) in procesnih onot (PU>, ki so povezani med seboj z nekim podatkovnim vodilom ali stikalnim elementom. Pri tem je nepomembno, da imajo nekatere implementacije enobitnih strojev WPBSAR procesno enoto in podatkovni pomnilnik realizirana na isti vtiCni enoti (tiskanini). TakSen procesor je npr. ICL DAP- Nevektorski procesorji pa predstavljajo alternativni pristop porazdeljeni procesorski logiki, ki je preneSena na pomnilnik. Imenujemo jih "Vsebinsko naslovljivi paralelni procesorji" (CftPP) ali "Asociativni procesorji" (LIMA) in zavzemajo obsežno področje procesorjev od najpreprostejÉih asociativnih pomnilnikov do zelo kompleksnih asociativnih procesorjev /10,11/. Shemo arhitekture te vrste procesorjev iz razreda LIMA prikazuje slika 4.
Podrobnosti o računalniku Bray-1 drugimi tudi v delih /6/ in /7/.
najdemo med
CU

pu
DM
Podrobnejšo klasi'f i kaci jo strojev je mogoCe podati bodisi z vidika organizacije njenih sestavnih delov, v katerih se vzporedje po-pojavlja, bodisi z vidika uporabljenega naCina krmiljenja. Slednja ki asi + i kad ja je obravnavana v delu B.RobiCa /S/. V nadaljevanju pa se bomo omejili predvsem na prvo omenjeno Islaaifi-kacijo, ki Je osnovana na paralelnem procesiranju bitov al i/in besed in na Številu uporabljenih krmilnih enot. Vse pogosteje .pa dandanes sreCujemo tudi sisteme, ki vkljuCujejo oba naCina vodenja: podatkovnega in nepodatkovnega, od katerih slednji sloni na klasiCni von-Neummanovi arhitekturi. Taköen "metìani" sistem Je npr. wavefront procesor. (JpofltevajoC obe omenjeni k 1 asi + i kac i j i , je mogbCe ki asi-f i kaci jo strojev Se bolj nadrobno razdelati. Glede na
Slika 4. Arhitektura procesorja iz razreda UMA.
Področje aplikacij strojev iz razreda LIMA. ki se odlikujejo po izredno hitrih operacijah in po lahkem programiranju, je zelo veliko. Med napogostejäe aplikacije sodijo: procesiranje radarskih podatkov, testiranja povezanosti, ki jo srečujemo v determinističnih implikacijah semantičnih mrež umetne inteligence, pravopisno popravljanje, digitalna diferencialno analiziranje analognih podatkov (kar predstavlja bistveno bolj učinkovito reäevanje problemov, ki jih sicer rešujejo analogni raCunalniki), nadzorovanje zraCnega prometa, urejanje, Časovno
vhodno/izhodno dodeljevanje med terminali in glavnim raCiunalni kom in reSevanje relfiksaci j-skih problemov v fiziki. Pogosto pa e» asocia-^ tivni paralelni procesorji uporabljajo tudi za opravljanje «tevilnih specifičnih -funkcij v konvencionalnih računalnikih.
Vsebinsko naslovi ji v paralelni procesor CftPP je zasnovan na vsebinsko naslovi ji vetn pomnilniku CAM, ki tnu je dodana sposobnost paralelnega vpisovanja v besede, ki so za ta namen posebej označene. Lahko se spreminja celotna vsebina besedi aH le del besede, ali celo samo posamezni biti v oznaCeni besedi. V takSnem primeru govorimo o sposobnosti pomnilnika z večkratnim zapisom. Po tej sposobnosti se loci CAM od CflPP. Le-ta omogoCa izvajanje paralelne aritmetike, sestavljeno iskanje in v splottnem emulaci jo vektorskih računalnikov, kot na primer računalnika ILLIAC. Prvi predlog pomnilnika CAM je dal Slade že leta 1956, v letu 1972 pa Je Qoodyear Aerospace Corporation poslal na trfitt-če računalnik STARAN, kot prvi komercialno dosegljiv CAPP.
Nadalje si oglejmo «e ostale razrede paralelnih procesorjev in vsaj po enega predstavnika vsakega razreda.
8PWSAR. Procesor je konvencionalne von Neumano-ve arhitekture z eno samo krmilno enoto (CD), procesno enoto (PU), krmilnim ali inštrukcijs-kim pomnilnikom (IM) in podatkovnim pomnilnikom (DM>. Enojni DM pri Čitanju predaja vse bite neke besede PU, da Jih le-ta paralelna procesira. PU lahko sestavlja veC funkcionalnih enot, ki so lahko tudi covane. Zato pripadajo temu razredu tako cevani skalami računalniki (kot Je npr. CDC 7600) in cevani vektorski računalniki (npr. GRAVI), ki se odlikuje po svoji preprosti arhitekturi. Shemo arhitekture računalnikov tega razreda ponazoruje slika S.
I I I I :	I
i IM i—>—1 CU l~>—I Horizontalna PU	i
I___„_i !______J I___ „_	I
! I i ! I i I I :
I	I
i Besedne rezine DM	I
i____________________I
Slika a. Arhitektura procesorja Iz razreda BPdISAR
UNSER (1938) je zasnoval računalnik za reševanje prostorskih problmov, predvsem za aplikacije kot so npr. razpoznavanje vzorcev. Ta računalnik, imenovan tudi "prostorski raCunalhik", ima dvodimenzionalno polje PE pod skupnim nadzorom. V računalniku SOLOMON je bil vpeljan koncept t.im. "zaklepanja po korakih" v operacijah strojev BtMD. Koncept je predlagal Slotnick (1902), realiziran pa je bil fiele v seriji strojev ILUAC in fle nekaterih kaenejöih strojih tipa SIMO. Tri leta kasneje Je bil zgrajen vektorski aritmetični procesor VAMP (Senzing in Smith), ki je sastavljen iz linearnih vektorjev PE z dodeljenimi jim pomni Ini skimi moduli in cevano aritmetično enoto. Vsaka PE je virtualni procesor, ki vsebuje samo nekaj delovnih registrov. Cevani vektorski procesor pa je bil načrtovan z namenom, da zmanjSa ceno materialne opreme, potrebne za vektorsko procesiranje.
Prvi model računalnika GRAY-1 je bil zgrajen pri Gray Research Inc. kot najhitrejši raCùnal-
nik na svetu in dobavljen laboratoriju "Los Alamos Scientific Laboratory" leta 197é>. To Je tudi prvi komercialno dosegljiv cevani vektorski procesor. Ima 12 funkcionalnih enot, sedaj le vse CBvane, hitrjfto urb	ns. It bank
bipolarnega pomnilnika z 10 besed in 50 ns ' ciklom ter osem ò4-bitnih vektorskih registrov za pomnenje é>4-ìh Števil v pomiCni vejici. Aritmetične operacije nad temi vektorji izvaja s pri bi i ino 32 strojnimi ukazi. Tri funkcionalne enote so namenjene vektorskim operacijam (pomiku, logiCnim operacijam in sedtevanju), tri enot® pa skalarnim operacijam (setttevanju v plavajoči vejici, množenju in reciproCni aproksimaciji) m.
Burroughs Corporation je igral vodilno vlogo pri razvoju vektorskih paralelnih procesorjih. Ta se Je priCel s procesorjem ILLIAC IV in nadaljeval s paralelnim procesorjem PEPE. Kmalu mu je sledil BSP s komercialnimi težnjami na perspektivnem tržifecu, ki ga je predstavljalo Se takrat ekspanzivno splofino znanstveno -raziskovalno okolje.
Projekt ILLIAC IV je imel cilj, da se razvije visoko parelelni raCunalnik z velikim številom aritmetičnih enot, ki bi izvajale vektorske ali matriCne izraCune s hitrostjo reda 10 operacij na sekundo. Da bi se dosegla takfina zmogljivost sistema, Je bil raCunalnik prvotno naCrtovan z 256 PE-mi, katere naj bi nadzorovale 4 centralne procesne enote (CU). Zarade previsoke cene in nesprejemljivih zakasnitev operacij dodeljevanja je bil prvotni sistem okrnjen na eno Četrtino. Tako konCni produkt, ki je namenjen y predvsem reševanju parcialnih diferencialni h enaCb (v numeriCni obdelavi vremenoslovni h kart za napovedovanje vremena, v nuklearnih raziskavah in v drugih Številnih aplikacijah), sestavlja le polje B X 8 po 64~bitnih procesnih elementov za računanje s plavaJoCo vejica pod ' nadzorom ene same CU. S taka okrnjenim sistemom so dosegli hitrost 200 milijonov operacij na sekundo. Vsaki PE je dodeljen pomnilnik z 2K besedami, ki dela po naCinu koraCnega zaklepanja z najbližjimi sosednimi povezavami.
Eden od problemov, ki so nastopali pri uporabi eksperimentalnega prototipa ILLIAC IV, so bile zakasnitve pri prenašanju podatkov na dolge razdalje preko polja PE zaradi omejitev v Številu najbližjih sosednih povezav med t4-imi procesorji in njihovimi 64-imi bankami pomnilnikov (PEM). Zato je Burroughs v svojih komercialnih produktih raCunalnik ILLlftC priCel proizvajati v "okrnjeni" izvedbi z li-imi procesorji in 17-imi pmniInittkimi bankami pod oxnako BSP (Burrougs Scienti fies Processor). Zaradi manJSega Števila procesorjev Je bilo mogoCe komunikacije izvesti preko posebnega "uvrstitvenega vezja" med poljubnima paroma procesorja in pomniInittke banke. VeCje Število pomniIniäkih bank v primerjavi s Številom procesorjev dopuBCa uporabo algoritmov preslikav, ki zmanjšujejo Število pomniInittkih konfliktov. Le-ti lahko sicer moCno narastejo v pogostih manipulacijah z matrikami. PE so procesorJi, ki omogoCajo raCunanJe s plavajoCo vejico, in so serijsko organizirani. Seštevanje ali množenje Izvajajo paralelno in dajo li- rezultatov tovrstnih operacij iz polja PE v 320 nsek. S skrbnim prekrivanjem Čitanja, vpisovanja in aritmetike, skupaj s povezovanjem viSka PEM s PE, raCunalnik BSP uCinkovito prepreCuje ozka grla in dosega pri reSevanJu veČine problemov, ki jih reSuJe, maksimalno procesno hitrost 50 Mflop/s.
V ta razred prištevamo tudi novejSi sistem NASF za potrebe NASAi Numerical Aerodynamic Simulation Facility. Sistem Je predlagal Stevens leta 1979, NaCrtal ga Je CDC na osnovi izpopolnjenih Štirikratno cevanih računalnikov CYBER 205, ki delajo po naCinut.im. "koraCnega zaklepanja",
7 dodatnim petkratnim cevanjem kot pripravljena rezervo, ki ne elektronsko vkljuäi v primeru, da se odkrije napake v delovanju sistema. Veako cevanjB lahko da en 64-bitni resultat «H dva 32-bitna rezultata na vsakih 8 nsek. V vsakem primeru pa je rezultat dobljen v najveC treh operacijah. Zato je pri tem računalniku doseäe-na doslej najveCja hitrost raCunaja v aritmetiki s plavajočo vejico, in sicer 3 Gflop/sek. K temu pripomore tudi hiter skalami procesor z là nsek uro.
MPBSflR. Procesor tega razreda se v bistvu razlikuje od procesorja iz prejflnega razreda samo v tem, da se pri Čitanju DM dostavi bitna resina vseh besed v pomnilniku, namesto da bi se dostavili vsi biti ene besede. Zato Je PU organizirana tako, da izvaja vse operacijo v bit-seri Jskem naCinu. V primeru, da je pomnilnik dvo-dimenzionai no polje bitov z eno besedo na vrstico, Cita raCunalnik tega razreda vertikalno rezino bi tov, .medtem ko «ita raCunalnik iz preJSnega razreda horizontalno rezino. Primera strojev iz tega razred« «ta ICL DAP in STARAN, shema arhitekture teh procesorjev pa Je prikazana na sliki 6.
Medtem, ko si bomo računalnik ICL DAR podrobneje ogledali v naslednem poglavju, bomo na tsm mestu nekaj pozornosti posvetili računalnikoma CLIP in MPP.
11	II IH l~>—I CU i—>—i ____I I I	I
Vertikalna PU
I
I I I I ! I i I I ! I I I I I I	I
I Bitne rezine DM ! I	I
Slika 6. Arhitektura procesorja iz raareda WPESAB.
Vektorski procesor CLIP-4 je procesor, ki Je zasnovan na bitnih rezinah. PE procesorja so rasporeJene v cel iCno rnreüo 96 x 96 PE. Vsaka procesna enota ima osem sosedov v dvodimenzionalnem polju, lahko pa Je povezana le s Stirimi najbližjimi sosedi.
Računalnik MPP je t.im. masivni paralelni procesor (massively parallel procesor), ki Je bil razvit v NASA Goddard Space Flight Center za procesiranje satelitskih posnetkov. "Masiven" se imenuje zato, ker ima I23 x 123 = 16384 mikroprocesorjev, ki lahko vsi hkrati paralelno procesirajo. Procesor MPP raCuna v aritmetiki bitnih rezin z operandi spremenljivih dolSin, Ima miteroprograml ji vo krmilno enoto, ,ki ji Je mogoCe vprogramirati popolnoma prilagodljive instrukcije za vektorske, skalama in 1/0 operacije. Sistem MPP je izveden v celoti v pol-prevodniSki integrirani tehnologiji vezij in uporablja mi kroprocosorske dJipe in bipolarne RAM-pomniIni ke.
MBSPAS. Procesorji tega razreda so sestavljeni iz kombinacije arhitektur strojev iz razreda BPWSAR in WPBSAR. Arhi,tekturo WBSPAR sestavlja dvodimenzionalni pomni ink, iz katerega se lahko Citajo ali besede ali bitne rezine, horizontalna procesna enota, ki procesira besede, in vertikalna procesna enota, ki procesira bitne rezine. To je v bistvu ortogonalni računalnik, ki ga je zasnoval Shooman Se leta 1770. Oba omenjena računalnika iz preJSnega razreda ICL OAP in STARAN sta sicer lahko programirana tako, da zagotavljata sposobnosti računalnika
iz tega razreda, ker pa nimata loCenih procesnih enot za procesiranje besed in bitnih rezin, ne pripadata temu razredu. Implementacijo strojev, katerih arhitektura popolnoma ustreza definiciji strojev iz razreda WBSPAS, predstavlja serija računalnikov QMEN-ÓO. RaCunalnik OMEN-60 je zasnoval Higbie te leta 1972, njihovo arhitekturo pa kaile slika 7.
I
IM
I_________I-------->—I	i 1	I
t CU I—>-! Horizon. PU I
I	i-<:—I______I i_____________I
I Vertikal.PU I	t t 1 I i 1 1
1 . _ _ I	l_I.J„l„l_i.. I
I ~r r"T r~i	i	:
t I j I I ! ;	_____i Podat.pomn. I
I ■ _	i
Slika 7.
Arhitektura procesorja iz razreda MBSPAR.
Serija računalnikov OMEN (Ortogonal Mini EmbedmeNt> je komercialna implementacija ortogonal-nega raCunalni&kega koncepta proizvajalca Sanders Associates za aplikacije, kot Ja procesiranje signalov. Ta serija uporablja raCunalnik PDP'^ll za konvencionalno horizontalno aritmetično enoto in polje 64 procesnih enot (PE) za asociativno vertikalna aritmetično enoto. Operacija te enote se izvajajo predvsem nad zlo-govnimi rezinami in ne toliko nad bitnimi rezinami, odvisno od modela; ali ima le-ta bitno serijsko aritmetiko z osmimi biti pomnilnika, ki pripada vsaki PE, ali pa ima aritmetiko v plavajoči vejici, izvedeno v materialni opremi z osmimi Ì6~bitnimi registri in petimi maskov-nimi registri. Posebna logika med PE-mi obraCa vrstni red zlogov v rezini ali izvrßuje popolno premeščanje (perfect shuffle) ali cikliCni pomi k.
Med MISMD primere tega razreda lahko prištevamo računalnik MAP (Multi-Associative Procesor), z osmimi CU, ki jim je dodeljeno 1024 PE in sistem PM , ki je bil načrtovan kot "rekonfigura-bilni" računalniški sistem. Posebnost tega procesorja je, da lahko procesira na tri naCi-nes kot MSIMD, MBISD in kot MIMD. TipiCno konfiguracijo PM sestavlja 16 centralnih enot s 1024 procesorckimi pomni 1 ni Ski mi enotami. Sistem je bil kasneje na Pardua University izpopolnjen v t.im. PUMPS tako, da ga je mogoCe uporabljati tudi za sploftne raziskave veCproce-sorskih sistemov.
BPWSAS. Arhitektura računalnikov tega razreda Je analogna arhitekturi računalnikov iz razreda UiPEiSAR, kot Jo kaXe slika 5. V ta razred spadajo asociativni procesorji, ki uporabljajo besedno serijski asociativen pomnilnik. Na tovrstnih procesorjih je zasnovan tudi raCunalnik PEPE. Ker je le-ta hkrati tudi t.im. "nepovezan asociativen procesor", ga bomo obravnavali v prav tako imenovanem razredu UNCNAB.
MPBSflS. Arhitektura računalnikov tega razreda je analogna arhitekturi raCunalnikav iz razreda WPBSAR (Slika 6). Predstavnik tega razreda je STARMsi. Uporablja bitno serijski asociativen pomnilnik. Ima do 32 asociativnih vektorskih modulov. Prvi raCunalnik tega tipa je bil namenjen procesiranju digitaliziranih slik leta 1973. Popolnoma paralelna struktura uporablja zahtevno in draga logiko v vsaki pomni Ini Ski celici in zapletene komunikacije med celicami. Bitno serijski asociativen procesor je bistveno
cenejtti od popolnoma paralelne strukture, kor se istoCaono primerja le ena sama bitna reiina. Va«k od.32 asociativnih vektorskih modulov ima 2S6 K 25é> bitnih besed večdimenzionalno dostopnega pomnilnika (MDfl), 236 procesnih elementov, permutacijsko (flip) veaje in «elektor. Vsak procesni element obravnava serijsko bit po bitu nad podatki v vseh besedah MDA pomnilnika. S pamoCjo permutacijskega vezja podatki, ki so shranjeni v pomnilniku DMft, dostopni preko I/O kanalov v bitnih ali besednih rezinah ali v kombinaciji obeh. Permutaci Jako vezje izvaja pomike ali opravila, ki omogoCajo paralelno iskanje, aritmetične ali logiCne operacije nad besedami pomnilnika MDft. Ta pomnilnik je Goodyear Aerospace implementiral tako, da je RAM Cipom dodal XQR logiCna ves ja, v kasnejSih, dražjih modelih pa je MDA-pomni1 ni k poveCal na 9216 H 256 bitov na modul in znatno poviftal hitrost I/O operacij in procesi ranja. Med izvajanjem ene instrukcija se podatki v vseh selektiranih pomnilnikih vseh modulov procesirajo soCasno s preprostimi procesnimi elementi, ki so dodeljeni vsaki podatkovni besedi. Vmesnittka enota vsebuje vmesnike e seri^ior jI , konvencionalne raCunalnike, signal procesorje, interaktivne prikazovalnike in masovne pomnilniftke naprave. RazliCne I/O opcije so implementirane v "custom design" vmaanittki enoti, ki omogotia direkten dostop do pomnilnika, I/O kanale z vmesnim shranjevanjem, zunanje funkcijske kanale in paralelni I/O. Vsak asociativni vektorski modul ima 256 vhodov in 256 izhodov, prav tako v posebni "custom design" vmesniäki enoti. Ta omogoCa veCjo hitrost komunikacij vektorskih podatkov računalnika z visoko pasovno fiirino I/O naprave in dovoljuje katerikoli napravi, da neposredno komunicira z asociativnimi vektoi'— Bkimi moduli. Navedene sposobnosti znatno povečujejo v zahtevnih aplikacijah prepustnoGt sistema, poenostavljeno komplsksnant programske) opreme pri soraxmeroma manjäi ceni materialne opreme. Posebna odlika raCunalnika STARAN je visoka I/O hitrost oz. zmogljivost komuniciranja z okolico in sposobnost preprostega povezovanja z konvencionalnimi računalniki. STARAN v sistemu a konvencionalnimi računalniki nadzoruje paralelno procesiranje opravil, konvencionalni računalniki pa opravila, ki se morajo procesirati serijsko.
Dandanes je veČina asociativnih procesorjev načrtovanih za iskanje informacij in obdelflvc podatkovnih baz. Taktne vrste je asociativen računalnik RELAC8, ki ga je predlagal Stevens (1979> in je namenjen raziskavam na Syracuse University. Zasnovan je na.uporabi večstopenjskih pomnilnikov med diski In gosti teljskim računalnikom.
UNCNAS■ Stroji tega raareda imajo pomnoSene PU in DM iz strojev prvega razreda BPWSAR. PU in pridruieni DM so v tem razredu definirani kot
procesni elementi PE. Vse PE izvajajo instrukcije ene same krmilne enote CU. Dobro poznan stroj tega razred« Je PEPE, Ker procesne enote med seboj niso povezane, je uporaba teh strojev omejena, po drugi strani pa obstajajo dolacene prednosti le--teh zaradi relativno enostavnih procesnih elementov. Arhitekturo računalnikov tega razreda kaSe sliki» 8.
Kot vidnejšega predstavnik» tega razreda ei bomo oglodali raCunalnik PEPE-
Obstajata dva tipa popolnoma paralelnih asociativnih procesorjev! besedno organizirani in s porazdeljeno logiko. Pri besedno organiziranih procesorjih je prisotna primerjava vsakega bita vsake besede in so možne logiCne odločitve po procesiranju vsake besede. Asociativni procesorji s porazdeljeno logika so manj zapleteni in aato tudi canejäi. Takflen procesor je PEPE, ki Je bil razvit v Bell Laboratory-jih za aplikacije procesiranja radarskih signalov. Procesor sestavlja sedem razliCnih funkcionalnih podsistemov. Od teh je pet nadzornih enot, kot so nadzorna enota izhodnih podatkov in pomnil-nifikih elementov, aritmetična, korelacijeka in asociativna izhodna nadzorna enota, ter nadzor— ni sistem in ötevilne procesne enote. Vsako PE sestavlja aritmetična enota, korelacijska enota, asociativna izhodna nadzorna enota in pomnilnik z 1024 32-bitniml besedami. 2BS PE je organiliranih v osem elementne svi tke. Selektirani deli opravil se nalagajo iz gosti teljskega računalnika CDC-7600 v procesne enote. Selekcijo doloCa inherentni paralelizem opravil in specifična arhitektura procesorja PERE, ki lahko obravnava posamesna opravila veliko bolj učinkovito kot gostiteljski univerzalni računalnik, in se uporablja le kot koprocesor tega raCunalni ka.
CNAR. Stroji tega razreda bo podobni strojem iz prejtìnega razreda s to razliko, da so pri strojih iz razreda CNAR procesne enote PE razvrSCe-
CentraVna procesna enota CU
PU S—>—< —
PU )—>
i
I DM 1
DM
<—\ PU I
I	I
I DM 1
• i
Slika 9.
Arhitektura procesorja iz razreda CNAR.
Centralna procesna enota CU
PU
, I — DM
I
__V___
I PU I
I______I
I . I
I	I
I DM I
-V —
PU
„ 1 DM
ne v liniji in med seboj povezane inajblisji sosedi. To pomeni, da morejo nekatere procesne enote naslavljati besede v svojem lastnem pomnilniku in tudi'od neposrednih sosedov. Na ta naCin je v doloCeni meri odpravljena slabost pracesorjav iz prejSnega razreda, ki izvira is nepovezanosti PE. Primer otroj« tega razreda je ILLIflC IV, ki omogoCa hkrati tudi direktne komunikacije med vsakimi osmimi procesnimi enotami PE. KarakteristiCno shema arhitekture procesorja iz tega razreda vidimo na sliki 9.
Slika 8. Arhitektura procesorja iz razreda UNCNAS.
Vidnejši predstavniki tega razreda ©o poleg računalnika ILLIAC IV tudi izvedenke MIMD tega računalnika, kot j» procesor PHOENIX. Njegov
90
predhodnik ja procesor, ki ga v bistvu sestav-ij» originalen ratìunalnik ILLIAC-IV. Ta je načrtovan tako, da ima Stiri centralne enote, ki jim Je dodeljeno 256 PE (4 krat 64 PE). V okviru raäunalniftkega projekta PHDEIMIV pa Je Edru-' iferiih 16 po 64 PE, kot razširitev računalnika ILLIAC IV za tte uüinkovitejäe MSIMD vektorsko procesiranje.
Med procesorje MIMD tega razreda lahko prištevamo tudi TRANSPUTERa<I SISTEM, ki ga bomo posebej obravnavali kasneje.
Pregled zmogljivosti nekaterih paralelnih računalnikov daje Tabela 1.
HOllflND
BIMD	Leto	Arhltek-	NajveCja
sistemi zasnove		tura	zmogi JIvost
(AR/AS)			v Hfl op/s
UNGER	1958	MB, AR	
SOLOMON	1962	WS, AR	
VAMP	1965	WS, AR	
CDC 7600	1969	MS, AR	
STARAN	1970	bB,AS	
ILLIAC IV	1972	MB, AR	00-200
PEPE	1973	bB,AS,UNCN	10g-2B8
CLIP	1967	bs.AR	10 pìKel op«
OMEN 60	1976	bs,AB,ORT	
MAP	1977	MS,AS,M	
SRAY-l	197B	MS,AR *	130-200
ICL DAP	Ì97S	bB,AR	10-30
BSP	1979	WS, AR	20-50
PM4	1979	MS,AS,M	
RELACS	1979	bs,AS	*
PHOENIX	1979	WE,AR,CN,M	
CDC NASF	1979	MS,AR *	lOOO-SOOO
CYBER 205	1982	MS ,AR	3000
MPP	19B3		200-6000
Tabela	1. SIMD-	-raCunalniSki	sistemi
Legenda:
WS beaedne rezine	M<SIMD) be bitne rezine
ftR vektorski	ORT
AS asociativni	CN
* SISD z MFE	UNCN
veCkratni (Multiple SIMD) ortogonalni poveš ani nepovezani
2.2. Pregled razvoja paralelnih sistemov.
V	Tabeli 2 je prikazan Časovni prikaz razvoja paralelnih vektorskih in asociativnih procesorjev. Med prve , ki so jih zasnovali, ©odijoi von Neumann, Holland in Shooman. Povezave med posameznimi procesorji v raipredelnici kaSejo v smeri od desne proti levi njihove prednike po arhitekturnih značilnosti.
V	zaključku tega sestavka naj omenimo, da bo stroji MIMD neprimerni za reSevanJe problemov, ki se nanaSaJo na reševanje sistemov parcialnih diferencialnih enaCb (p.d.e.s,} ali razseSnih optimizacijskih problemov. Tako nam sa reševanje tovrstnih problemov ostanejo le matrični in vektorsko cevani računalniki. Med prve sodi na primer ICL DAP, med druga pa Qray-l. Od obeh je za Omenjene raziskave primarnejSi vektorski računalnik ICL DftP iz naslednji razlogov;
1.	MreSa procesorjev (matrika) se enostavno preslika v mrežo, ki je za tip problemov, ki se reäuJejo, najustrineJSa, in
2.	Vektorski računalnik omogoCa poglobljen fttu-dij sočasnosti v algoritmu, medtem ko vektorsko cevani računalnik (npr. 6RAY-1) sočasnost (paralelizem) v algoritmu) na nek naCin prekriva.
MULTIMIKROPROCESDRSKE MATRIKE
i	I „
von
Neumann
ì __UNBER

VAMP (**)_^CL1P
MAP
SHOOMAN___STARAN___OMEN RELACS MPP

I
_____SOLOMON___(■»*)____I______DAP
I	PEPE_
I
_BSP.
NASF
I
.ILLIAC„_(«)_______PHOENIX
—55-----60-----65----
—70-----75-----80—>—85-
leto
Tabela 2, Razvoj paralelnih vektorskih in asociativnih procesorjev.
Zato se bomo v naslednjem poglavju poglobili v strukturo vektorskega računalnika, kakrSna sta na primer računalnika ICL DAP z jezikom DAP FORTRAN in novejai TRANSPUTERSKI SISTEM z jezikom OCCAM.
3. PORAZDELJEN VEKTORSKI RAČUNALNIK
Zgled vektorskega porazdeljenega procesorja je raCunalnik ICL DAP. Med novejSe tovrstne računalnike lahko prištevamo tudi sistem t.im. transputerjev. Transputer predstavlja procesot— ski modul. Vektorski raCunalnik pa dobimo. Ca te module matriCno poveSemo med seboj.
Ena od bistvenih razlik obeh računalnikov je v tem, da ICL DAP pripada razredu SIMD, medtem ko lahko TRANSPuterski Sistem pripada tudi razredu MIMD. Prvi ima v danaSnji verziji (Queen Mary Colege, University of London) v enoti 4096 procesorjev, ki so razporejeni v mariki 64 k 64, Ta enota ne dela kot samostojni raCunalnik, ampak le kot pomnilnittki modul, ki Je povezan z običajnim centralnim pomnilnikom. Podrobnejši opis računalnika DAP najdemo v delu Reddaway-a /S/. V TRASPS pa BO elementi matrike povsem samostojni procesorji tim. TRANSPUTERJI, ki pripadajo procesorjem iz razreda SISD /9/. Ne le v omenjenem primeru, ampak tudi sicer lahko ugotavljamo, da imajo procesorji v računalnikih, ki pripadajo računalnikom iz razreda SIMD, v primerjavi s procesorji v raCunalniki h, ki pripadajo razredu MIMD ali centralnim procesorjem, pravilom» zelo preprosto osnovno arhitekturo. V računalniku DAP so procesorji, ali bolje rečeno procesne enote PE, bitno organisi-rane, kar amogoCa sistemu veliko prilagodljivost aplikacijam, ki so skladne z njegovo organizacijo. Med takfine, najbolj pogoste aplikacije, eodijoi procesiranje slik, pregledovanje podatkov in simbolno procefeiranje, ki je združeno 2 osnovno aritmetiko.
3.1. Računalnik ICL DAP
ja prilagojena določenim 2ahtevam. Slednje bonii3 kasneje ogledali bolj nadrobno.
si
□snovno organiaacijsko shemo računalnika DfiP podaja slika 10.Iz nje je rasvidno, da je vsaka procBBna enota povezana a najbliiji mi Stirimi sosedi. Glavna podatkove poti, ki potekajo po kolonah in vrsticah, pa povezujejo vse procesorje dane vrstice oziroma kolons. Ts poti omogočajo hitro oddajanje in sprejemanje informacij preko matrike procesorjev in je mogoCe vzpostavljati diametralne povezave, tudi preko celotne matrike, ali več loüenih povezav. V računalniku DAP ima vsak procesor lòk bitov pomnilnika RAM, kar predstavlja skupaj 16k « 64 >t M / a = B megailogov. Pri tem naj omenimo, da imajo drugi podobni stroji, ki so danas instalirani, Is 4 kiloslogov na procesor ali skupaj 2 nisgazlaga.
64 K
H
/ /__
/ I
Procesna matriki 64 K 64
i
16k bitov pomnilnika na procesno enoto
!
! / i/
Sika 10. Osnovna organizacijska shema računalnika ICL DftP
Vsaka procesna enota poseduje svoje lastne podatke, ki jih obdeluje, sprejema pa tudi nekaj skupnih instrukcij, ki jih paSilja glavna kontrolna enota (MCU). Zato ta raCunalnik prištevamo med računalnike tipa SIMD.
Iz slike 11 je rasvidno, da ima procesna, enuta PE tri registrei delavni register fl omogoCa prekinjajoče delovanje procesorja v skladu z zahtevami programa, diödtem ko jo A enobitni akumulator in C register, ki hrani prenos.
Pomembna lastnost računalnika DAP je sposobnost izključevanja posameznih procesorjev v računalniku, Na primer. Ce reSujemo problem, za reile-vanje katerega zadosCa mreSa procesorjev, ki je manjSa od 64 >i 64, moremo s pomoCjo logiCna matrike odveCne procesorje z maskiranjem preprosto izključiti. Na ta naCin moremo izvajati
3.2, SISTEM TRANSPUTERJEV
Računalniki z veC med seboj povezanimi procesnimi enotami, imenovanimi TRANSPUTRJI /12,13, 14/, predstavljajo razliCne sisteme, med katerimi bo za nas posebej zanimiv porazdeljen vektorski raCunalnik.
Inmos, ena najpomembnejših britanskih -firm računalniških komponent, je ponudila trSiäCu 16-bitne (T 212 in ti 212) in 32-bitne <T 414), napoveduje pa tudi Se nove Se zmogljivejše (T 000) RISC procooorok© enote, imenovane transpu-terje, ki SD med seboj popolnoma združljive. Transputer je raCunalnik na Cip, ki omogoCa izvajanje vei! procesov hkrati in tudi sam skrbi za komunikacijo med njimi. Komunikacija poteka preko skupnega pomnilnika. VeC transputerjev se lahko povezuje med seboj preko kanalov v veC-
omogoCa konktirenCno Taksne sisteme je v Ss veCje sisteme in poljubnim Številom
procesorski sistem, ki izvajanje veCih procesov moe äe nadalje povezovati tako graditi sisteme s transputerjev. Transputerski sistemi niso veC zasnovani na von Noumannovi arhitekturi, kakršno ima sam transputer. Zato, in zaradi sposobnega transputer ja lahko dosegajo ali celo presegajo zmogljivost danaSnjih superraCunalnikov. Zaključeni transputerski sistemi »e povezujejo med seboj in s standardna mikroprocesorsko periferijo preko posebnih vmesnikov t.im. "link adaptor ji" IMS COOi in IMS C002, ki skrbijo za medsebojno sinhronizacijo veCih sistemov oziroma povezavo sistema z njegovo periferijo.
32-bitni transputer T 414 je splošno namenski in zmore 1Ü MIPSov pri 20 MHz, Prav tako je SpioSnonamenski njegov predhodnik T 212, medtem ko je M 212 namenski transputer za kontrolo inteligentnega diskovnega sistema. T 414, ki je predstavnik te druiine, je izdelan- v 1,5 mi-kronski CMOS tehnologiji s preko ISOk transis-tnrjev v B4-pinskem Cipu. Procesor ima 32-bitne notranje in zunanje izhode za naslove in podatke, ki sp (iiultipl eksi rani in dosegajo ^ hitrost prenosa 20 megazlogov, 4-gigazlQSni linearni naslovni prostor, PROM in 2k SRAM pomnilnika ter 4 medtransputereke komunikacijske kanale. V pogledu nabora ukazov transputer odstopa od običajnega nabora ukazov RISC arhitekture, predvsem po Številu vseh ukazov in prisotnosti ukazov za mnoSenje in deljenje.
N
i.
E I
SESTEVALNIK
/
Vektorska kontrolna enota
Slika 11. Arhitektura procesne enote procesorja ICL DAP.
/ l\
Vhodno / i zhodna enota
i :
\i/
Gostitel j. raCunalni k
:<;—>i
1
'i I I , I K — >1 I ' I
I.
I
K—>: 1<~>I K — >1 K —>1 i< —>1 K —>i K —>t. I I
f ; i
\i /
_____1____
Matrika 16 M 16 Traspu-terjev T414
K-
l<-K-l<-K-
I I I
->l ->1 -X ->l -> I ->1 -> 1
\ I/
P o v e z .
I
1
neke matematične operacije na delu matrike, ki
Slika 12. TransputerEki vektorski sistem
Slika 12 kaSe transputerski vektorski sistem, ki ga sestavi JaJi» kvadratna matrika Ns^IìikIò procesnih enot, sistem pomni Iniftkih vmesnikov, povezovalni Bistem, matriCna krmilna enota, goeti tel .jski raCunalnik in vhodno/izhodna enota.
Kot Bino te omenili, so transputer.ii zelo zmog-' IJivi 32 bitni procesorji (n.pr. T424)b statičnim pomnilnikom (pomni Ini itko banko) in rasliCne učinkovitimi komuni kaci jeki mi vmesniki. Vse te
skalar, vaktor ali matrika (kjer so vektorji in matrike dimenzije N oziroma N k N. Pri računalniku DflP je dimenzija N = 64). Medtem, ko lahko seštevanje dveh matrik v obi Čajnem FORTRftNu zapiHpmo kot
DO 10 1=1,N DD 10 J=1
Cd ,J)=A(I,J)+B<I,J) 10 CONTINUE,
bo v DAP FORTRAN gornji zapis preprosto skrCen na en sam stavek
t : : t
t 1. 1 . . .
1 2.
s
I N. 3 . . .
P O
V
i
;
i
PE !<----->1 MB i
i
i
. z.
0 v a
1 n
I I
K---
I
I t I . . . .
I
— >1 PE
K___i
n s
->l MB 1
i
X
j
a
K-
1
PE
l<-1
->l MB 1
Blika 13. Konfiguracija komunikacij znotraj vektorskega polja (PE»pracQsn@ enota, MB=pDnini Iniftki bloki, N=16«lći)
C=A+B
Jezik DAP FORTRAN ima «tevilne osnovne -funkcije, ki B® uporabljajo za izvajanje določenih operacij. TakSne -funkcije so na primer:
SUM Israfiuna skal arno vsoto matrike N k N. ABS Zapiße absolutno vrednosti vsakega elementa v matriki N m N. MAXV PoiSCe najveCje skalarne vrednosti v matriki N y. N .
MAXP PoìRCe posici jo najveCje vrednosti tozi-roma najveCjih vrednosti, Ce jih je veC) v matriki N x N. Pozicija je oznaCena z "1" v logiCni matriki. ALL IzraCuna logiCno vsoto IN vseh vrednosti v logiCni matriki (ali logiCni izraz, ki izraCuna logiCno matriko). Funkcija je laredno učinkovita pri konvergenCnem testu matrike.
Pomemben pripomoCek pri obravnavanju matrik predstavlja, kot smo ie omenili, sposobnost izklapljanja posamainlh procasarjov s pomoCjo logiCne matrike. V nadaljevanju bomo pokazali nekaj primerov.
Predpostavimo, da imamo matriko A razsežnosti 64 K 64, ki vsebuje pozitivna in negativna «te-vila, laCemo kvadratni koren vseh pozitivnih elementov matrike. To lahko zapiSemo v običajnem FORTRANu takolei
komponente so integrirane v enem samem Cipu, ki predstavijajo odliCne gradnike konkurenčnih procesnih vezij /15/. Transputerske zanke predstavljajo kanale za medprocesne komunikacije v materialni opremi. Tako obstaja zelo tesna povezanost med povezavami (zankami) transputer— ski h kanalov in komunikacijskim protokolom, ki je primeren za komunikacije v t.im. "wavefront" vektorskih procesorjih (wave-front array proces-sars).
Proizvajalec transputerjev Je poskrbel tudi za učinkovito in lahko programiranje v jeziku OCCAM, ki ga Je Inmos posebej razvil za transputer. Prevedli «o ga Sfe tudi za druga okolja, npr. WAX in IBM PC,
4. PROGRAMIRANJE VEKT0RSKE6A RAČUNALNIKA
Najpreje si bomo ogledali, kakäno Je programiranje na paralelnem vektorskem računalniku DAP, medtem ko si bomo programi ranje na TRANSPUTER-SKEM SISTEMU ogledali v nadaljevanju tega Članka.
RaCunalnik DAP na Univerzi v Londonu je pro-gramljiv v jeziku DAP FORTRAN. Ta jezik jo standardni FORTRAN, ki pa je razäirjen za vektorsko procesiranje. Opis jezika DAP FORTRAN najdemo v delu Flandersa /9/ ali v originalnih priročnikih računalnika ICL DAP.
V standardnem FORTRANu je osnovna raCunska enota skalar, v DAP FORTRANu pa je le-ta lahko
10
DO 10 Ii»l,N DO 10 J=1,N
IF(A(I,J)GT.O)=SQRT(A(I,J)) CONTINUE
medtem ko je v DAP FORTRANu bistveno preprostejši»
A(A.BT.O)=SQRT(A)
zgornji program
Izraz A.6T.O predstavlja logiCno matriko z vrednostmi "1" na pozicijah, kjar Je A veCJi od O, drugod pa vrednost "O". Rezultat na desni strani izraza je matrika, kateri je dodeljena vrednost le na tisti poziciji, ki Ji ustreza v logiCni matriki na levi strani vrednost "1". Oglejmo si bolj zapleten primer, kot je raite-vanje Laplace'ove enaCbe. V tem primeru Selimo zamenjati vsako vrednost v matriki s povpreCno vrednostjo njenih ätirih najbližjih sosedov, V standardnem FORTRANu bomo zapisali tonamensko kodo takole:
DD 10 1=2,N-1 DO 10 J=2,N-l
Y(I,J)B(X(I + l,J)-fX<I-I,J)+X<I,J + l) + X(I,J-1))/4
10 CONTINUE
Pri tem je potrebno zapisati posebno kodo, ki ureja meje matrike (tj, preprečuje napake, ki bi se lahko pojavile zaradi matričnih indeksov, ki presegajo meje matrike). Kodo zgoraj lahko v DAP FORTRANu izrazimo z enim samim stavkam
X=(X(+,)+X(-,)+X(,+)+X(,-))/4
V slednjem izraru iikorifedatno sposobnost ratu-nalni)<a DAP, ki omagoCa pomično indelisiranje. 1 israzodi *(+•, ) dossüemo toCtso na Boaednji vre-ticl matrike procesnih enot (ki u&treza izrazu X(I + i,J> v Borijsiki verziji), s X(-, > pa dose-iemo toefto v predhodni vretici , itd. Potnembnu je, da vemo, da so yea totke v matriki istoCas-no alurirane. V taktlnem primeru Eetttevalne matrike ne potrebujemo veC. Meje matrike se urejajo avtomatično z vnašanjem niCel na meeta, ki jih določa geometrija. Pri ravninski geometriji vstavljamo niCle na ustrezna mesta v ravnini, medtem ko pri ciklidni geometriji zato, da dobimo cilinder, povezujemo ali sever— juSne ali VEhod-zahodne robove, - neodvisno od uporabljene smeri pomika. Lahko pa pomikamo tudi vse Stiri robove in s tem dobimo "torus".
Sedaj predpostavimo, da telimo refiiti Laplaceov problem .z iregularno oblikovano mejo (ali s pravokotnim poljem, ki je manjtle od 64 x 64). DAP FORTRAN omogoCa kreiranje loqiCne matrike, imenovane DOMAIN. V njej oznaCimo pravilne vrednosti (TRUE: logična "1") v tiste dele matrike, ki odgovarjajo področju problema in neprave vrednosti (FALSHi logiäna "O") povsod drugod v matriki. Refiitev je podana v naslednji kodi I
DO 10 1=1,LIMIT □LDX=X
X (DOMAIN) (X < + , ) +X <-, ) +X (, + ) +X ( ,-) ) /4 IF(ALL(ABS(X-OLDX).LT.EPS)> GOTO 20 10 CONTINUE
CONVERtSED». FALSE. RETURN 20 CDNVERSED-.TRUE. RETURN
S. ZAKLJUČEK
S tem smo ee na kratko seznanili z naravo računalnika ICL DAP in načinom programiranja, ki je primeren za takflen raCunalnik. Iz teh spoznanj moremo sklepati, da je učinkovitost računalnika ICL DAP za reševanje tako linearnih kot nelinearnih p.d.e, izredno velika. To dejstvo je prepričljivo potrjeno tudi s strani uporabnikov tega raCunalnika,
Drugi del prispevka bo pasveCen progremiranju TRANSPUTERSKEBA SISTEMA in opti mi zacijskcmu postopku, ki je zasnovan na matriCnem raCunu in ga je mogoäe uCini'.ovito izvajati le na paralelnem vektorskem računalniku. Omenjeni postopek (se lahko uporablja kot učinkovito orodje za implementacijo uporabniških algoritmov, ki vsebujejo veliko stopnjo inherentnoga paralelizma, na paralelnem vektorskem računalniku.
6. REFERENCE
/1/ Flynn M., IEEE Transactions on Computers, C-21,9, pp. 948 - 960, 1972.
/2/ Barlow R.H., The Neptun Processing System, Loughborough University o-f Technology.
/3/ Johnson D.,et al., Automatinc Partitioning o-f Programs in a Multiprocessor System, TeM«B Instruments, Austin, Te«as, 1979.
/4/ Sauber Ul. , A Dataflow Architecture Implementation, Texas Instruments, Austin, Texas, 1980.
/5/ da-Silva J.S.D., Woods J.V., Design of a processing subsystem for the•Manchester data-flow computer. Proceedings of the IEEE, 128,3, 1981.
/&/ Hwang K,, Briggs F.A., Computer Architecture and Parallel Processing, McSraw-Hill Book Company, 19B5.
/7/ Kogge P.M., The Architecture of Pipelined Computers, McGraw-Hill Book Company, 1901.
/8/ RobiC'B,, Bile J., Razvrstitev novogene-racijskih raCunalniSki h arhitektur. Informatica 10,4, 1986.
/9/ Hockney R.W., Jjesshope C.R. , Parallel
Computers, Adam Hilger Ltd, Bristol, 1981.
/10/ Foster C.C.,. Content Addresable Parallel
Processors, Van Nostrand Reinhold Company, 1976.
/11/ Hsiao D.K,, Advanced Data Base Machine Architectura, Prentice-Hall, Englewood Cliffs, N.Y.,1983.
/12/ INMOS Limited, OCCAM language overview, November 19B5.
/13/ INMOS Limited, Transputer architecture, November 198S.
/14/ INMOS Limited, IMS T414 Transputer, November 19B5.
/15/ Mihovi lovi dB., Mavric S., Kolbezen P.,
Transputer - osnovni gradnik veCprocesor— skih sistemov, Informatica 4/81, 1986.
VEČOPRAVILNO OKOLJE ZA DELO V REALNEM ČASU NA RAČUNALNIKU IBM-PC
INFORMATICA 2/88
UDK 681.3.014
Vido Vouk Jure Ferbežar Andrej Brodnik Institut »Jožef Stefan«
This article presentE a multi tasking environment for the MS-DOS operating system on the tBM-PC computer. Real time scheduler was developed using real time clock Interrupts to perform process scheduling. Interprocess communication is implemented using semaphores and message exchange. All the support routines for the multi-tasking real-time environment are packed in a single module which offers all the routines needed to handle process manipulation, process synchronization and interprocess data exchange. All the software is developed using Logitech Madula-2 environment.
članek opisuje osnovno okolje za pisanje veiopraviInih programov na raiunalniku IBH-PC pod operacijskim Bistemom MS-DOS. V ta namen smo razvili raspore jevalnik procesorskega iasa, ki uporablja urine prekinitve. Za msdprocssno sinhronizacijo smo uporabili semaforje in izmenjavo sporočil. Uporabniku Je na voljo modul podprogramov, ki nudi vsa potrebne podprograme za delo v veaprocesorskemu okolju. V modulu so podprogrami za ustvarjanje, poganjanje, ustavljanje in izloftanje ter podprogrami za nadzor procesov, podprogrami za komunikacijo med procesi (send, receive) in podprogrami za sinhronizacijo (wait, signal) in ustrezni dodatni podprogrami za ustvarjamJe, nadzor In izločanje semaforjev. Vsa programska oprema Je napisana v programskem jeziku moduXa-2.
1. UVOD
Računalniki ždruiljivi z IBM-PC so pri nas vedno pogoetejii in tudi niso pretirano dragi. žal pa ti računalniki ne nudijo podpore v većopraviInem okolju. Ker se zaradi nizke cene vedno več potroinikov odloča za ta tip računalnika, so razvijalci prisiljeni poiskati,	ozi-rotna izdelati	čimbolj
univerzalna orodja za izdelavo zahtevnejših aplikacij. V tem ' članku predstavljeni razporejevalnik	procesorskega	časa
predstavlja eno od takih orodij, žal zaradi omejenosti operacijskega sistema MS-DOS iz fička ni mogoče narediti ferrarija.
1.1 PROCESI
Proces lahko definiramo kot asinhrono aktivnost, naprlmer izvajanje programa na centralni procesorski enoti CPE /H0L78/. S preprostim razmislekom ■ lahko pridemo do zaključka, da je proces lahko v poljubnem trenutku opazovanja le v enem od dveh moinih stanj
IZVRŠLJIV
NEIZVRžLJIV
Opisani stanji • lahko zaradi lažjega razmišljanja razdelimo naprej na naslednja podstanja:
Privzemlmo, da imamo več procesov. Vsi procesi iz skupine IZVRŠLJIV imajo pravico do Izvajanja na poljubnem CPE. V poljubnem trenutku pa se dejansko izvaja eden (na eno procesorskem sistemu) ali N procesov {na N procesorskem sistemu). Procese, ki se izvajajo, imenujemo TRENUTNI. Vsi ostali procesi, ki imajo pravico do izvajanja, pa se ne Izvajajo zaradi poaanjkanja prostih procesorjev; so v stanju PRIPRAVLJEN.
Procesi iz skupine NEIZVRSLJIV so zaeasno ustavljeni in nimajo pravice do procesorja, čeprav bi kak procesor bil prost ali celo brez dela. Ti procesi so ustavljeni in čakajo na nek zunanji dogodek, ki jih zbudi in jih prestavi v skupino IZVRŠLJIVIH procesov. Dogodki, ki jih zbudijo in prestavijo v drugo skupino, so različni glede na stanje v katerem so zaustavljeni.
Slika 1.1 prikazuje moina stanja procesov in .dogodke, ki vplivajo na prehode med stanji. K sliki 1.1 moramo dodati tudi kratek komentar. Iz slike Je razvidno, da klic podprograma CREATE ustvari nov proces In ga postavi v stanje PREKINJEN, Vpraianje, ki se pojavi je, kaj se zgodi s procesom, ki ga želimo odstraniti iz našega okolja. Klic KILL odstrani proces ne glede na to kje se je nahajal pred klicem. Proces nepreklicno izgine iz okolja..Pred odstranitvijo sprosti vse zasedene zmogljivosti v sistemu.
IZVRŠLJIV
NEIZVRSLJIV
TRENUTNI PRIPRAVLJEN
PREKINJEN
ĆAKAJOČ
SPEC
SPREJEMAJOČ
1.2 SINKRONIZACIJA
Za sinhronizacijo procesov v večprocesornih okoljih uporabljamo različne tehnike. Dokaiemo lahko, da so vse tehnike funkcionalno enakovredne /FILe4/, vendar so v
prekine, da dodeli procesor drugemu procesu. Vsakemu procesu dolotisna nek določen ias, ko lahko teče, ne da bi ga prekinili. Fo Izteku tega £asa damo v izvajanje na tem procesorju nek drug proces,
2. IZVEDBft
2.1 programsko okolje
Razporejevalnik £asa smo razvili za IBM-pc zdruiljlv ratunalnik. Za jiiodulo-2 smo se odloiill, ker Je vitji programski jezik ki Ima vse potrebne konstrukte za strojnem nivoju in obenem osnovno ža delo z veft procesi. Tako je in vzdrievanje programske opreme preprosto. Vsa programska oprema
/hir85/, delo na podporo razvoj relativna
razporejevalnika je napisana v Logitech Moduli-2/e6 pod operacijskim sistemom MS-DOS 3.20.
Slika 1.1
Stanja procesov in prehodi med stanji
razliinih Izvedbah razliino primerne. Razporejevalnlk opisan v tem ilanku za sinhronizacijo uporablja semaforje /DIJ75/ in izmenjavo sporoiil /JONS?/.
1.2.1 SEMAFORJI
Kot sta pokazala /DIJ7S/ in /HAN73/ lahko vse probleme sinhronizacije reiimo s P In V semaforjema. V nafil izvedbi razporeJevalnika smo uporabili sploäne (itevne) semaforje. Torej je binarni semafor v nafiem razporejeva.lniku le posebna oblika sploinega semafor ja.
1.2.2 IZHENJftVA SPOROČIL
Izmenjava sprožil (message exchange) je poseben mehanizem, kl omògoia procesu da prenese pripravljena sporočilo drugemu procesu. Izmenjava sporOAil je obenem poseben mehanizem sinhronizacije.. Glavna razlika med semaforji In izmenjavo sporočil Je v tem, da mora biti za vsak klic HAIT(sem) ustrezen klic SIGNAL(seir«) , v p.rimeru izmenjave sporočil pa to ni nujno. Izmenjava sporofiil je za sinhronizacijo preprostejša posebno kadar proces ne ve vnaprej koliko sporoiil bo dobil in kateri procesi jih bodo poslali. Ta nasln sinhronizacije je uporabljen v CSP /H0A8S/, modull-2 /HIRSB/ in v programskem jeziku OCCRM /JON87/. Dokažmo lahko, da je moino Izvesti izmenjavo sporoall samo s semaforji in obratno /FILBi/.
1,3 RAZPOREJANJE PROCESOV
Maia nadaljna razlaga bo zadevala enoprocesorske sisteme. Posploiitev na vetprocesorske sisteme Je oütna. Iluzijo vzporednega izvajanja već procesov na eno ali veiprocesorskemu sistemu lahko doseiemo s preklaplJanJein . procesorja (procesorjev) med več procesi.
4e hoaemo zagotoviti, da bodo procesi enakopravno izkoriščali dane procesorske zmogljivosti v sistemu, moramo uvesti razsodnika, ki odiosa, kdaj bo kateri od procesov zaiel z izvajanjem, kdaj bo izvajal svojo kodo In kdaj bo prenehal z delom, da bi prepustil procesor naslednjemu uporabniku. Ta razsodnik Je razporeJevalnik procesorskega £asa. Razporejevalnik odloCa kateri proces lahko zasede procesor in ga potem tudi lahko
2.2 UVOD
RazporeJevalnik Je zasnovan na primeru razporejevalnika za RT-11 /BR087/. Za razliko podobnih razporeJevalnikov, ki delujejo slnhrono /COL87/, naS razporejevalnik omogoča delo v realnem času. Informacijo o pretetenem času dobi od prekinitev, ki jih generira ura. Urine prekinitve so relativno pogoste, če bi razporejevalnik ob vsaki urini prekinitvi pregledal vse strukture; bi bil odziv sicer izjemno dober, vendar bi bila učinkovitost takega sistema zelo majhna, saj bi se tak sistem večino časa ukvarjal sam s seboj. Tako ob vsaki urini prekinitvi razporejevalnik pregleda 'le nekatere strukture, medtem ko splošni pregled stanja procesov naredi le na določeno žtevllo urinih	prekinitev.-
Uglaževanje izvedemo eksperimentalno. Razporejevalnik odloča, kateri od procesov bo dobil pravico do izvajanja svoje kode. Razporejevalnik ga tudi prekine med izvajanjem in to tako, da se proces tega ne zaveda. Ra2pore jevaInlk shrani vse potrebne podatke o procesu v lokalni■pomniInlk, tako da lahko v poljubnem trenutku ponovno zaiene prekinjeni proces. Proces nadaljuje z Izvajanjem v točki v kateri je bil prekinjen.
Med delom smo naleteli na več teiav. Največjo -težavo je predstavljala sama zasnova operacijskega sistema. MS-DOS 3.20 Je eno uporabniški eno opravilni operaciJski ,sistem, tako da nobeden od sistemskih klicev ni prekinljiv. To težavo smo poskueall zaobiti z uporabo	nedokumetlrane	lastnosti
operacijskega sistema /LOG85/, ki s posebno zastavico označi kdaj je v kritičnem odseku. Vendar se ta reiitev med izdatnim testiranjem ni izkazala kot dovolj zanesljiva, ker tudi nekateri podprogrami v moduli-2 niso prekinljlvl. Ta problem smo reäill tako, da smo celotni klic DOSa označili kot kritični odsek.	^
Celoten sistem je sestavljen iz treh funkcionalno Idčenih modulov:	jedro,
semaforji in izmenjava sporočil. Jedro omogoča delo nad procesi (ustvarjanje, prekinitev procesa), modul semaforjev omogoča delo s semaforji (ustvarjanje, SIGNAL, WAIT) modul za izmenjavo sporočil pa omogoča klice SEMD In RECEIVE. Ker Je na« sistem namenjen tudi uporabi v povsem realnih aplikacijah, so vsi trije moduli skriti v oklepajoče» modulu, ki zunanjemu uporabniku onemogoča dostop do kritičnih ukaznih in podatkovnih struktur. Iz modula navzven so izneiene le funkcije, ki so potrebne za učinkovito uporabo sistema. Uporabnik nima nobene motnosti, da bi z
nepravilno ali nepazljiva uporaba porušil konsistentnost sistema. V dodatku A je podan definicijski modul na£ega ra z po ire je valn ika, iz katerega je razvidno, katere podatkovne in ukazne strukture ter klici so uporabniku na voljo.
2.3 PODATKOVNE STRUKTURE .
Za uelnkovitej«e delo razporejevalnika smo morali dobr&en del prekinitvenih podprogramov zamenjati s svojimi novo napisanimi, ki uporabljajo konstrukte iz razporejevalnika. Podatkovne struktura Izven modula niso vidne /BR087/. Osnovne enote, s katerimi uporabnik lahko upravlja, so procesi, semaforji in sporo£lla. Z njimi operira preko klicev podprogramov in tako vpliva na tek procesov. Vsi podprogrami so monitorji /HAN75/.
2.3.1 PROCESI
Osnovna enota v sistemu je proces. Proces je v moduli-2 izvajanje podprograma brez parametrov /HIRBB/ in /LOGfle/. V naiem sistemu smo definicijo razširili z naslednjimi parametri i
-	ime
-	interno ime
-	življenjski prostor
-	prioriteta
Ime je sestavljeno iz niza alfanumerifnih znakov. Namenjeno Je le uporabniku. Ob zahtevi za ustvaritev procesa jedro preveri pravilnost parametrov in ustvari proces. Priredi mu interno ime, ki ga vrne uporabniku. Vsi nadaljni klici za delo z ustvarjenim procesom uporabljajo le interno ime. življenski prostor opredeljuje velikost procesu prirejenega delovnega pomnilnika /L0GB6/. Prioriteta je pomemben parameter, ki opisuje nujnost izvajanja danega procesa. Veaja vrednost predstavlja vasjo prloritetq. V osnovni izvedbi našega sistema velja, da se vedno izvaja tisti proces, ki ima najvitìjo prioriteto. Kadar je takih procesov več, si ti procesi enakopravna delija procesar med
i zvedbe naćine
Naslednje drugačne
seboj (round robin), sistema	omogo£ajo
razporejanja /VKSe2/.
Vsak proces opredeljuje spremenljivka stanja (prim, slika 1.1)< Kot smo omenili, se lahko proces v danem trenutku nahaja le v natanko enem stanju. Kratek opis stanj:
TRENUTNI (CURRENT): Ker Je IBH-PC enoproceaorski sistem je lahko v poljubnem trenutku le en TRENUTNI proces. Ta proces Izvaja svojo kodo. će sam ne kliče nobenega od podprogramov, ki bi mu lahko spremenil status (WAIT, RECEIVE, SLEEP, SUSPEND), ga prekine razporejevalnik ko potete njegov interval easa, ali pa is se prebudi kak proces z vi«jo prioriteto. Kadar je ta proces edini s tako prioriteto, bo ponovno dobival pravico do izvajanja po en interval, dokler ne bo končal ali pa zamenjal statusa.
PRIPRAVLJEN (BEADY)! V tem stanju so vsi procesi, ki imajo vse pogoje za izvajanje in čakajo le na prost procesor.
PREKINJEN (SUSPENDED): Po klicu za ustvaritev razporejevalnik ustvari proces in ga postavi med NEIZVR4LJIVE procese. Proces ostane v tem stanju, dokler ga ne obudi eden od TRENUTNIH procesov. Razporejevalnik lahko v to stanje postavi tudi poljuben PRIPRAVLJEN proces, Se to od njega zahteva kateri izmed procesov.
ČAKAJOČ (WAITING): Razporejevalnik poetavi poljuben TRENUTNI proces po klicu WAIT v to stanje, te je vrednost semaforja ob klicu nič. Proces ostane v tem stanju dokler ga s klicem SIGNAL ne obudi eden izmed TRENUTNIH procesov.
SPSĆ (SLEEPINC5): Proces postavi v to stanje razporejevalnik po klicu SLEEP. Iz tega stanja ga spet obudi razporejevalnik po izteku zahtevanega časovnega intervala.
SPREJEMAJOČ (RECEIVING): Po klicu RECEIVE proces preide v to stanje, ie zanj èe ni prispelo nobeno sporočilo in ostane v tem stanju dokler ne pride sporočilo od enega izmed TRENUTNIH procesov.
2.3.2 SEMAFORJI
Semaforji v nažem sistemu predstavljajo □snovni princip sinhronizacije. Uporaba in delovanje so obAirno opisani v literaturi (npr.	/HAN73/). V	nasi	izvedbi
razporejevalnika	uporabljamo	sploine
semaforje. Tip semaforja določimo s klicem podprograma za ustvaritev semaforja. Vsak klic SIGNAL (P) poveča vrednost semaforja za 1 če je vrsta «akajočih na ta semafor prazna, sicer pa za vsak SIGNAL obudi enega od čakajočih.
Po vsakem klicu WAIT (V) razporejevalnik pogleda ali je ètevec semaforja večji od nič. če ni, uvrsti klicajoči proces v vrsto čakajočih na dani semafor, sicer ga pusti naprej v izvajanje in popravi itevec semaforjev. Princip čakanja je po sistemu FIFO /WIR75/.
2.3.3 IZMENJAVA SPOROČIL
Razporejevalnik poleg semaforjev omogoča tudi izmenjavo sporočil. čeprav lahko naredi izmenjavo sporočil uporabnik sam s semaforji, smo se odločili, da jo izvedemo äe posebej, tako da zmanjšamo dodatno delo na najmanjšo možno mero.
Pošiljanje sporočil je izvedeno tako, da proces, ki pošilja sporofillo, ne preneha z izvajanjem tudi kadar ne more oddati sporočila. če sporočila ni mogoče oddati, dobi o tem ustrezen odgovor, vendar se njegovo stanje (TEKOČI) zaradi tega ne spremen i.
če Je proces te dobil kako sporočilo, pa ga še vedno ni prevzel, so vsa nadaljnja požiljanja temu procesu neveljavna vse dokler ga ne prevzame. Na ta način preprečimo, da bi se sporočila kopičila in porabila ves razpoložljivi prostor. Osnovna izvedba razporejevalnika omogoča poiiljanje naslova. Ker mara biti prostor za sporočilo izven prostora pošiljatelja (podiranje sklada), je prostor	za	sporočilo	v	jedru
razporejevalnika.
2.4 SPEČI PROCESI
Včasih Je potrebno, da proces sinhrono odstopi procesor za točna določen časovni interval. V ta namen je na razpolago poseben klic, ki postavi proces v vrsto spečih procesov.	Za take procese	je v
razporejevalniku izdelana posebna struktura, ki procese razvrsti po naraščajočem času čakanja. časi čakanja so podani relativno glede na predhodnika. Tako razporejevalnik ob vsaki urini prekinitvi pogleda le prvi proces v vrsti in ga prebudi, če je njegov čas spanja ie potekel.
spremeni 1 i tako da

2.5 STRATEGIJA RAZPOREJANJA
V osnovni izvedbi razporejevalnika je strategija razporejanja zelo preprosta. Procesi se med sabo lahko razlikujejo po priciteti. Izvaja se le en proces in to proces z najvišjo prioriteto. Kadar je teh procesov vea, se med seboj izitien ju je jo in si enakovredno delijo procesorski ±as. Procesi z nižjo prioriteto iakajo na izvajanje vse dokler vsi procesi z viijo prioriteto ne preidejo v eno izmed stanj skupine NEIZVRšLJIV ali ne zapustijo sistema (klic KILL). Vsak proces ima pravico do neprekinjene uporabe procesorja N urinih prekinitev (fie sam pred tem ne kliie kakega od podprogramov, ki mu spremeni stanje ali pa se ne zbudi kak proces z vižjo prioriteto). Tak algoritem razporejanja se Je izkazal kot zadovoljiv pri aplikacijah v realnem tasu, pri interaktivnem delu pa se ni izkazal ravno najbolje. Zato smo algoritem uvedbo dinamitne prioritete, podoben razporejevalniku na VAX VMS /VMS82/. Po • končanem čakanju na zmogljivosti razporejevalnik	dvigne	prioriteto
interaktivnega programa za določeno vrednost (to vrednost določimo z uglaievanJem). Ob vsakem poskusu prerazporeditve procesov se prioriteta tega procesa spusti, dokler ne pride do svoje začetne (osnovne) prioritete. Računski procesi (compute bound) so ves čas na isti (osnovni) prioriteti.
3. UPORABA
Razporejevalnik smo temeljito testirali in tudi uporabili v realnih aplikacijah. Izkazalo se je, da Je razporejevalnik dovolj robusten in univerzalen za uporabo na različnih področjih.
3.1 PROBLEM LAČNIH FILOZOFOV
Problem lačnih filozofov je prvia predstavil Hoare /H0A85/. Gre za omejeno itevllo zmogljivosti- Reiitev, ki Je predstavljena v dodatku B, preprečuje smrtni objem, vendar ne preprečuje stradanja. Kot Je pokazalo testiranje, do stradanja ni prlžlo, če smo opazovali sistem dovolj dolgo. Predstavljena reiitev uporablja monitor za zaščito kritičnega področja.
procesorskega časa (scheduling policy).
Razen tega nameravamo razširiti sistem z modulom, ki bo uvedel dodatne konstrukte za laije paralelno programiranje (COBEGIN In COEND, REGION... kot na primer CC-Modula /C0L87/ in jih priporoča /BS187/) in nadaljevati delo s paralelnimi algoritmi.
5. ZAKLJUČEK
Predstavljeni sistem se je izkazal i<ot dobro programsko orodje za delo v večopraviInem okolju tako na področju poskusov s paralelnimi algoritmi kot na področju aplikacij za končne uporabnike. Sistem Je dovolj robusten in kompakten tudi za teike pogoje dela v industriji.
LITERATURA /HAN73/
/DIJ7S/ /HIR76/
Hansen,P.B.i Operating Systems Principles, Prentice Hall, 1973
DlJkstra,E,W. ! Guarded	Commands,
Nondeterminancy and	Formal
Derivation of Programms,	Comm.ACM, v.18 n.3, 1975
Wlrth N.: Algorithms + Data Structure = Programs, Prentice Hall, 1976
/HOL78/ Holt R.C., G.S.Graham,
E.D.Lasowska, M.A.Scott ! Structured Concurrent Programming with Operating Systems Applications; Addison Wessley, 1978
/VMSB2/ Digital Equipment Corporation! VAX Software Handbook, DEC, 1982
/FIL84/ Filman,R.E., Friedman D.P.:
Coordinated computing: Tools and Techniques for Distributed Software, HcGraw Hill, 1984
3.2 DELOVNE POSTAJE
Razporejevalnik smo pr aplikaciji. Gre za r IBM-PC, na katerega delovnih postaj, ki podatke	po	kom
računalniku, ki Jih mo disk in narediti obde ter jo izpisati na posameznih postaj so Izkazalo se je, da je orodje, ki omogoča na način programiranje a procesi ranje.
eizkusill tudi v realni ačunalnik zdru.tljiv ■ z Je priključenih ves asinhrono pošiljajo unikacljskem	kanalu
ra urediti, zapisati na lavo zbranih podatkov zaslon. Aktivnosti med seboj neodvisne, razporejevalnik dobro preprost in pregleden plikacij za paralelno
i. MADALJNJE DELO
Razporejevalnik bomo preizkusili nekaterih problemih procesiranja v času. Poiskali bomo najboljšo časovnega intervala neprekinjenega
že na realnem doliino izvajanja
procesa v okolju, ki zahteva izjemno hiter odziv in obenem hitro obdelavo podatkov. Poleg tega načrtujemo tudi primerjalno analizo različnih strategij razporejanja
/COM84/ Comer,Douglas ; Operating System Design, the Xlnu Approach; Prentice Hall, 1,984
/WIRB5/ Wirth,N.: Programming in Modula-2, Springer■Verlag, 1985
/HOA85/ Hoare,C. A.R. : Communicating Sequential Processes, Prentice Hall, 1985
/LOG95/ Logitech! Hodula-2/B6, ■ Logitech, 1986
/JOH87/ Jones,G.: Programming In Occam, Prentice Hall, 1987
/BSI87/ BSI Modula-2 Working Group;
Standard Concurrent Programming Facilities, N 116 Issue 3, 1907
/BR087/ Brodnik A.; 'Programiranje z modulo-2, Informatica, 1967
/C0LS7/ Collado M.t A MQdula-2 Implementation Of CSP, ACM Sigplan Notices, Vol. 22, N6, June 1987
DODATEK A (DEFINITON MODULE PROCESS)
DEFINITIOM MODULE PROCESS;
FROM SYSTEM IMPORT ADDRESS;
EXPORT QUALIFIED
tProcState,	(* tip možnih stanj procesa *)
PCREATE, PRESOME, PKILL, PSÜSPEND, PSETPRIO, PGETPID, PGETSTAT, PGETCURR, PSLEEP, PRECEIVE, PSEKD, tPid, tPrio, tSemaphare, cProcNameLen,
tProcName, tProcStatus, tStatBlock,
SCREATE, SINIT, SDELETE, SWAIT, SSIGNAL, SSTATUS, SGETSID, cSenNameLen, tSenName, tSemStatus,
cPrlol, cPrloS, cPrio3, cPrioJ, cPrloS, cPrio6, cprio7;
CONST
cSemNaneLen = 20;
cProcNameLen = 20;	{* Procast! name lenghth «) TYPE
tProcState = (PrCurr,	(» Procees Current «)
PrReady,	(* Process Ready *)
PcSusp,	(* Process Suspended *)
PrWait,	(• Procese Wait »)
PrSleep,	(* Process Sleeping •)
PcRec,	(* Process Receiving *)
PrFree	(* Procees Slot in Table not Used *) );
tProcName = ARRAY [0..cProcNameLen-1] OF CHAR; tProcStatus = (NOTOK,OK); tStatBlock = RECORD (* undeveloped *) status : CARDINAL
END;
tPrio;	(• process prioritir - type *)
tPid;	(« process - type *)
tSenaphore;	<* semaphore - type ')
tSeiuName = ARRAY [0 . • cSeoNameLen-1 J OF CHAR; tSewStatus = (SNOTOK, SOK);
VAR
cPriol, cPrlo2, cPrioS, cPriođ, cPrioS, cPrio6, cPrio? : tPrio;
PROCEDORE PCREATE (process :PROC; anvsize; CARDINAL ; nameiARRAY OF CHAR;
VAR pid : tPidf prio! tPrlo) t tProcStatus; (» Creates a new process with PrSusp status »)
PROCEDORE PRESUME (pid : tPid) : tProcStatus;
{» Resumes a suspended process (PrSusp --> PrReady) *)
PROCEDURE PKILL (pid : tPld) J tProcStatus;
C* Deletes a process and releases its resources (PrFree) *■ )
PROCEDURE PSUSPEND (pid : tPid) : tProcStatus;
(» Suspends a process (PrReady or PrCurr --> PrSusp) •)
PROCEDURE PSETPRIO (pid ; tPid; prio : tPrio) : tProcStatus; (* Change process priority *)
PROCEDORE PGETPID (namerARRAY OF CHAR; VAR pid : tPid) : tProcStatus; (* returns process id (Pid) of process NAME •)
PROCEDURE PGETSTAT (pid : tPid; VAR statb1ock:tStatBlock) : tProcStatus; (» returns process status informaton • )
PROCEDORE PGETCURR () : tPld;
(• returns process own process id * )
PROCEDURE PSLEEP (time : CARDINAL):tProcStatus; (-• Sleep for time ticks (PrCurr --> PrSleep) *)
PROCEDURE PRECEIVE (VAR what : ADDRESS ; VAR whojtPid): tProcStatus;
(" Receive a message from anybody (conditional PrCurr --> PrRec) »)
PROCEDURE PSEND (whom :tPi d; what i ADDRESS) : tProcStatus; (* Send a message to a process ')
PROCEDURE SCREATE {name :tSemName; VAR sem :tSemaphore; count : CARDIMAL): tSexnStatus;
(* Create a new sempahore and return its ID in satn »)
PROCEDURE SINIT {sere :tSeroaphore; count ; CARDINAL) : tSemStatus; (* Init a Semaphoce to initial value count »)
PROCEDURE SDELETE (sem itSemaphore) : tSemStatus; (•Delete a eemaphore •)
PROCEDURE SWAIT (sem :tSemaphore ) : tSemStatus; (« Walt far a semaphore (PrCurr --> PrWait) *)
PROCEDURE SSIGNAL (semitSemaphore) : tSemStatus; (* Signal on semaphore sem «)
PROCEDURE SSTATUS- (sem : tSemaphore) : tSemStatus; (* Get Semaphore status information *.)
PROCEDURE SGETSID (name : tSemName; VAR sem :tSemaphore; count : CARDINAL) : tSemStatus; {* Get the ID of semaphore name *)
END PROCESS.
DODATEK B (DINING PHILOSOPHERS) MODULE SPAGHETTI;
FROM PROCESS IMPORT PCREATE, PRESUME, PSLEEP, PSUSPEND, PGETCURR, PRECEIVE, PSEND, PKILL, PSETPRIO, tPld., tProcStatus, cPriol, cPrio2, Port, SCREATE, SWAIT, SSIGNAL, SDELETE, tSemName, tSemStatus, tSemaphore, CURSOR, WRITEI, WRITE; FROM InOut IMPORT WrlteStrlng, WriteCard, WriteLn, HriteHe*; FROM Break IMPORT EnableBreak; FROM Strings IMPORT Concat;
FROM SYSTEM IMPORT GETREG, ADDRESS, WORD, BYTE, INBYTE, OUTBYTE, CODE; FROM Keyboard IMPORT KeyPressed, Read;
FROH Devices IMPORT SavelnterruptVector, RestorelnterruptVeetor; FROM HyRandom IMPORT Random;
TYPE
tPhil = [0...4];
VAR
pidl ,pld2,pld3,pid-l ,pid5 : tPid; procstat, status! tProcStatus; chn,I,j! INTEGER; w: WORD;
konec : BOOLEAN; ch! CHAR; vec! ADDRESS;
MODULE FORKS (7];
IMPORT tSemaphore, tPhU, tProcStatus, tSemStatus, SWAIT, SCHEATE, SSIGNAL,
tSemNarae, Concat, PSLEEP; EXPORT PICKUP, PUTDOWN; TYPE
tNumOfFork = [0..2]; VAR .
numOfForks : ARRAY tPhil OF tNumOfFork; ready : ARRAY tPhll OF tSemaphore; semstat i tSeidStatus; i : INTEGER; tmp : tSemNaroe;
PROCEDURE PICKUP (phil : tPhil); VAR
left, right : tPhll; BEGIN
IF (nuniOfForks[phil ] < 2) THEN
semstat != SWAIT (ready[phi 1]) END;
right := phil; left := (phil + 1) MOD 5; OEC(numOfForks[left] ); DEC(numOfForks[right ]) ; END PICKUP;
PROCEDURE PUTDOHN (phil : tPhll); VRR
left, right : tPhll;
BEGIN
right J= phil;
left := (phil + 1) MOD 5;
INC{nuroOfForks[ieft}) ;
INC(nuiiiOfForkB[right] ) ;
IF nuniOfForks[left] = 2 THEN
semstat := SSIGNAL (ready[1 eft ]) ; END;
IF numOfForkeCrlght] = 2 THEN
Eemstat i= SSIGNAL (ready[right]); END; END PÜTDOWN;
BEGIN (• FORKS FOR i ! = 0 TO J DO CASE i OF
0	: titip s= 'Bacon' |
1	: tmp 'Sokrates' I
2	J tmp J= 'Aristoteles' 1
3	: tinp != -Nletsche' I
4	; trop := 'Hegel' END (• case *);
semstat := SCREATE (tmp, readyti], 0);
numOfForkeCi] i= 2;	forks initialization » )
END; END FORKS;
PROCEDURE First; VAR
timesEaten : CARDINAL; BEGIN
timesEaten 0; LOOP
PICKUP(O);
HriteString ('Bacon eating '); HriteCardttimesEaten, 6); WriteLn;
INC(timesEaten); PUTDOMN(O);
proCBtat iE PSLEEP {Random(10) ); END; END First;
PROCEDURE Second; VAR
timesEaten : CARDINAL; BEGIN
timesEaten 1; LOOP
PICKUP(l);
WriteString ('Sokrates eating '); HriteCard{timeBEaten, 5); WriteLn;
INC(tinieeEaten) ; PUTDOWN(l);
procstat s= PSLEEP (Random(lO) ); END; END Second;
PROCEDURE Third; VAR
timesEaten : CARDINAL; BEGIN
timesEaten 1; LOOP
PICKUP(2);
WriteString ('Aristoteles eating ');
HriteCard(tir[ieeEaten, 5);
WriteLn;
INC(timesEaten};
PUTD0WN(2);
procstat := PSLEEP (Random(lO) ); END; END Third;
PROCEDURE Fourth; VAR
timesEaten i CARDINAL; BEGIN
timesEaten 1; LOOP
PICKUP(3);
HriteString ('Nletsche eating '); WclteCard(timesEaten, 5); HriteLn;
INC{tiineBEaten); PÜTbOWN(3);
procBtat.:= PSLEEP (Randoin( 10) ); END; END Fourth;
PROCEDURE Fifth; VAR
timesEaten : CARDINAL; BEGIN
tliaesEaten 1; LOOP
PIĆKUP(4);
WriteString ('Kegel eating '); WriteCard(tiinesEaten, 5); WriteLn;
INC(titnesEaten); PUTDOWN(J);
proeetat := PSLEEP (Random(lO) ); END; END Fifth;
BEGIN
EnableBreak;
status 1= PSETPRIO(PGETCURR(),cPrlo2); status i= PCREATE(First,aOOH,'Bacon',pidl,cPriol); status != PCREATE(Second,800H,•Sokrates•,pld2,cFrlol); status := PCREATE(Thlrd,800H,'Aristoteles-,pid3,cPrlol); status }= PCREATE(Fourth,800H,'Nietsche',pldi,cPrial); status != PCREATE(Flfth,800H,'Hegel',pid5,cPriol); status PRESUHE(pidl); status PRESUHE(pld2); status := PRESUME(pid3); status := PRESUME(pld4); status := PRESÜME(pld5); konec . := FALSE; LOOP Port;
status i= PSLEEP{18); IF Ke)'Pressed( ) THEM
EXIT; END;
END; (* LOOP *) status := PKILL{PGETCURR( ) ); WrlteString('END - Philosophers'); END SPAGHETTI .
TERMINOLOGIJA PROGRAMIRANJA V PROLOGU	INFORMATICA 2/88
TERMINOLOGY OF PROLOG PROGRAMMING
Igor Kononenko Fakulteta za elektrotehniko
Nada Lavrač
UDK 681.3.06. Protog: 001.4	Inštitut Jožef Stefan
Prolog je novejši programski Jezik, ki ss vse bolj uveljavlja tudi pri nas. Zaradi velikih razlik v terminologiji tako v svetu kot pri nas je potrebno poenotiti terminologijo, ki se uporablja v logičnem programiranju ter programiranju v prologu. Osnova aa delo sta slovarja v (Kononenko 87) in (Kononenko, Lavraö 87). V prispevku smo podali razHke med terminologijama matematične logike in logienega programiranja ter med prologom in ostalimi programskimi jeziki. Poskusili smo odpraviti tudi nekatere terminološke nesporazume.	Podani so tudi
prevodi nekaterih izrazov iz angleSöine v slovenSöino, za katere smo se zedinili na Fakulteti za elektrotehniko in Institutu Jo2ef Stefah v Ljubljani.
ABSTRACT
Lately, Prolog became a very popular language in the computer programming community. As there is a lot of diversity in the terminology we point out some of the main problems and try to eliminate them by defining the most useful terms. The basis for this paper is a glossary in (Kononenko & Lavraö 87). We present differences between mathematical logic and logic programming, between Prolog and other programming languages and show how these differences affect the terminology. He also point out some misunderstandings regarding the usual terminology and mention some synunyois used in the field of logic programming.
1. RAZLIKE MED MATEMATIČNO LOGIKO IN LOGIČNIM PROGRAMIRANJEM
1, MATHEMATICAL LOGIC AND LOGIC PROGRAMMING
Prolog (Clooksio, Melllsh 81) je najbolj
razširjen jezik logičnega programiranja. Ime	Prolog (Clockain & Hellish 81) is the most
"PROLOG" jo dejansko kratica za 'PROgramming in	widespread language of the logie programming,
LOGIC'. Ker prolog temelji na matematični	PROLOG is In fact an abbreviation for
logiki, je iz nje privzeta tudi terminologija,	PROgramming in LOGio. As it is based on
kar je v določenih primerih pripeljalo do	mathematical logic the similar terminology is
dvoumnosti.	adopted which in some oases leads to
ambiguities.
Osnovna, razlika msd stavki v logiki in stavki v prologu 09 ta, da imajo stavki v prologu tudi proceduralen in ne le deklarativen pomen. Pogosto se uporablja iaraz 'predikat' kot sinonim za 'proceduro'. Procedura je del programa <množica stavkov), ki definira predikat as določenim Številom argumentov. Definicija je kljub deklarativnemu 2naöaju proceduralna, zato je izraz 'predikat' zavajajoč. V logičnem programiranju zadostuje uporaba izrazov 'procedura', 'vgrajena procedura' ipd. namesto izrazov 'predikat', 'vgrajeni predikat', ipd.
Pogosto ee v literaturi mesata izraza 'prilagajanje' in 'unifikacija'. Prilagajanje v prologu je proceduralna implementacija unifikacije iz matematične logike, ki ponavadi zaradi učinkovitosti deluje drugače kot unifikacija. Neopredeljena spremenljivka se lahko prilagodi z izrazom, v katerem sama nastopa, ne more pa se z njim unificirati.
Preglavice povzročajo tudi izrazi 'atom', 'literal' in.'cilj'. Atom je v matematični logiki kratica za atomarno formulo, medtem ko v prologu pomeni vrsto preprostih nedeljivih podatkovnih konstruktov, ki so konstante različne od Števil. Tako ustreza atom v prologu kostanti v matematični logiki. Atom v matematični logiki ustreza strukturi v prologu, ki predstavlja klio procedure, öe se nahaja v telesu stavka (v pogojnem delu stavka), ali pa sklep, Ö® se nahaja v glavi stavka (v sklepnem delu stavka).
Kliou procedure v telesu prologovega stavka pravimo cilj. Nekateri avtorji (npr. Sterling, Shapiro 86) pravijo glavi stavka tudi cilj. Sintaktično so glava stavka in cilji v telesu stavka pozitivni (nsnegirani) literali. Literal je sintaktični konstrukt sestavljen iz imena predikata in seznama argumentov v oklepaju, ločenih z vejicami.
Funktor v matematični logiki doloCa.funkcijo, medtem ko v prologu funkcij ni. Funktor v prologu omogoča konstrukcijo struktur z imenom in mestnostjo (Številom argumentov) danega funktorja. Strukture so podatkovni objekti v prologu in nimajo veliko skupnega s funkcijami v matematiki.
Nerodno je, da sta se pri programiranju v
The main difference between logical statements and Prolog clauses is that Prolog clauses have also the procedural meaning vhile logical statements have only the declarative meaning. The term 'predicate' is often used as a synonym for a 'procedure'. A procedure is a part of a program (a sequence of clauses) that defines a predicate with the given name and arity. Although the definition may be viewed declaratively it is in fact procedural (the order of clauses and the order of goals in clauses are important). The term 'predicate' is therefore misleading. For programming purposes it is thus mora convenient to use terms 'procedure', 'built-in procedure', etc, instead of 'predicate', 'built-in predicate', etc..
In the literature is often m^e no explicit distinction between 'unification' and 'matching'. Matching in Prolog is a procedural implementation of the unification from mathematical logic and usualy differs from it (for the efficiency reasons). It allows an uninstantiated variable to be matched with a term in which it itself appears while it cannot be unified with such a term (this problem is usually referred to as the 'occurs check' problem).
There are difficulties with terms 'atom', 'literal' and 'goal'. In mathematical logic an atom is an abbreviation for 'atomic formula' while in Prolog an atom is a simple data structure, i.e. a constant that is not a number. Therefore an atom in Prolog corresponds to a constant in mathematical logic. An atom in mathematical logic corresponds to a Prolog structure representing a procedure call if it appears in a body of a clause (a condition part) or a conclusion if it appears in the head of a clause (a conclusion part).
A procedure call in a body of a clause is usually called a goal. Some authors (e.g. Sterling & Shapiro 86) call the head of a clause also a goal. Syntactically the bead of a clause and goals in the body of a clause are positive (nonnsgatsd) literals. A literal, is constructed from a predicate name and a list of arguments enclosed in brackets and separated by commas. A positive literal is a synonym for atomic formula. It is better to use a term 'literal' because of the previously mentioned definition of an atom in Prolog.
prologu v angleščini udomačila izraza 'bound variable' in 'free variable' za spremenljivko, ki ima OBiroma nima vrednosti. V matomatiCni logiki se namreö ta dva izraza uporabljata za kvantificirano (vezano) in nekvantificirano (nevezano) spremenljivko. Zato je bolje, da se namesto 'bound' in 'free' uporabljata izraaa 'instantiated' in 'uninstantiated'. Moani prevodi so instancirana (neinstancirana), prilagojena (neprilagojena) ali opredeljena (neopredeljena). Zadnji prevod je najustreznejši, saj se vrednost spremenljivke v prologu lahko med izvajanjem bolj ali manj opredeli.
2. EAZLIKE MED PROLOGOM IN OSTALIMI PROGRAMSKIMI JEZIKI
In mathematical logic a functor determines a function while in Prolog are no functions. Functors in Prolog are used to construct compound data structures with the given name and arity. Structures in Prolog have not much in common with mathematical functions.
It is awkward that the term 'bound variable' is used for a variable taht has a value and 'free variable" for a variable that doesn't have a value. In mathematical logic bound variable represents a quantified and free variable represents an unquantified variable. In Prolog all variables are universally quantified (except in questions). Therefore it is better to use terras 'instantiated' and 'uninstantiated variable'.
Za razliko od ostalih razširjenih programskih jezikov, ki imajo proceduralni znaöaj, je prolog je deklarativni jezik. Proceduralni pomen programa v prologu je definiran z načinom izvajanja prologovega interpreterja. Vsekakor je v angleščini pravilneje uporabljati izraz 'execution' za izvajanje prologovega programa namesto ustaljenega izraza 'computation' (raöunanje), saj je izvajanje prologovega programa dejansko sklepanje (inference) na osnovi pravil in dejstev. Zato računalnikom vse pogosteje pravijo tudi 'inference machine' (stroj za sklepanje) namesto 'computer'.
2. PROLOG AKD OTHER PROGRAMMING LANGUAGES
Prolog is a declarative language and it largely differs from other popular programming languages which are procedural. The procedural meaning of a Prolog program is defined with the way how it is executed by the Prolog interpreter. It is better to use the term 'execution' than 'computation'. Execution of a Prolog program can be viewed as inference and a computer can be also called an inference machine.
Med izvajanjem prologovega programa spremenljivke dobijo svojo vrednost s procesom prilagajanja. Pravimo, da vrednost spremenljivke postane (bolj) doloCena oziroma opredeljena. Ne more pa se vrednost spremenljivke spremeniti na neko povsem drugo vrednost, razen pri avtomatskem vraCanju. Pri avtomatskem vračanju vrednost lahko postane . manj opredeljena. 2a razliko od prologa pa se vrednosti spremenljivk v proceduralnih jezikih lahko spreminjajo. Zato ima izraz 'vrednost spremenljivke' v prologu nekoliko drugaCen pomen.
Prireditev v standardnih proceduralnih jezikih pomeni spremembo vrednosti spremenljivke na neko določeno vrednost. V prologu take prireditve ni. Izraz 'prireditev' se uporablja v prologu za vgrajeno proceduro 'is', ki se uporablja za izraCun vrednosti aritmetičnega izraza, in prilagoditev izračunane vrednosti s spremenljivko ali konstanto, Ce je to možno.
During the execution of a Prolog program variables get their values by matching. We say that the value of a variable becomes (more) specified or determined. While backtracking the value may become less specified. A variable is therefore more or less instantiated. The value of a variable cannot be changed as in other programming languages. Thus in Prolog the term 'the value of a variable' has a slightly different interpretation than in other programming languages.
An assignment in standard procedural languages causes the change of the value of a variable to a certain new value. There is no such assignment in Prolog. The term 'assignment' is used in Prolog for the built-in procedure 'is' which computes the value of the arithmetical expression on the right-hand side and matches the result with the argument on the left-hand aide, if this is possible.
Pri programiranju niz (string) predstavlja ponavadi niz znakov med dvema enojnima narekovajima agoraj, V prologu js nia seznam celih Števil, ki ustrezajo ASCII kodam znakov, in ga lahko namesto v standardni notaciji seznama napiSemo -tudi kot niz ustreznih znakov med dvojnima narekovajima zgoraj. Miz znakov med enojnima narekovajima zgoraj pa v prologu predstavlja atom.
In programming the term 'string' usually represents a string of characters enclosed in single quotes. In Prolog a string is represented with a string of characters enclosed in double quotes and is equivalent to a list of integers that correspond to ASCII codes of characters in the string. In Prolog a string of characters enclosed in single quotes is an atom.
3. DVOUMNOSTI V TERMINOLOGIJI
Prioriteta operatorjev je v prologu definirana z celimi Števili (ponav^i od 1 do 1200). öim manjös js Stavilo, tem močneje veže operator in obratno. Ta nekoliko neobičajna definicija dostikrat pripelje do nerazumevanja.
V literaturi se pogosto navajata 'green cut' (zeleni rez) in 'red out' (rdeöi rez) (glej npr. Bratko 86). Brez zelenega reza bi procedura pravilno delovala, le nekoliko poöaenejfie bi bilo izvajanje zaradi nepotrebnega vra&anja. RdeCaga reza pa iz procedure ne smemo odstraniti, ker bi bilo izvajanje napaCno. Mnogi avtorji napaCno ugotavljajo, da zeleni rez ne spremeni deklarativnega pomena medtem ko rdeöi rez spremeni deklarativni pomen procedure.
Rez je kontrolni konstrukt, ki nima deklarativnega pomena in zato tudi ne more spremeniti deklarativnega pomena. Vpliva pa na proceduralni pomen dane procedure, saj se zaporedje izvajanja z dodajanjem reza spremeni. Torej tako zeleni kot rdeči rez spremenita proceduralni pomen procedure, le da zeleni rez lahko odstranimo, ne da bi spremenili proceduralno pravilnost procedure, medtem ko z odstranitvijo rdeöega reza postane procedura nepravilna.
3. AMBIGUITIES IN TERMINOLOGY
The precedence of an operator is defined as a strength with which an operator binds its arguments. It is labeled with an integer (usually between 1 and 1200). The greater the label is the weaker the operator binds its arguments and vice versa. This somehow unusual definition often leads to misunderstandings.
In the literature we often meet 'green cut' and 'red cut' (e.g, Bratko 86), Without green cut the procedure would still perform correctly-although lees efficiently due to unnecessary backtracking. The red cut must not be removed because the execution of the procedure would be incorrect. Many authors incorrectly state that the green cut does not affect the declarative meaning while the red cut does. Cut is a procedural construct with no declarative meaning and thus oannot affect the declarative meaning. It certanly affects the procedural meaning of a procedure because the execution is changed. Therefore both green and red cuts affect the procedural meaning of a procedure. The greon cut can be removed without affecting the procedural correctness of the procedure while when the red cut is rémoved the procedure becomes incorrect.
4. SINONIMI
4. SYNONYMS
V seznamu sinonimov so ustreznejši izrazi na desni strani:
In the following list of synonyms the terms at the right-hand side	should be preferred.
atomarna formula - pozitivni literal
objekt - izraz
podatkovna struktura - izraz
atomic formula - (positive) literal
bound variable - instantiated variable
predefinirana procedura - vgrajena procedura sistemska procedura - vgrajena procedura struktura - sestavljeni izraz term - izraz
vgrajeni predikat - vgrajena procedura
5. USTALJENI PEEVODI KEKATERIH IZRAZOV
Za naslednje prevode izrazov smo se zedinili na Fakulteti za elektrotehniko in Institutu Joäef Stefan v Ljubljani:
cut - raz
instance - primer
instance of a tertn - primer izraza
instantiated - opredeljen
matching - prilagajanje
parent goal - nadrejeni cilj
term - izraz
built-in predicate - built-in procedure compound term - structure computation - execution control predicate - control procedure
data structure - term
evaluable predicate - built-in procedure free variable - uninatantiated variable joint variable - shared variable object - term
predefined operator - built-in operator predefined procedure - built-in procedure priority - precedence system procedure - built-in procedure stopping condition - boundary condition unbound variable - uninstantiated variable
ZAHVALA
Slovarja v <Kononenko 87) in (Kononenko, Lavraft 87), ki sta osnova tega dela, sta rezultat skupnega prizadevanja ob pomoöi mnogih sodelavcev. Zahvaljujeva se Tatjani Jane in Alenu Varsku za pripombe na rokopisi
AKNOWLEDGEMENTS
Many colleagues helped us in the preparation of a glossary in (Kononenko & Lavraö 87) which is the basis for this paper. We thank Tatjana Jano and Alen Varäek for their remarks on a manuscript.
LITERATURA - REFERENCES
Bratko, I. <1986) Prolog pi-ogramraing for artificial intelligence, Addison-Wesley.
Clocksin, W.F., Mellish, F.G. (1981) Programming in prolog, SpringerVerlag.
Kononenko, I, {1987) Uvod v prolog in zbirka nalog z rešitvami, skripta, Fakulteta za elektrotehniko, Ljubljana.
Kononenko, I., Lavraö, N. (1967) Prolog through examples - A practical programming guide, Sigma Press.
Sterling, L,, Shapiro, E. (19B6) The art of prolog - Advanced programming techniques, MIT Press.
StNHRONIZACIJA v PORAZDELJENIH RAČUNALNIŠKIH SISTEMIH
INFORMATICA 2/88
UDK 681.3.02
Jože Rugelj Institut »Jožef Stefan«
s Inhroni zac i jski mehanizmi v porazdeljenih računalniških sistemih so potrebni za definicijo in realizacijo urejenosti dogodkov v računalniškem sistemu. Članek podaja pregled mehanizmov za sinhronizacijo na ra.zliönih nivojih abstrakcije računalniškega sistema.
Synchronization mechanisms In distributed computing systems are necessary for the definition and the realization of event ordering In the computing system. This article gives an overvievbt of mechanisms, used on different levels of abstraction.
1. UVOD
Porazdeljen računalniški sistem Je množica procesnih elementov. Vsak procesni element ima	lasten	pomnilnik in procesne
zmogljivosti. V procesnih elementih se izvajajo procesi. Z oznako porazdeljenost Je mišljena porazdelitev nadzora in ne nujno tudi prostorska porazdeljenost. V tesno sklopljenih sistemih so procesi povezani med seboj s skupnim pomnilnikom, v Slbko sklopljenih sistemih pa so procesna mesta povezana s komunikacijskimi kanali, ki skupaj tvorijo komunikacijsko mrežo. Skupnega pomnilnika v takI h s 1 s temih nI. V tem primeru gre tudi za prostorsko porazdelitev procesnih elementov, ki jim zaradi tega redemo tudi procesna mesta.
Procesne elemente združuje v porazdeljen ststera porazdeljen operacijski sistem. Glavne naloge porazdeljenega operacijskega sistema so podpora raedprooesne komunikacije, dodeljevnje virov In upravljanje z njimi, upravljanje z Imeni ter reSevanje Iz napak. Jedra porazdeljenega operacijskega sistema na posameznih mestih so lahko implementirana kot jedra osnovnega operacijskega sistema na tem mestu ali kot procesi na uporabniškem nivoju /TripBT/.
V sistemih z veCiml procesnimi elementi se procesi Izvajajo sočasno, dokler ne potrebujejo medsebojnih stikov. Načrtovane In vodene stike jned procesi i me n u j emo p roc e s n a komunikacija in sinhronizacija. Procesi morajo	sodelovati	zaradi omejevanja
sočasnosti	in	zaradi	medsebojnega
razvrščanja /VerJ83/. Odnose med procesi bi glede na način sodelovanja lahko razdelili v dve glavni kategoriji: lahko tekmujejo med seboj ali so v odnosu pro 1 zv a J a 1ec-po troSni k /Hwan85/,
2. KONSISTENTNOST IN NEDELJIVOST
Predpostavljamo, da se procesi izvajajo v diskretnih korakih In . na vsakem koraku generlrajo dogodek. Dogodek je lahko lokalen
procesu in ga drugI procesi Je viden vsem In s I nhroii 1 zac t J e .
ne opuzijo ali pa vsebuje problem
Na vsakem nivoju abstrakcije lahko proces, ki se izvaja na nekem procesnem mestu, sproži operacije. Operacija na nivoju j Je implementirana kot množica aktivnosti na nivoju i, pri čemer velja i < j. Kar vidimo z nivoja j kot aktivnost Je definirano na nivoju 1 kot operacija. Tak model velja rekurzlvno za vse nivoje.
Operacije oziroma aktivnosti vodijo in upravljajo vire. Viri so predstavljeni kot podatkovni objekti. Na primer, periferna naprava Je lahko predstavljena s svojim trenutnim stanjem, ki Je določeno z vrednostmi množice parametrov. Zapis na datoteki lahko predstavimo z njegovo vsebino. Primeri aktivnosti, ki delajo s podatkovnimi objekti, so pisanje, branje, kreiranje in br i san j e,
Podatkovni objekti Imajo med seboj semantične povezave, to pomeni, da morajo njihove vrednosti zadoSčatl nekim omejitvam. Ko objekt kreiramo, briSemo ali vanj piserao. Je pogosto potrebno kreirati, brisati ali pisati Se v druge objekte, da bi, zadostili kons 1 stenčnIm omejitvam /LeLaBi/.
Vsaka operacija je definirana tako, da predpostavlja na vhodu množico objektov s konsistentnimi vrednostmi in zagotavlja kot rezultat množico novih vrednosti, k! so tudi kons I s ten tne,
Torej, če so konsistentna stanja, ki se prenašajo med procesi ali so shranjena. Je tudi procesiranje konsistentno.
Ce ni posebnih predpostavk, lahko ohranimo konsistentnost	samo z zagotavljanjem
nedeljivosti operacij. Z definiranjem nekaterih drugih zahtev lahko zahtevo za nedeljivost opustimo. Definirajmo nede 1 J i vos t oneraci le. Operacija je nedeljiva, Če zadoSča naslednjima pogojema:
1,	ali se Izvedejo vse aktivnosti popolnoma ali pa se ne izvede nobena in
2.	vmesna stanja prt izvajanju operacije
niso vidna nobeni drugI operaciji.
Nedeljivost je dobro poznan koncept /LmpsBl/.
Običajne Instrukcije na računalniku so nedeljive, ker se izvajajo strogo zaporedno na enf procesni enoti. V porazdeljenih sistemih se aktivnosti, ki pripadajo dani operaciji, lahko Izvajajo na razlienlh procesnih enotah brez doloCenIh Časovnih povezav. Zato implementacija nedeljivih operacij zahteva specifične mehanizme, ki jih v centraliziranih sistemih ne potrebujemo.
ZaCetek Izvajanja operacije ftele po Izvršitvi prejtnje operacije vodi k posebni vrsti dodeljevanja	imenovanega	zaporedno
dodeljevanje	(serial	scheduling). V
porazdeljenih sistemih pa je Izključna uporaba zaporednega dodeljevanja zelo neučinkovita. Ce aktivnosti, ki Jih sproZl dana mnoiica operacij, delujejo nad različnimi objekti, vzporedno Izvajanje aktivnosti nt	le možno, ampak celo
priporočljivo. Se veC, Ce so aktivnosti, k) delujejo nad danim objektom in pripadajo različnim operacijam, sprožene soćasno in postavljene v čakalno vrsto procesnega elementa, ki vsebuje objekt. Je neizkoriščen čas med dvema zaporednima aktlnostlma kraJSI kot v primeru. Ce Je naenkrat sprožena samo ena akt Ivnost.
Torej je lažeijeno, da dovoi/ujerao prepletanje	aktivnosti	(Interleaving),
kolikor je to mogoče, in s tem optimiziramo zmogljivosti. Odgovarjajoča dodeljevanja Imenujemo dodeljevanja s prepletanjem in nekatera od njih so ekvivalentna zaporednim, torej so konsistentna. Toda v sploänem to ne velja. Ce torej želimo Cim veCJo paraielnost In s tem veliko hitrost In dobro izkoriščenost virov moramo poiskati ustrezno prepletanje, ki ohranja konsistentnost. S s 1nbronIzaciJskimi mehanizmi pa lahko realiziramo potrebne odnose med procesi oz. operac1J ami.
3. RAZVRŠČANJE DOGODKOV
Določanje, kateri dogodek se Je zgodil prej, je oblCajno Intuitivno zasnovano na razmišljanju v fizikalnem Časovnem svetu. Tak pristop predpostavlja. da lahko definiramo nek univerzalen £as, ki je dosegljiv z različnih lokacij v sistemu. Vemo pa, da taka reSitev ne obstaja. V praksi lahko dobimo približke	univerzalnega časa z dano
natančnostjo. Vendar pa ni potrebno ali pa nI mogoče izraziti sistemskih specifikacij s fizikalnim časom /KopeST/.
Obstaja določena kronološka urejenost dogodkov v sistemu, ker procesi, ki generlrajo te dogodke, spoštujejo specifična pravila (algoritme, protokole), ki Izražajo relativno urejenost v množici dogodkov opazovanega sistema.
Taka urejenost dovoljuje procesom", ki opazujejo dogodke, da pravilno implementirajo sistemske aktivnosti. Glede na naravo teh aktivnosti Je potrebna delna ali popolna urejenost dogodkov.
Delno urejenost, oznaCeno z " --> " (beri "se zgodi pred") lahko definiramo nad katerokoli n^nožlco dogodkov, ki jih generirajo procesi, ki izmenjujejo sporočila /LampTB/. Za relacijo "se zgodi pred" velja:
1. če sta a In b aktivnosti istega Istega procesa In se a zgodi pred b, potem je a --> b
Z. Ce je a aktivnost za poSlIjanJe sporočila Iz enega procesa In je b aktivnost, ki sprejme to sporočilo v drugem procesu, velja a --> b
3. če je a a "> C
In b --> C, potem velja
Ce procesi komunicirajo med seboj z Izmenjavanjem sporočil, lahko razvrstltimo nekatere dogodke, ki se zgodijo v razllCnlh procesih. Urejenost " --> " je samo delna. Npr, Ce imamo dogodka a in b In velja a --> b In b --> a. potem je nemogoče reči. kateri od dogodkov se je zgodil prej. Za take dogodke rečemo, da so soćasni (concurrent). V spIoSnem dogodkov Iz procesov, ki med seboj ne komunicirajo, ne moremo razvrstiti. V odvisnosti od omejitev, ki Jih moramo upoštevati. Je to dopustno ali pa tudi ne. Eden od naClnov za dosego popolne urejenosti, kadar je le-ta nujna, je pridobitev popolne urejenosti iz delne z definicijo popolne urejenosti nad množico procesov.
Omeniti moramo enega od osnovnih problemov računalniških sistemov, to je problem zakasnitev	pri	procesiranju.	V
enoprocesorskI h sistemih lahko zelo natančno določimo Cas, ki ga procesne enota potrebuje za IzvrSItev danega ukaza. Seveda pa v sploSnem operacijski sistemi niso zasnovani na osnovi takih podatkov. Se manj pa bi bila taka zasnova uproviCena za porazdeljene operacijske	sisteme,	saj zakasnitve
vključujejo Se čas, ki je potreben za prenos podatkov in ukazov med posameznimi procesnimi enotami po komunikacijski mreži, V porazdeljenih sistemih, kjer se operacije Izvajajo na različnih procesnih enotah, spreminjanje zakasnitev v komunikacijskem podsistemu lahko zmoti določeno urejenost dogodkov, ki Jo pričakujemo. To se lahko zgodi celo v sistemih, kjer so zakasnitve stalne. Zato moramo uporabljati določene s 1 nhron 1 zac I jske mehanizme pri procesih, kjer se dogodki zgodijo In pri procesih, ki opazujejo te dogodke.
Namen s Inhron 1 zaci JskI h mehanizmov Je definiranje In realizacija razvrstitve poljubne množice dogodkov. Natančneje, rekli bomo, da je sinhronizacija način za definicijo In realizacijo delne ali popolne urejenosti nad neko množico dogodkov.
S 1nhron)zaclJskIh mehanizmi nudijo procesom pripomočke, s katerimi se sistem ohranja v konsistentnem stanju. Stanje računalniškega sistema Je konsistentno. Ce v vsakem trenutku ustreza nekim zunanjim doloCilom.
4, SPLÜSNE ZNAČILNOSTI S INHRON 1 ZAC 1 JSKI H MEHANIZMOV
S1 nhroni zac I jsk 1 mehanizmi temeljijo na opazovanju In spreminjanju določenih skupnih s 1nhron 1 zac i j ski h spremenljivk. V tesno sklopljenih sistemih so le-te v skupnem pomnilniku, v SIbko sklopljenih sistemih pa so na nekaterih ali na vseh procesnih mestih. Zahteve za branje ali spreminjanje spremenljivk In njihove vrednosti se prenaSeJo kot sporočila po komunikacijski mreži.
Pri	porazdeljenih	rešitvah	so
s I nhron I zac 1 Jske spremenljivke podvojene, razcepljene ali porazdeljene med mesta v s 1 s t emu.
Pod vo j enos t	ali	celo	|iomii o Z eno s 1,
SP remen IJ 1vke pomeni, da so verzije Iste spremenljivke v vseh ali vsaj v veClh mestih v sistemu. Ustrezen mehanizem poskrbi za delno ali popolno konsistentnost v vsakem t renutku .
Razcepljenost spremenljivk Je razdelitev vrednosti spremenljivke na veC komponent. Komponente so porazdeljene po mestih, kjer se spreminja njihova vrednost. Prava vrednost spremenljivke je linearna kombinacija vrednosti njenih komponent.
Porazdelitev spremenljivk pa je rešitev, pri kateri so s inhron 1 zaci j ske spremenljivke porazdeljene po razllinlh mestih v sistemu, vendar samo po ena verzija vsake.
Za izvedbo operacij nad s inhroni zac 1 Jski ml spremenljivkami na katerem koli nivoju abstrakcije se lahko sklicujemo na določen osebek,	ki ga Imenujemo centralni
s inhron I zaci jski osebek. Le-ta je dostopen vsakemu proizvajalcu vsaklC, ko zaćne novo operacijo.
Termin " s inhron 1 zac1 J s k 1 osebek" uporabljamo za usklajevalce procesov , semaforje, monitorje Ipd.; to »o torej aktivnosti, ki popravljajo	In	spremljajo	stanje
sinhronlzaciJsklh spremenljivk.
S inhron1 zac I Jsk1 osebek Je centralen, Ce velja:
nedeljivosti branja in pisanja vrednosti spremenljivk glede na prebrano vrednost. Primera takih ukazov sta test-and-set In comp are-and-swap	/Hwan85/. Taki ukazi
oinogoćajo	realizacijo.	kompleksnejših
s inhron i zac i jski h mehanizmov na visjem n1voJ u .
Obstaja Se drug s 1nhronizaclJsk 1 primitiv. Ki Je obltajno realiziran v strojni opremi in dopušča določeno stopnjo sočasnosti' pri uveljavljanju zaporednosti dostopa do skupnega	pomnilnika.	Imenuje	se
fetch-and-add. Primitiv Ima obliko F&A(X,e) in vrne vrednost spremenljivke X ter poveča njeno vrednost za e. Ce se izvede već takih ukazov hkrati, se vrednost spremenljivke poveča naenkrat za vsoto vseh e,, vsak proces pa dobi vrnjeno vrednost, kot oa bi se ukazi Izvajali v nakljuCnem vrstnem redu zaporedno.
V Šibko sklopljenih- sistemih uporabljajo kot osnovo	za	realizacijo	nekaterih
v iäJ en Ivojski h mehanizmov posebne S t e v nI k e , imenovane fizične ure. /Kope87/ predstavlja
posebno časovno s inhroni zac 1 Jsko enoto. ki poleg podatkov o realnem času vsebuje tudi mehanizme za usklajevanje in popravljane časovne baze, zasnovane na,/LampTS/. S tako enoto razbremenimo procesor, ki ga Je izvajanje sinhronizacije preveč obremenilo.
da ima edinstveno. Ime, ki ga poznajo vsi procesi, ki se s inhron 1 z 1 raj o med seboj,
katerikoli od teh procesov Ima dostop do s inhroni zac I jskega osebka v vsakem trenutku /LeLaBl/.
Nekateri sistèmi so zgrajeni tako, dn preživijo napake, ki se pojavijo pri centralnem	s 1nhronfzacIjskem	osebku.
Predvidene so tehnike za reševanje ob napakah, ki Izberejo nov s Inhroni zaci jsk1 osebek, ko pride do napake.
Vsak s inhron 1 zaci jskI mehanizem, ki temelji na centralnem s inhron i zaci jskem osebku Imenujemo centraliziran.
Ostal^e mehanizme imenujemo porazdeljene.
S inhron1 zac i j s k i	mehanizmi	so lahko
realizirani v elementih strojne opreme računalnika, kot primitivi v programskih Jezikih ali v operacijskih sistemih.
5. ELEMENTI STROJNE OPREME ZA PODPORO SINHRONIZACIJE
Eden od najbolj enostavnih mehanizmov, ki omogočajo	nedeljivost	in medsebojno
izključevanje procesov Je onemoeoC an J e prekinitev procesorja. Ta reSitev je bila uporabljena 2e na enoprocesorskI h sistemih, ki.	so	delovali	na	principu
kvazt-para ! e t nos t i . Na porazdeljenih sistemih nima posebnega pomena, saj procesorji med seboj ne morejo Izvajati takih ukazov /FInk86/.
V tesno sklopljenih s i s temi h,. k j er procesorji opazujejo	In popravljajo vrednosti
spremenljivk,	. je	za	ohranjanje
konsistentnosti	nujno	zagotavljanje
6. SIWHRONIZACiJSKI PRIMITIVI V PROGRAMSKIH JEZIKIH
Pomembna lastnost aplikacijske in sistemske programske opreme, ki povečuJ e preg 1 ed no s t in zmanjäuje kompleksnost, je njena modularnost. Navzven porazdeljen sistem tako izgleda kot množica programskih modulov , kjer je vsak modul zase enostaven zaporeden proces. Procesi v tesno sklopljenih sistemih sodelujejo med seboj tako. da komunicirajo preko skupnih spremenljivk.
Osnovna reSltev problema ■ medsebojnega Izključevanje procesov, ki komunicirajo preko skupnih spremenljivk, Je Dekkerjev algoritem /BenA83/. V njem so zdruZene vse dobre lastnosti enostavnejših reäitev, kot so akivno čakanje z opazovanjem skupne spremenljivke (busy-waiting) in uporabe spremenljivke	za	Izmeničen	dostop
(switch-variable). Hkrati pa odpravlja pomank! i 1 vos t i, zaradi katerih so te reSStve v praksi neuporabne. Mislimo predvsem na nedoslednost pri medsebojnem Izključevanju fn na veliko odvisnost med procesi pri uporabi takega mehanizma.
Dekkerjev algoritem je precej kompleksen in ni primeren za implementacijo.'
Veliko bolj enostavna reSltev je "semaf or /Dijkee/, ki ga je lahko implementirati In je dovolj. močan, da lahko z nJim elegantno reSlmo probleme medsebojnega izključevanja in razvrščanja	procesov. Uobra lastnost
semaforjev, ki Jo prej omnejeni mehanizmi nimajo, je tudi to, da so procesi, ki čakajo na doseg do določenega vira. blokirani. Zato medtem, ko čakajo, sprostijo procesne zmogljivosti za druge procese. Semaforje lahko uporabimo tudi pri implementaciji močnejših primitivov.
Semafor s Je strogo pozitivna ce 1oSteviI ska spremenljivka, ki ji Je pridružena 4e vrsta
procesov.
Definirani sta dve operacij] nad semaf o r J etn , P(b) In V(8)i ki sta nedetjtvt. Operacijo P(s) zmanJAa vrednost s za ena. Ce Je s > Cl , sicer pa odloil proces, ki Je Izvedel to operacijo v vrsto, operacija V(8) pa «budi prvi proces v vrsti, Ce pa Je vrsta prazna,' poveCa vrednost s za ena.
pomeni, da uporabniku ni treba vedeti, kje se nahajajo iskani viri niti tnu ni treba sestavljati sporočil. Vse to zna narediti operacijski s i s tem.
Konstrukt v jeziku Ada /USADSü/, imenovan rendezvous.	Je kombinacija RPC In
Izmenjavanja sporočil. Omogoča sinhronizacijo procesov v Času Izvajanja konstrukta /HenA82/.
Kritična področja odpravljajo edino slabo lastnost semaforjev, to je velika motnost za napake pri njihovi uporabi. Nepravilna razvrstitev operacij P In V Je lahko usodna. Kritična področja so deli programa, kjer proces dosega skupne spremenljivke, ki so v tistem času nedostopne drugim procesom. ZaCetek in konec kritičnega podroCja označujejo posebni Jezikovni konstrukti, kf so zelo enostavni za uporabo. Operacijski sistem potem sam poskrbi za dejansko realizacijo medsebojnega izključevanja.
Poeolna kritična področja so razširitev prejšnjega konstrukta. Vstop v krtično področje Je dovoljen Sele, ko Je izpolnjen nek pogoj. Ovrednotenje pogoja in Izvedba kritičnega področja sta nedeljiva,
Hoare in Brlnch Hansen sta definirala moii i tor /HoarT4/, /firHa73/. Podobno kot proces predstavlja	koristno abstrakcijo pri
mu 11 programi ran ju Je monitor abstrakcija za medprocesno komunikacijo.
Monitor je raztirltev pogo J n i h , kr 11 i čn i it sekcij. Monitor predstavlja telo, icl vsebuje skupne spremenljivke in procedure s kritičnimi sekcijami za delo z nJlml. S ten> postanejo te spremenljivke lokalne, skrite znotraj monitorja. Procesi, ki Želijo dostop do takih spremenljivk, ga lahko dobijo samo preko monitorskih procedur. Monitor je pasiven v sistemu In se aktivira samo takrat, ko procesi želijo dostop do njegovih sp remen 1J1vk .
Pri jibko sklopljenih sistemih pa Je sodelovanje med procesi v različnih programskih modulih povezano s pošiljanjem Iti sprejemanjem sporočil, ki siuZijo za sinhronizacijo tn prennfianje podatkov /SlomST/. Komunikacija med procesi je lahko ehosioerna ali dvosmerna.
Osnovna primitiva pri enosmerni komunikaciji sta 'posili' in 'sprejmi'. pri dvosmerni p« zahtevek z odgovorom (request-rep ly ) ,
oddalieni_klic_procedure (remote procedure
call) in rendezvous,
Primitive za slnhrono sprejemanje podatkov najdemo v večini jezikov, ki so primerni za porazdeljene sisteme ( ADA /USAD80/, CONIC /SlomBS/, CSP /Hoar78/, SR /AndrBl/, Pascal-m /Abra83/ ).
Modul, ki čaka na sprejem sporočila od drugega modula se blokira ter postavi v vrsto Čakajočih In se aktivira Sele po prejemu pričakovanega sporočila.
CSP Ima podoben konstrukt tudi za poSllJanJe sporočI 1 .
Vsi	dvosmerni	primitivi	omogoCaJu
sinhronizacijo med procesi, ki jih Izvajajo. Čeprav se v podrobnostih in načinih izvedbe nekoliko razlikujejo.
Zahtevek z odgovorom Je dvosmerni primitiv, ki Je v bistvu kombinacija slnhronega pošiljanja tn sprejema sporočila.
Oddaljeni klic procedure ima določene prednosti, saj omogoča transparentnost lokacije virov in procesov /Stan82/. To
7. SimiRONlZACiJSKI MEHANIZMI V OPERACIJSKIH SISTEMIH
7.1.	Centra I i z i ran 1
meh an i zrni
s 1 nh ron i zac i J sk i
Skupna	lastnost	centraliziranih
s inhroni zac 1 JskI h mehanizmov Je v tem, da temeljijo na enem s 1nhron1 zac 1 jskem osebku.
F I z I Cna ura predstavlja osnovo za razvrščanje dogodkov podobno kot v centraliziranih sistemih. Operacijski sistem za realizacijo tega mehanizma uporablja posebne elemente strojne opreme. Procesi so razvrSCeni na osnovi Časovnih značk, ki Jih dobijo, kadar Je potrebna sinhronizacija.
Čeprav Je ta metoda enostavna, ima mnogo pomanjkljivosti.	Pravilno	zaznamovanje
dogodkov s časovnimi značkami je popolnoma odvisno od sprejema stanja ure ob vsakem dogodku. Napaka pri prenosu sporočila s tem podatkom Je lahko usodna za pravilno razvrstitev. Potrebujemo tudi vnaprejšnje točno poznavanje zakasnitev v prenosnih kanalih. Natančnost Je odvisna od zahtev sistema oziroma aplikacije.
Števec dogodkov Je objekt, ki Steje dogodke, ki so se zgodili v določenem razredu (npr. aktivnosti). Definirani so trije primitivi: poveCaJ vrednost Števca, preberi vrednosti Stevcfl in odloži klicoči proces, dokler nI vrednost Števca irsaj enaka podani konstanti. Pomembna prednost tega mehanizma Je v tem, da dovoljuje sočasno Izvajanje teh prlmltlvov na istem Števcu brez medsebojnega izključevanja. Pojem "Števec z enim upravl J a icem" Je definiran kot Števec dogodkov, ki deluje pravilno, dokler Je prepovedano sočasno povečanje vrednosti Števca. Vendar pa je sistem zelo občutljiv na Izpad posameznih elementov, saj je vsaka napaka Števca usodna za pravilno razvrščanje.
Statični razvrSčevalnik uporabljamo za popolno razvrstitev dogodkov v danem razredu (Števec dogodkov omogoča samo delno urejenost). KazvrSCeva1n1 k Je celoštevilčna spremenljivka. Za delo z razvrSCevaln1 kom Je definiran samo en primitiv, Imencvan vstopnica (ticket), ki vrne tekočo vrednost razvrSčevalnika In poveča njegovo vrednosti za 1. RazvrSčevalnik zahteva ločen mehanizem za medsebojno izključevanje zato, da Je primitiv vstopnica nedeljiv. Dva proizvajalca ne moreta dobiti vstopnice hkrati. OlavnI problemi pri uporabi razvrSčevalnika so pri izbiri mehanizma za medsebojno izključevanje in vpliv napak ali Izpada razvrSčevalnika na preživetje s inhron1 zac i jskega mehanizma.
Jasno Je, da centralizirani pristopi v porazdeljenih sistemih ne Izpolnjujejo zahtev, ki so bistvene za take sisteme. Sistemi, zgrajeni na takih osnovah, ne morejo imeti visoke stopnje razpoložljivosti in imajo v sploSnem slabSe zmogljivosti zaradi
oztceea enota.
grla. ki ga predstavlja centralna
7,2,	Porazdeljeni
mehanizmi
8InhronI zac IJ sk i
S porazdelJentnil mehanizmi dosegamo veCjo stopnjo paralelnosti In s tem hltvejSe delovanje, boljio 1zkor1Sćenost opreme In veCJo	zanesljivost.	Porazdeljeni
sinhronizaclJskl mehanizmi so večkratne fiziCne ure, večkratne logične ure, abstraktni Izrazi,' skupne spremenljivke, kroieCl žeton In krožeči razvričeva I nt k .
Cilj uporabe fizičnih ur Je v določitvi enotnega fizičnega časa v sistemu. Konsistentno dodeljevanje lahko dobimo Iz popolne kronoloike razvrstitve aktivno&tl, ki se pojavljajo v sistemu. Pri uporabi več »r nI pomembna samo točnost vsakje posamezne ure ampak tudi medsebojna usklajenost, tako da Je razlika med poljubnima dvema urama tnanJSa od vnaprej določene konstante. ReSltev tega problema Je podal Lamport /Lamp 76/, Sistem modeliramo kot povezan graf procesov s premerom d. Premer grafa predstavlja minimalno Število procesnih mest. preko katerih mora Iti sporočilo Iz poljubnega procesnega mesta. da doseZe poljubno drugo procesno mesto. Vsak proces Ima uro In periodično	(perioda	t)	poèllja
sinhronizaclJska sporočila vsekemu drugemu procesu. Vsako s Inhron 1 zac i J s ko sporočilo vsebuje fizično časovno znaCko. Po prejemu s i nhronlzaci Jskega sporočila proces pomakne svojo uro naprej, če Je časovna znaCkn večja od stanja ure. Predpostavljamo. da poznamo spodnjo mejo (u) in zgornjo mejo (u + z) zakasnitev na komun I kad jskl mrei! . Naj bo k natančnost vsake ure (k<10" ) in e dovoljen zamik med poljubnima dvema urama. Ce Je e/(l-k) < u In e << t, potem Je možno izračunat! približno vrednost e, ki Je prlbl Ižno d(2kt * z).
V odvisnosti od zahtev glede relativnih zamikov in veljavnosti predpostavk glede zakasnitev tia komunikacijski mreii se lahko
odločimo za tveganje, da	bomo občasno
izgubili sporočila zaradi	prevelikih
zakasnitev In taka dosegli	verjetnostno sinhronizacijo ali pa ne bomo tvegali. Tak
pristop bistveno . zmanJSa	zmogljivosti
sistema. Ključni parameter	pri tem Je razmerje z/u.
Večkratne logične ure so prvič opisane v /Lamp 78/. Implementirane so kot funkcije C, ki dodelijo Število vsaki začeti lokalni aktivnosti. To so torej navadni Števci. V sistema, kjer Ima vsak proizvajalec svojo logično uro Je problem, kako zagotoviti globalno razvrstitev. Funkcija C Ima lastnost, da velja C(i.a) < C(J,b), če ste a In b aktivnosti v procesih t in J In velja a > b. Pri realizaciji logičnih ur moramo upoštevati dve pravili:
Pravilo 1: Vsak proces i poveča vrednost Števca	C(l)	med dvema
zaporednima aktivnostima.
Pravilo 2: Ce aktivnost a v procesu 1 poSIlJa sporočilo In potem sporočilo m vsebuje časovno značko T(m) =■ C(l ,a) , Po prejemu sporočila m proces j postavi svoj Števec C(j) na vrednost, ki Je večja ali kvečjemu	enaka	trenutni
vrednosti in Je večja od T(ra).
Vsako funkcijo C, ki Ima zgoraj navedeno lastnost.	lahko uporabimo za popolno
razvrstitev poljubne množice aktivnosti. Za to potrebujemo 3e popolno ureditev procesov (npr. glede na njihova Imena), Aktivnost a se Je zgodila pred b. Če Je C(i.a) < C(j.b) oziroma C(l.a) - C(J,b) in je I < j. SihronizaciJskl mehanizem definiran s pravili 1 in 2 in popotna razvrstitev dovoljujeta konsistentno	dodeljevanje	aktivnosti.
Definirana popolna razvrstitev ni edinstvena in nI ekvivalentna kronološki razvrstitvi. To je razlog, da je včasih treba implementirati sistem logičnih ur na sistemu fizičnih ur.
Mehanizem z uporabo abstraktnih izrazov /Herm83/ temelj! na uporabi logičnih ur. Vsaka ura Ima poleg osnovne verzije, ki je pri procesu, ki povečuje njeno vrednost Se dodatne verzije pri drugih procesih, ki samo spremljajo njeno vrednost. Predpostavimo, da mora vedno veljati abstrakten Izraz
kjer sta c In k konstanti, Xj vrednost logične ure In i Število logičnih ur. Dodatne verzije logične ure imenujemo njene slike. Označene so z w ( it ^ ) In se lahko razlikujejo od osnovne verzije, Z uporabo dodatnih verzij pri delu s s 1nhroni zac i JskImi spremenljivkami zelo zmanjšamo komunikacijske potrebe In povečamo učinkovitost. Kamenjava osnovne verzije logične ure z njeno sliko je dopustna, če pri zamenjavi upoštevamo tako stroge omejitve, da je kljub dopustni napaki slike izpolnjena zahteva osnovnega abstraktnega Izraza,
To pomeni, da lahko pri slikah logičnih ur. kjer je konstanta c negativna popravljamo njihove vrednosti z zakasnitvijo, ne da bi s tem ogrozili pravilnost abstraktnega izraza. Podobno lahko vrednosti slik logičnih ur, ki imajo konstanto vnaprej.
Cj pozitivno, popravljamo
S inhroni zac i JskI mehanizmi, ki temeljijo na fizičnih ali logičnih urah Imajo skupno lastnost, da ne temeljijo na medsebojnem Izključevanju, To je velika prednost pri uporabi v porazde 1 J en i h s I s temih, saj omogoča paralelnost Izvajanja..
Sinhroni zaci Jsk1 mehanizmi lahko pri svojem delu uporabljajo dejstvo, da imajo procesi edinstvena in stalna lift.:na. To določa popolno razvrstitev procesov, ki daje opazovalcu občutek, da so procesi povezani v neko verigo ali obroč. Vsak proces ima natančno določenega prednika In naslednika. Taka logična razvrstitev ni nujno povezana s fizično topologijo sistema.
Obroč služI kot osnova za prenašanje posebnih pravic med procesi v sistemu. Pri sinhronizaciji taka posebna pravica pomeni dostop do sinhronizaclJskega osebka.
Skupne
spremen IJ Ivke:
uporaben Las tn i S tvo
Sinhronizacijski mehanizem, ki temelji na konceptu logičnega obroča Je bil predstavljen v /Dl}k74/. Ker predvideva uporabo skupnih spremenljivk Je v tesno sklopi jnlb sistemih, nad posebno pravico lahko zaznamo z opazovanjem spremenljivke. ki jo proces deli z enim od obeh sosedov v obroču. Z znanimi algoritmi lahko dosežemo stabilno stanje. kjer od večih posebnih pravic ostane samo Se ena, ki potem kroil po obroCu In zagotavlja medsebojno Izključevanje.
Odkrivanje napak, reSevanJe Iz napak In dinamično Širjenje sistema so tnolnl z metodami, opisanimi pri podajanju žetona.
Krožeči žeton je s tnhront zac 1J s k 1 mehanizem zasnovan na enaltih osnovah kot deljene spremenljivke, le da se posebna pravica prenaSa med procesi s sporoćllom, kl Ima edinstveno obliko In ga Imenujemo ieton. s tem	principom	dosežemo	medsebojno
Izključevanje procesov v Slbko sklopljenih sistemih. Po prejemu ietona lahko proces opravi ieEJeno aktivnost, Xeton lohko obdrži največ nek naprej doIoCen čas in potem ga mora predati nasledniku v obroču. Mehanizem mora zagotoviti obnovitev logiCnega obroča In podajanja žetona, Ce izpade trenutni lastnik žetona ali katerikoli element lot'lCnuga obroCa, hkrati pa iahko dopu£Ca v sistemu samo en žeton.
Protokoli za odkrivanje napak In njihovo reševanje so podani v /LeLa77/ In v /LeLa78/. Tak s Inhron1 zac 1 jsk1 mehanizem je uporabljen tudi na logičnem nivoju lokalnih mrež In definiran v /ISOSOZ/.
učinkovitost s I nhron 1 zac 1 Jsk1 h mehanizmov za medsebojno izključevanje med procesi Je v veliki meri odvisna od časovne obsežnosti kritičnih sekcij. Ce kritične sekcije vključujejo	izmenjavo	sporočil	med
proizvajalci in potro&nlki, potem to daje nizke zmoglJ E vos t i.
KrožeCI razvrSčeva1nI k Je ra z vräCeva1ni k, ki stalno kroži po logičnem obroču in uporablja mehanizem podajanja žetona.
Po prejemu žetona lahko proces aktivira več primitivov tipa vstopnica In potem poSlJe žeton nasledniku. Na ta način dosežemo medsebojno	izključevanje	uporabnikov
razvrSčevalnika. V Članku /LeLa78/ je opisani protokol, ki uporablja opisani koncept.
Protokol uporablja Številko obhodov,	ki jo nosi žeton In se poveča vsakic, ko doseže
proces, katerega naslednik ima	nižjo itevilko, kot je njegova.
shranjevanje potrebnih Informacij.
Konvergentnost In poätenost
sta lastnosti zagotavljata, da se v konfliktnih viri in zmožnosti enakomerno
s i s tema, k 1 situacijah porazdelijo med tekmujoče procese,
fiazSlrlJivost	Je zahteva, da mora
s inhton 1 zac 1 jsk1	mehanizem	dopuičati
dodajanje ali ponovno vključevanje procesnih elementov, ne da bi s tem motil delovanje s i s tema,
Ulli nčunoM t. Sistem Imenujemo determini« ličen,
če Je zasnovan tako, da v vsakem primeru doseže s 1 nhron i z Irano s t. Ce pa doseže sinhronizacijo samo v večini primerov. se sistem imenuje verjetnosten.
ReSevaiiJe Iz napak Je zelo pomemben nabor funkcij. Posamezni mehanizmi se razlikujejo po količini pomoči, ki jo nudijo pri reSevanju iz napak, po vplivu pokvarjenih elementov sistema na pravilno delujoče in po času, ki Je potreben, da se popravljeni elementi spet vključijo v normalno delovanje s i s t ema.
Povezanost med elementi sistema se zelo razlikuje glede na različne mehanizme. Slabi so taki mehanizmi, ki zahtevajo polno povezanost, saj Je le-ta zelo draga v primerjavi s tipi povezanosti, kjer so elementi združeni v verigo ali obroč.
Vzpostavitve
z ačetnega
stanja
kompleksnosti lahko zelo razlikujejo.
po
Razumljivost In enostavnost mehanizma nam olajšata delo pri snovanju, formalnem preverjanju, implementaciji, testiranju In vzdrževanju.
9. LITERATURA
8.
KRITERIJI	ZA
S I NHRON I ZAC USKIH MEHANIZMOV
/Andrei/ G.R.Andrews;	The	Distributed
VREDNOTENJE	Programmine	Language, Software
Practice and Experience vol.12 1982
S I nhron I zaci Jski mehanizmi nimajo Identičnih lastnosti, v nadaljevanju podajamo važnejSe kriterije za vrednotenje s I nhron I zaci J s k i h mehanizmov. Pomembnost vsakega od njih je odvisna od omejitev. ki jih postavimo s I s t emu,
Odzivni čas in propustnost;
Vsak mehanizem upoštevati in paraleinosti
mora v največji možni meri izkoriščati vse prednosti delovanja porazdeljenega sistema. Paralelnost v sinhronizaciji in. pri komuniciranju omogoča veliko propustnost in kratke odzivne čase.
Prožnost pomeni, da mora sinhronizacijski mehanizem biti odporen na pojav napak In «manjäanja Števila procesov v sistemu. Potrebujemo pravzaprav bolj natančno merjenje te lastnosti, ki bi izrazila Število hkratnih napak oziroma Izpadov procesov, ki jih sistem Se prež Ivi.
Dodatna poraba virov (overhead) Je pi'Hvza]>rav cena, ki Jo moramo plačati, da lahko izvedemo sinhronizacijo. Kaže se v povečanju prometa med procesi in je odvisna od Števila in velikosti paketov, v večji potrebi za procesiranje	(dodatnih	sporočil	za
sinhronizacijo) In v uporabi pomnilnika za
/BenASa/ M. Đen-ArI; Principles of Concurrent P r o g r amm ing.	Prentice	Hall
International 1983
/BrHa73/ P.Brinch Hansen; Operating Systems Principles, Prentice Hall 1973
/Dijk68/ E.W.	Dljkstra:	Cooperating
Sequential Processes In Programming Languages, Academic Press 1968
/Dljk74/ E.W. Dljkstra: Se I f-s t ab i 1 I z ing Systems In Spite of Distributed Control, Comm. ACM vol.17 no.11 1974
/FlnkB6/ R.A.	Finkel:	An	Operating
SystemsVade Mecura, Prentice-Hail 1986
/Herm83/ D. Herman; Towards a Systematic Approach to implement Distributed Control	of Synchronization, In
Distributed	Computing Systems,
Academic Press 19S3
/Hoar74/ C.A.R. Hoare; Monitors: An Operntlng Systems Structuring Concept, Comm. ACM voi .17 no.10 1974
/HoarTa/ C.A.R.	Hoare:	CommunI ca t Ing
Sequential Processes, Comm. ACM vol . 21 no.8 1970
/Hwan85/ K. Hwang, F.A. Briggs: Computer Architecture and l^urut«! Processine, McGraw-HIM 1985
/isoa02/ ISO IS 8802/4 Local Area Networks, token Passing Bus Acces Protocol
/Kope87/ H. Kopetz, W. Och s enre I t er ; Clock Synchronization	in Distributed
Real-Time Systems, IKEE Trans. on comp,, vol.36 no.8 1987
/Lainp78/ L. Lamport; Time, Clocks, and the Ordering of Events, Comm. ACM, vol.21 no.7 1978
/Lmpsei/ L. Lampson: Atomic Transactions, LNCS 105, Springer Verlag 1981
/LeLa77/ G. LeLann ; Distributed Systems
Towards a Formal Approach, Proc.!F!P Congress Toronto, Nort h-Ho 1 1 and 1877
/LeLa78/ G.	LeLann:	Algorithms	for
Distributed Data Sharing System which Use Tickets, Proc. 3.Berkley Workshop, August 1978
/LeLaSl/ 6. LeLann: Synchronization, LNCS 105, Springer verlag 1981
/SlomaS/ M. Sloinan et. al: The CONIC Toolkit for Building Distributed System Proc. 6.IFAC Workshop on Dlatr. Contr, Comp. Sys., Pergaraon Press 1985
/Slome 7/ M. SI omari','" J. Kramer: Distributed Computer Systems and Computer Networks,	Prentice	Hall
International 1987

/Stanež/ J.A. Stankovlc:	Software
Communication Mechanisms:	Procedure
Calls vs. Messages. IEEE	Computer, April 198 2
/Trlp87/ A.Tripathl: Distributed	Operating
Systems, Tutorial of	7th ICDCS Berlin, september 1987
/IISAD80/ USA Departement of	Defence:
Reference Manual for	the ADA
Programjnlng Language,	Proposed Standard Document 1980
/VerJ«3/ J.P. Verjus: Synchronization In Distributed Systems, in Distributed Computing Systems, Academic Press 1963
MATEMATIČNO SODILO O MOŽNOSTI STROJNE INTEÜGENCE (ali o Churdievem teoremu)
UDK 519.713.6
Lucijan Vuga Iskra Delta, Nova Gorica
Church Je 193ó. leta izdelal teorem, ki ima globok vpliv na •filozofske osnove kibernetike in na razre&itev dileme ali Je mogoCe Izdelati inteligenten stroj.
Vpražan iei flli lahko človek predvidi vselej in za vse primere, kakSen bo odgovor stroja, će vnesemo vanj ustrezen program in mu postavimo primerno vprašanje?
Odgovor^ po Churchovem teoremu, ki gradi na diagonalnem postopku stroja, Je lahko nepredvidljiv - vendar ne zaradi naključnih napak v HW ali SW, ampak zaradi matematično - logičnega mehanizma diagonalnega postopka, ki omogoča poleg znanih tudi nove kombinacije elementov, kar imenujemo ustvarjalnost'.
Churc 'made in 1936 the theorem, that has deep influence on the philosophycal bases of cybernetics and the solution of dilemma whether is possible to make an Intelligent machine.
The question; Could one foresee allways and for all instances, what will be the answer from oiachina, if inserts an adequate program and puts an appropriate question?
The answer, according to Church's theorem, which builds on the diagonal machine procedure, could be anpredictable - but not beacouse of accidental HM or SM faults, however owing to mathematical - logical mechanism of the diagonal procedure, that enables besides known also new combinations of elements, what we call creativity.
Človekov mladič si pridobi najosnovnejše pojme o geometriji s èestimi leti, z devetimi leti pa je sposoben, da manipulira z razdaljami v prostoru - in po trditvah nekaterih proučeval-cev - odrasel èimpanz nič ne zaostaja za njim. Daleč od tega, da bi enačili človekove in opičje intelektualne zmogljivosti, toda že iz enega primera je videti, da so potrebna človeku dolga leta učenje, preden je sposoben kolikor toliko uporabljati najspecifitnejèi organ -možgane. Torej, kot je znano, niso dovolj le fitoigani, temveč jih je treba napolniti z ustreznim znanjem, podatki itd. ter jih tudi ustrezno "splošno kondicionirati" - kot vsakdanje pravimo, ustvariti osebnost, ki je nujen pogoj aa ustvarjalno delo. Pogoj pravimo zato, ker sicer brez tega manjka moiganom odločitveni kriterij in dinamika vzgiba. Bogatenje duéevno-sti omogoča za ustvarjanje potrebno iniciacijo in za labilna, zamegljena stanja potrebno odločitveno nagnjenost.
Vendar se Se vedno, tudi v načelu bije bitka,, ali je sploh mogoče strojno oponaèati delovanje človekovih možgan, ne oziraje se pri tem ie na
to, če smo sploh sposobni dokopati se do tehnologij, ki zagotavljajo uporabnost takega stroja. Tudi nekateri načelni privrženci strojne inteligence mislijo, da je do njene ostva-ritve ée selo daleč.
Zanimivo bb pogledati neki matematični prijem, ki skuša po svoje razrešiti dilemo ali je stroj zmožen, za človeka nepredvidenih reéitev torej, ali stroj lahko preseneti človeka? Pri čemer seveda ne mislima na napake med delovanjem, slabo programiranJe ipd.
Diagonalni postopek
Z diagonalnim postopkom Je Cantor dokazal, da Je množica realnih Števil neštevna in da Je zato moč take množice večja od moči množice racionalnih àtevil. Dve množici imata enako moč ali sta ekvivalentni, če Je mogoče njune elemente drugega drugemu paroma prirediti, moč množice izrazimo s kardinalnim Številom.
Vzemimo kot ilustracijo poseben primeri (1) realna 4tevlla naj bodo le tista med nič in ena (kar se lahko nato posploäi), (2) privzamemo, da vsako realno àtevilo v tem intervalu lahko zapièemo kot binarna itevi lo s karakteristiko nič in neskončno dolgo mantisa (če od določenega mesta naprej nastopajo v mantisi le ničle, potem imamo opraviti 2 racionalnim številom). Izhajamo iz možnosti, da je mogoče paroma prirediti elemente množice naravnih àtevi1 in naie množice realnih itevi 1. S tem oštevilčimo vsa realna itevila na naslednji način:
it. 1 ima	Ai2 ^IB-'-
it . 2 i ma O, «21 ^22 • ■ ■ it. 3 ima 0,A3i fl32 «33-•■
itd.
Ker smo se tako dogovorili, ima vsako ètevilo Ai j vrednost O al i 1 in bi lahko imeli tudi neki tak primeri
1
2 3
itd.
0,0 0,0 0,1
0
1 o
1, o. o.
Sedaj pa se odločimo, da si skombiniramo novo itevi lo tako, da ga sestavimo iz diagonale zgornje tabele;
O.ftil Az2
oziroma bi to bilo v naiem izbranem primeru:
0,0 1 o...
vendar pa bomo vsako cifro invertirali ter tako cTobili iz ničle enko in iz enke ničlo:
O,Bi
B2
B3...
kjer je Bi=0, če je Aijsi in Bi=l, će je flij=0. V aapisaneni primeru bo étevilo dobilo tole vrednosti
0,1
1. . .
Oćitno se to étevilo razlikuje od kateregakoli étevila v prvotni tabeli.
Ker smo se pa dogovorili, da bomo ožtevilćili vsa realna itevi la, i diagonalnim postopkom pa je vselej mogoće ispitati eno dodatno realno étevilo, ki ga ni najti med prejénjimi, je oćitno to protislovno. (Z druge strani pa lahko preslikamo množico naravnih žtevil na delno množica realnih žtevil med nič in ena tako, da uporabimo inverzna realna étevilai 1, 1/2, 1/3., 1/4,.,, ampak slednje nas tu sedaj ne zanima.) Church je 1936. izdelal teorem, ki se po njem tudi imenuje, ima pa globok vpliv tudi na filozofske osnove kibernetike in, videli bomo kasneje, tudi na razrešitev dileme ali je mogoče izdelati inteligenten stroj.
ki ga lahko krajèe napiéemo kot algoritem za étevilo X, torej D<X1.
Predpostavimo, da je področje, ki ga definira ta algoritem definitno, tj. da za vsako naravno število zna odgovoriti ali pripada ali ne pripada temu področju. Zato lahko sestavimo algoritem B<X) podan z enačbama;
B(X)
D(X), če je D uporaben na X O , če D ni uporaben na X.
Kot vsak drugi algoritem ima tudi B neko ètevilko, ki naj bo recimo M torej B(H), tedaj lahko napižemo B(X) = U(X,M). Od prej vemo, da je E<(X) povsod definiran - zato je definiran tudi B<M) in s tem ée U{M,M) . Ce p.a je tako, je definiran tudi U(M,M)+1, ekvivalentno zgoraj opisanemu D(M). Po konstrukciji B velja B<M)=D(M)=U(M,M)+1. Z druge strani uvrstimo X=M z enačba B(X)=U(X,M), dobimo B<M)=U<M,M). To pa je protislovno, kar pomeni da obstaja étevna množica, ki pa ni definitna. To je Churchev teorem,-
Churchev teorem
Preden se lotimo samega teorema je prav, dogovorimo o nekaterih definicijah.
da se
Prvo; Kaj je algoritem? V veljavi je več ekvivalentnih in točnih definicij, kaj je algoritem - dovolj je dokazati nek teorem, za eno od njih in avtomatično velja za vse ostale. V intuitivne« sail si u razuaena al gori tem kot navodilo za postopno izvrfievanje nekih operacij na nekih objektih (znakih. Črkah), da bi dobili nek rezultat.
Množico (naravnih ètevil) imenujemo definitno, če obstaja algoritem, ki lahko odgovori na vpraSanje, ali neko naravno ètevilo pripada tej množici.
Zato.je umestno vprašanje ali je vsaka étevna množica tudi definitna*?
Množica je žtevna, če obstaja algoritem, predpis, program, ki postopoma proizvede vse njene elemente. Ce torej imamo tak program ali smo tudi gotovi, da bo izpisano tudi neko konkretno število? Program je dokončni tekst (je determiniran), ki 9a lahko do popolnosti proučimo in iz njega izvlečemo vso informacijo, zato, na prvi pogled izgleda, ne bi smelo biti nikakrènih zadržkov za izpis katerega koli števila.
Ce vzamemo poljuben programski jezik (algol, basic ltd.) lahko prepoznamo vsako njegovo besedo kot del programa ali pa kot njegov ne-dal, ker Je točno podana sintaksa programiranja v enem ali drugem jeziku. Qd tod izhaja, da lahko programe dtevilčimo z naravnimi števili. Tako lahko sestavimo univerzalni algoritem U, katerega izhodidčnl podatki vsebujejo- par cifer« številko algoritma N in ètevilo X nad katerim naj Izvajamo N-ti algoritem. Tako bo U vseboval navodilo, kako naj de&lfrira Število N in ga preoblikuje v tekst ustrezajočega programa, nato pa èe tak algoritem izvede na številu X. Ce lahko N-ti algoritem označimo z ft|M, lahko vse skupaj napišemo v obliki!
UCX,N) = A(gCX).
Church je raziskal "diagonalni algoritem" U (X,X), ki deluje tako, da oponaša uporabo algoritma I na ètevilu l,nato uporaba algoritma 2 na' številu 2, itd. Tako dobljena množica bo področje, ki ga definira algoritem U <X,X)-t-l,
Uporaba Churchevega teorema na strojih
V.N. Trostnikov opozarja, da je mogoče Churchev teorem uporabiti za preiskavo možnosti izdelave inteligentnega stroja in navaja, da se običajno definira razliko med človekom in strojem na naslednji način;
"Vnašanje stroja Je strogo deteroinirano, nedteo, ko ima človek lastnost, ki Ji praviao svobodna volja.'
Vendar kje naj bi se kazala ta svobodna volja, v kateri razvojni stopnji ustvarjalnega razmišljanja? Ni potrebno, da se strinjamo prav z vsem kar je za naslednjo shemo, ki jo uporabljajo nekateri psihologi, je pa primerna pri obravnavi te snovi, da je ustvarjalnost postopnost naslednjih dejavnosti; priprave, inkubacije, inspiracije in preverjanja.
Priprava naj bi zajemala kopičenje potrebnega znanja, podatkov ter ustrezno urejanje. Inkubacija za katero mnogi trde, da se odvija med določenimi podzavestnimi procesi; asaciranjem, odpadanjem nekaterih idej po zakonu nedavnosti i. p. Navdihnjenje - inspiracija, ko se povežejo nazlični podatki, dejstva, elementi v novo kombinacijo. Preverjanje iz inspiracije nastale zamisli, ki terja zavestna, organizirano in sistematično delo z äe določenim ciljem.
Izhajanje iz te sheme, bi lahko rekli, da je le v inspirativni fazi potrebna neka "svobodna volja", ki naj zagotovi novo, doslej neznano kombinacijo znanih elementov. Tako pripravljalna faza, kakor tudi inkubacija nimata sestavin ustvarjalnosti, kljub nekaterim neznanim dejstvom o podzavestnih procesih, saj gre za pripravo gradiva. Res je tudi, da pri tem z«lo verjetno nastopajo medsebojna prekrivanja faz ter različne interakcije, povratne zveze i.p. Toda pravi trenutek stvaritve je zavedanje, da ja nastala nova kombinacija, čemur pravimo inspiracija. Poslednja aktivnost, preverjanje, pa je sicer izredno pomembna, včasih dolgotrajna in naporna, toda ta deluje po ustaljenih principih, v kolikor seveda ni potrebno tudi za preverjanje usA:variti kaj novega, kar terja postopek podoben prejènjemu - sicer seveda ostane nova zamisel brez potrditve zgolj hipoteza.
Oglejmo 3i zato nekolika podrobneje inspirativi no fazo. Trostnlkov trdi,da bi morali definirati vpraéanje o svobodni volji, ki naj bi bila osnova ustvarjalnemu mišljenju, nekolika drugače, pri tem poklićena pomoč tudi' Turinga, ki je rekel, da bi moral i "postati stroj, će bi hoteli vedeti kaj stroj ćuti". Zato si je treba pomagati drugače z vpraèanjem;"flli se človek in stroj natelno razlikujeta v tem, da je obnašanje stroja, potem ko poznamo njegov program, v vsakem primeru (naCelno) predvidljivo, medtem ko je pri Človeku nepredvidljivo?" Ker smo Se spoznali diagonalni postopek in Churchev teorem, ki sta nam pokazala, da kljub jasnemu algoritmu nismo nikoli prepričani - in to načelno, ne zaradi tehničnih napak v delovanju stroja - ali je ali ni proizvedeno étevilo del množice, ki jo generira algoritem. Zaključek tega razmišljanja Je torej tak, da tako Človek kakor stroj nista determinirana in iaiata ustvarjalne sposobnosti.
"Svobodna volja"
Seveda pri tem ostaja èe vedno odprto vpraéanje svobodne volje, ki naj bi bila posebna človekova sposobnost, da lahko pri razsojanju, hotenju in delovanju odloča in izbira med dvema ali več odločitvami. Fi 1 ozo-f i-i deal i sti imajo človekovo svobodo za absolutna, kar pomeni, da ni odvisna od objektivne nujnosti naravnih zakonov, in je glede na to znak človekove pripadnosti nekemu nadnaravnemu svetu. Za materialiste Je svoboda "nujnost, ki je preéla v zavest"; ne obstoji v namièljeni neodvisnosti od naravnih zakonov, ampak v spoznanju teh zakonov in na tem znanju temelječi moinosti izkoriščanja naravnih zakonov in sil v prid določenemu cilju; "svoboda volje pomeni sposobnost odločevanja na temelju poznavanja stvari" (Engels); "svoboda je spoznana nujnost" (Hegel'). Izhajajoč iz slednjih opredelitev tudi človek pri ustvarjanju ni svoboden, čeprav izgleda, da je prav prebijanje omejitev, ki so bile "spoznane kot nujnost", ustvarjalni napor, je tudi res, da tudi nekonvencionalnost, iskanje novega, potreba po ukinjanju aksiomov itd. , Je za človeka "spoznana nujnost". Nujno je spoznavanje narave, nujno je reéevanje neznank, nujno je iskanje novih kom-r binaci j, v tem je osnovni vzgib ustvarjalnosti, torej tudi "nepredvidljivosti" - čeprav lahko vsaj načelno pričakujemo nova spoznanja, odkritja, izume.
Nedvomno je mogoče tudi stroju priskrbeti toliko izkustev, znanja, podatkov, da bo "spoznal nujnost" posameznih odločitev, dasi seveda pri tem ostaja vpraéanje tehničnih moinosti kot reèljiv a zelo zahteven problem. Program, ki je potreben za delovanje stroja izdela človek, ni pa rečeno, da tega ne bi znal tudi stroj, če bi mu dali progran; ss, programe morda vsaj za posamezne dele prvotnega programa. In prav to je pomembna pri ustvarjanju "svobodne volje" samega stroja. Program vsebuje ukaze za računanje, naslove, odločitve i.p. To ±e pri programiranju razumemo dobesedno in pričakujemo,da bo stroj stoodstotno izvršil naä ukaz; če se tako ne izvréi smatramo, da je stroj pokvarjen «li da obstaja neka druga napaka, ki preprečuje dobesedno natančnost. Pri človekovem moiganskem delovanju pa ni tako. Pojmi s katerimi operiramo so megleni, mehki, nenatančni. Za obravnavanje takih stanj, pojmov in postopkov je razvita teorija zamegljenih (mehkih) množic - +uzy sets theory. Tako n.pr. pri odločitvi 1 da - ne, nimamo ostrega in jasnega prehoda, temveč obstaja vrsta vmesnih stanj med 1-0 da-ne. Za oponašanje take situacije moramo dati stroju možnost da se s pomočjo podprograma, na osnovi vloženega znanja, izkušenj itd., opredeli za stopnjo odločenosti med O in 1, med ne in da. Se več! Tak podpro-
gram ne bo en sam, niti ne bodo taki podprogrami fiksni, temveč bodo odprti, dogradi ji vi, spremenljivi. S tem bomo dobili možnost za ne-determiniranost strojnega dela in za približevanje načinu možganskega funkcioniranja, pri čemer bo posamezna odločitev odvisna od kopičenja potrebnega znanja, podatkov, okoliščin itd.
Nedvomno lahko na tak način pridemo do adeter-minističnega stroja, torej takega, katerega proizvodi ne bodo vselej pričakovani, saj bodo sledili -funkcioniranju notranjega ustroja, katerega logika se bo lahko razlikovala od človekove, ki je zgrajena na zgodovinskem izkustvu konkretne realnosti v katerem poteka filogeneza človeštva. Tako kot nas sedanji stroji prekaSa-jo po hitrosti, natančnosti in obsežnosti obdelav, lahko domnevamo, da na» stroji lahko prekašajo tudi pri drugih zmožnostih kombiniranja, analize i.p.
Tako pridemo do usodnega sklepa, če naj ima stroj vložene vse podatke, ki so potrebni za njegovo delovanje in odgovarjajoče funkcioniranje adeterminističnege dinamičnega programiranja, mora vedeti tudi, da sam obstaja: Cogito ergo sum. Poznati mora svoje sposobnosti, zmogljivosti, obseg obdelav, spomina, samoprogra-mi ranja itd. ter tudi to, koliko je ±e obremenjen, na katerem delu so proste kapacitete, poznati mora stopnjo zasedenosti spomina i.p.-skratka zavedati se mora sam sebe. Tudi človek si z leti pridobiva samozavedanje z vzgojo in izobraževanjem, kar- prinese na svet z rojstvom so zelo primitivne oblike zavesti. Človekova čustva in najrazlIčnejia duèevna stanja so bodisi delna ali sploèna podprogramska kondicija, ki ustvarja primerne pogoje za odločanje, zlasti, ko gre za zamegljena stanja okoli neodločne lege, za iniciacijo novih zamisli, ki pač ne nastajajo iz niča ali same od sebe, temveč zaradi vzdrževanja "duševnega tonusa", ki je individualno stanje možganskega sistema potrebno za zagotavljanje neapatičnosti, mrtvila. (V množičnih psihozah i.p. je mogoče močno poenotiti duševna stanja posameznih individuumov.) To bi nekako lahko primerjali z inkubacijsko stopnjo mišljenja. Podobno je mogoče ustvariti v stroju "programe za programe", katerih ena od nalog je prav ta, da ustvarja "duševno razpoloženje", ki je potrebno tudi stroju, ko črpa iz zalog znanja, podatkov, informacij, da črpa po nekem sistemu, tudi če je to naključni sistem, ter da kombinira znana dejstva v nova dejstva, čemur rečemo ustvarjanje t,j. inspirativna faza. Formalno je najmanj težav videti pri samem preverjanju nove zamisli - ampak preverjanju v "teoretičnem smislu", ko niso potrebne dodatne meritve ali celo ustvarjanje novih merilnih postopkov in naprav, da bi potrdili novo zamisel. Stroj bi lahko novote primerjal le s tistim, kar si je nabral v svojem pomnilniku. Vendar je prav enako tudi s človekom! Sele, ko si človek z meritvami pridobi dodatna dejstva, podatke, informacije in Jih primerja e novimi zamislimi, ve, pri čem je. Prav verjetno Je, da Churchev teorem omogoča ustvarjati, ko pojasnjuje načelno nedeterminiranost strojnega delovanja, take rešitve in računalniške zasnove, ki bodo približale strojna človekovi inteligenci, ter slednjo v specifičnem.smislu tudi presegle.
Li teratura:
1.	Douglas R. HOFSTADTERi .BOEDEL, ESCHER, BflCH (ADELPHI Ed.), 1984, Milano
2.	Gerald Oay. SUSSMflN: A COMPUTER MODEL of SKILL ACQUISITION, (AMERICAN ELSEVIER), 1975, N.York
3.	Georg K. ZIPFi HUMAN BEHAVIOR and the PRINCIPLE OF LEAST EFFORT (ADDISON-WELSEY PRESS), 1949, Cambridge