home *** CD-ROM | disk | FTP | other *** search
- - 1 -
- 3rd March, 1991
-
- About TS1ST in General (Statistical univariate measures)
- ======================
-
- Apply question mark ? with the program call for a brief description
- of a program.
-
- This package may be used and distributed freely for NON-COMMERCIAL,
- NON-INSTITUTIONAL, PRIVATE purposes, provided it is not changed in
- any way.
- ┌────────────────────────────────────────────────────────────────┐
- │ For ANY other usage (such as use in a business enterprise or a │
- │ university) or the full scale version contact the author for a │
- │ personal or a site license. │
- └────────────────────────────────────────────────────────────────┘
- Please do not distribute any part of this package separately.
- Uploading to BBSes is encouraged.
-
- The registered version is strictly for the registrant only.
- Identical programs must NOT be running on more than one computer at
- a time. Site licensed programs must not be run outside the licensed
- site.
-
- The programs are under development. Comments and contacts are
- solicited. If you have any questions, please do not hesitate to use
- electronic mail for communication.
- InterNet address: ts@chyde.uwasa.fi (preferred)
- Funet address: GADO::SALMI
- Bitnet address: SALMI@FINFUN
-
- The author shall not be liable to the user for any direct, indirect
- or consequential loss arising from the use of, or inability to use,
- any program or file howsoever caused. No warranty is given that the
- programs will work under all circumstances.
-
- Timo Salmi
- Professor of Accounting and Business Finance
- School of Business Studies, University of Vaasa
- P.O. BOX 297, SF-65101 Vaasa, Finland
-
- CONTENTS:
-
- 1. Acknowledgements
- 2. General Description
- 3. Formulae
- 4. The Other Statistical Programs in the Set
- 5. The Statistics Set for Other Computers
- 6. Release Notes for Statmeas
-
- - 2 -
-
-
- 1. ACKNOWLEDGEMENTS
-
- Useful help by the following colleagues in testing STATMEAS and
- discussing the statistical concepts is gratefully acknowledged:
- Associate Professor Martti Luoma, Lecturer Antti Kanto, and
- Assistant Markku Malkamäki.
-
-
- 2. GENERAL DESCRIPTION
-
- STATMEAS (Ver 1.9)
-
- STATistical MEASures program is part of the interactive
- statistical system by Timo Salmi. It is the first program in the
- set, and is intended for univariate statistical analysis. (For the
- other programs in the set see Chapter 4.)
- STAMEAS includes a built-in help system, which can be invoked by
- typing ? at any interactive question. Because of this built-in help,
- and the interactive nature of the program's user interface, no
- long-winding instructions have been included. (Who reads
- instructions anyhow?)
- The program calculates for a set of observations in accordance
- with the user's choice:
- 1) Standard statistical measures
- 2) Autocorrelations
- 3) Runs
- and
- 4) Draws frequency diagrams.
- The data can either be given from the keyboard or taken from a
- file. If the input is to be taken from a file it must first be
- prepared with some editor, or some word processor which includes an
- option for preparing ordinary ascii text.
- The data is given to the program in the following format:
-
- X1 X2 X3 !variable names (! denotes a comment)
- 3.56 6.32 -1.73
- 5.12 -4.21 9.18
- 14.2 5.11 0.31
- END !END is optional in a file
-
- A missing item in an observation is marked by a hash (#). E.g. if
- the first item of the second observation were missing, the
- observation should be written as # -4.21 9.18
- The items in an observation can be separated with blanks, as in
- the above, or with commas (,) e.g. 5.12,-4.21,9.18. The number of
- the intervening blanks is irrelevant, and can be customized for
- increased readability.
- Thus e.g. 5.12 -4.21 9.18
- and
- 5.12 -4.21 9.18
- are equivalent.
- A row can be continued using an ampersand (&). E.g. the variables
- could be given as
-
- X1 X2 &
- X3
- Alternatively, * can be used instead of & as the continuation
- marker.
- Comments can be added to the input data. If ! appears on a line
- all text after ! will be considered as a comment.
- A header can be entered on each page if output is directed to a
- file. To accomplish this start the very first line on the input file
- with a double exclamation mark (!!) and the rest of the line will be
- used as the header. Thus !! indicates a header, a single ! an
- ordinary comment.
- The maximum number of variables is 25. The maximum number of
- observations is 400 (for each variable). The public domain version,
- however, sets the limits at 4 and 100 respectively.
-
-
- 3. FORMULAE
-
- This chapter gives a selected summary of the univariate
- statistics formulas used in STATMEAS.
-
- Denote
- N = number of observations (of the relevant variable)
- x(j) = value of the j'th observation (for the relevant variable)
-
- Mean kx = Σ x(j) / N
-
- ____________________
- Standard deviation s = √ Σ (x(j)-kx)²/ (N-1)
-
-
- Variance s² = Σ (x(j)-kx)²/ (N-1)
-
-
- Coeffecient of variation = standard deviation / mean
-
- 3 3 ½
- Skewness g(1) = Σ (x(j)-kx) / (Ns ) standard error (6/N)
-
-
- 4 ½
- Kurtosis g(2) = Σ (x(j)-kx) / (Ns ) - 3 standard error (24/N)
-
-
- N-lag
- Σ (x(j+lag)-kx)(x(j)-kx)
- j=1
- Autocorrelation r(lag) = ----------------------------
- N
- Σ (x(j)-kx)²
- j=1
-
- _______________________
- │ lag-1
- Standard error std(r(lag)) = │ (1 + Σ 2 r(i)²) / N
- √ i=1
-
-
- t-value = r(lag) / std(r(lag))
-
-
- lag
- Ljung-Box Q = N(N+2) Σ r(i)² / (N - i)
- i=1
-
- Run statistics:
-
- Denote n1 = number of negative changes
- n2 = number of no changes
- n3 = number of positive changes
- n4 = n1 + n2 + n3
-
-
- expected number of runs = (n4(n4+1) - n1² - n2² - n3²) / N
-
- number of runs - expected number of runs + 0.5
- Standardized k-value = ----------------------------------------------
- std deviation of runs
-
-
- 4. THE OTHER STATISTICAL PROGRAMS IN THE SET
-
- STATMEAS is the first statistical program in a set of programs
- intended for analyzing statistical data. The second program in the
- set is STATREGR (STATistics: multiple REGRession), which also has
- been released. Plans include STATTRAN (STATistics TRANsformations)
- for effecting transformations (such as logarithms, differences,
- roots, etc.) on the data, and STATRCOR (STATistic Ranks and
- CORrelations) which is intended for parametric and non-parametric
- correlation analysis.
- STATTRAN, STATRCOR, and STATLADR have now been released to
- complete the set. Each of the programs appears in a different .arc
- file. The names are:
- TS1STxx.ARC <-- STATMEAS
- TS2STxx.ARC <-- STATREGR
- TS3STxx.ARC <-- STATTRAN
- TS4STxx.ARC <-- STATRCOR
- TS5STxx.ARC <-- STATLADR
-
-
- 5. THE STATISTICS SET FOR OTHER COMPUTERS
-
- The Statistical programs by Timo Salmi are (were) also available
- for ill-fated the Sinclair QL computer. Named STATPREP the system is
- part of a Public Domain library for QUANTA members. The descriptions
- of the files in the Quanta Library are given below as taken directly
- from a QL database.
- The central part of the package has also been programmed for the
- VAX/VMS by the author, but he input and output commands of the VAX
- "REGRE" and "LADRE" programs are in Finnish, and the VAX versions
- are not public domain. (LADRE is a Least Absolute Deviation multiple
- REgression analysis.)
-
- In Quanta Library:
-
- STATPREP_DOC
- Quill documentation of Timo Salmi's statistical package.
- Associated programs STATPREP_EXE, STATTRAN_EXE, STATMEAS_EXE,
- STATRCOR_EXE, STATREGR_EXE, STATDIAG_EXE, and STATMENU.
-
- STATMENU
- Besides EXEC_W'ing the statistical package programs you can
- LRUN STATMENU and let it take care of the calls. Makes the
- package still easier to use. Alter the default device(s)
- (FDK1_) for your own configuration, if necessary.
-
- STATPREP_EXE (statistics preparation editor)
- An editor of a kind used for entering the data for the
- associated statistical programs. Maximum 10 variables with 200
- observations (each).
-
- STATTRAN_EXE (transformations on variables)
- Used for making transformations on observations prepared with
- STATPREP_EXE. Transformations include taking logarithms,
- raising to power, taking roots, scaling, and multiplying
- between observations. Takes its input from a <statprep> file
- prepared by STATPREP_EXE and creates a new <statprep> file with
- the original and the transformed variables.
-
- STATMEAS_EXE (statistical measures)
- Standard statistical measures and frequency diagram for any of
- the variables in the data. Measures include minimum, maximum,
- mean, median, standard deviation, variance and kurtosis.
-
- STATRCOR_EXE (ranks and correlations)
- Calculates both the usual (Pearson's) correlation coefficients
- and Spearman's rank correlation coefficients. Also gives, on
- request, ranks for the observations of any of the variables.
- Note that computing Spearman's rank correlations takes quite
- awhile.
-
- STATAURU_EXE (autocorrelations and runs)
- Calculates autocorrelations up to the desired lag. Also
- calculates runs statistics.
-
- STATREGR_EXE (multiple regression)
- Full-scale multivariate regression on the data prepared with
- STATPREP_EXE (and STATTRAN_EXE).
-
- STATDIAG_EXE (scatter diagram)
- Draws scatter diagrams of the data prepared with STATPREP_EXE
- (and STATTRAN_EXE). The user has the choice of the X-axis and
- the Y-axis. If the user specifies a regression model (estimated
- by STATREGR_EXE), the predicted (i.e. regressed) values can be
- plotted on the scatter diagram. Furthermore, a scatter diagram
- of the residual can be drawn against any of the variables if
- the user has specified the regression model.
-
-
- 6. RELEASE NOTES FOR STATMEAS
-
- Versions 1.1 and 1.2 of STATMEAS include some minor changes and
- corrections in the user interface to make it fully compatible with
- the other programs in the set.
-
- Version 1.3: If the input file is not found, you have the choice
- of listing directories from within STATREGR. The directory routine
- has been rewritten for a more relaxed syntax. (For details see the
- information on DIRW in TSUTIL.INF in TSUTILxx.ARC version 1.8 or
- later.)
-
- Version 1.4: An attempt to rewrite a protected file no longer
- crashes the program.
-
- Version 1.5: This version introduces input recall and line
- editing. The special keys Del, CrLf, CrRg, CrUp, Home, End, and Esc
- are functional for this purpose. PgUp is the recall key. Line
- editing uses insert mode.
- Disk access has been made faster (the program has its own cache).
- The directory routine has been updated.
- Read-only files can be read properly.
-
- Version 1.6: While fixing a bug in the multiple regression
- program I also made some enhancements in the routines which are
- common to all my statistical programs. Most importantly the insert
- key has been made functional in the line editing.
-
- Version 1.7: The line editing potential of the program has been
- improved. When the task is given from the keyboard, and continuation
- lines are used, the repeated input recall (CursorUp) gets each line
- in turn. Ctlr-C and break-key abort have been enhanced.
- A line can be continued using an &, or a * at the end. Now also the
- backslash \ is accepted. This was suggested by Tuomas Eerola.
- Some help texts within the program have been extended.
-
- Version 1.8: Several improvements to the nuts and bolts of the
- user interface.
- The new usage of the call is
- PROGNAME [/h(elp)] [/iInputFileName] [/oOutputFileName] [/cColumnsPerRow]
- (the /c option, which regulates the width of the output, is for
- registered versions, only). If you use the /i switch, it stuffs the
- InputFileName into the appropriate recall buffer. This means that
- when the program asks you for the input file name, you can invoke
- the input file name just by pressing the CursorUp key. (The same
- goes for the /o switch, respectively.) This is very convenient, if
- you use the program many times successively making small changes in
- your data in between. (This assumes, of course, that you have a
- command line editor like DOSEDIT or CED to recall previous MsDos
- commands. These common shareware programs can be obtained from any
- well-stocked BBS or FTP site.)
- The printer readiness test has been rewritten to be more general.
- The earlier test failed for some printers, because the codes the
- printers send when they are offline are not standardized.
- The "file exists, overwrite?" question is no more asked when the
- output file is prn, in other words when the output is directed to
- the printer.
- The user has now a choice of a left margin from 0 to 20 blanks
- when output is directed to the printer.
- The user has now a choice between formfeed and four blank lines
- to start each new page of output.
- When an input file is not found, the user is given the choice of
- listing a directory. The directory routine has been rewritten.
- The file ready message now also includes the file side besides
- the name.
-
- Version 1.9: The calcualtion of the coefficient of variation was
- added.
-