Data Analysis on PC's and UNIX
Gary Perlman
perlman at wanginst.UUCP
Fri Oct 3 07:48:59 AEST 1986
Time for some announcements concerning UNIX|STAT, my little stat package
for UNIX and MSDOS.
I have just finished a new Handbook. It incorporates the printed manual
entries that until now were printed separately. It is about 100 pages.
Separate printed manual entries are no longer available. The same
liberal copyright for UNIX|STAT software has been extended to the Handbook.
UNIX|STAT now comes with an online version of the examples in the
Handbook, and several new programs. There is a new tutorial overview
of the package written with the MSDOS user in mind; however, a distribution
scheme has not been worked out yet.
The first thing that follows is general information about UNIX|STAT.
After that, I have put a list of the changes for the last few versions
of the package. Use it to see if you should order an update.
----------------------------------------------------------------------
UNIX|STAT 5.2
Gary Perlman
DATA MANIPULATION & ANALYSIS PROGRAMS
FOR UNIX and MSDOS
UNIX|STAT is a set of over 20 data manipulation and analysis programs
developed at the University of California, San Diego and at the Wang
Institute of Graduate Studies by Gary Perlman. The programs follow the
UNIX philosophy that individual programs should be tools that do one task
well and produce output suitable for input via pipes to other programs.
Interactive use is supported in the command line interpreter, which also
provides a programming language for complex analyses. Typical usage
involves a pipeline of transformations of data followed by input to an
analysis program, summarized schematically by:
INPUT DATA | TRANSFORM | ANALYSIS | OUTPUT RESULTS
Functions built into many statistical packages (e.g., graphics and editing)
are not re-invented in UNIX|STAT, which delegates such responsibility to
standard tools.
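
To illustrate the pipeline idea only, here is a minimal sketch in Python. It is
not UNIX|STAT code and does not use any UNIX|STAT program; the function names
(read_rows, extract_column, summarize, run_pipeline) are hypothetical, and the
sample data is made up. Each stage does one task and hands its output to the
next, much as separate programs would through a pipe.

```python
import io
import math

def read_rows(stream):
    """INPUT DATA: yield each non-blank line as a list of whitespace-separated fields."""
    for line in stream:
        fields = line.split()
        if fields:
            yield fields

def extract_column(rows, index):
    """TRANSFORM: keep one column (0-based index) and convert it to float."""
    for fields in rows:
        yield float(fields[index])

def summarize(values):
    """ANALYSIS: return count, mean, and sample standard deviation."""
    data = list(values)
    n = len(data)
    mean = sum(data) / n
    # Sample variance divides by n - 1; it is undefined for a single value.
    var = sum((x - mean) ** 2 for x in data) / (n - 1) if n > 1 else float("nan")
    return n, mean, math.sqrt(var)

def run_pipeline(stream, index):
    """Chain the stages the way a shell pipe would chain separate programs."""
    return summarize(extract_column(read_rows(stream), index))

if __name__ == "__main__":
    # Made-up sample: a label column followed by two numeric columns.
    sample = io.StringIO("a 1 10\nb 2 20\n\nc 3 30\n")
    n, mean, sd = run_pipeline(sample, 2)
    # OUTPUT RESULTS
    print(f"n={n} mean={mean:.2f} sd={sd:.2f}")
```

Because each stage is a generator or a plain function, any one of them can be
swapped out without touching the others, which is the property the schematic
above is meant to convey.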
DATA MANIPULATION PROGRAMS:
abut        join data files
colex       column extraction
dm          column oriented data manipulator
dsort       multiple key data sorting filter
maketrix    create matrix type file from free-form file
perm        permute line order
probdist    probability distribution functions
ranksort    rank order columns
repeat      repeat a pattern or file
reverse     reverse lines, columns, and characters
series      generate a series of numbers
transpose   transpose matrix type file
validata    verify data file consistency
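
As a rough idea of what transposing a matrix type file involves, here is a
short Python sketch. It is not the transpose program itself; it assumes a
rectangular file (every line has the same number of fields), and the names
transpose and format_rows are hypothetical.

```python
def transpose(rows):
    """Transpose a rectangular list of rows; raise ValueError if rows differ in length."""
    rows = [list(r) for r in rows]
    if not rows:
        return []
    width = len(rows[0])
    for number, row in enumerate(rows, start=1):
        if len(row) != width:
            raise ValueError(f"line {number} has {len(row)} fields, expected {width}")
    # zip(*rows) pairs up the i-th field of every row, giving the columns.
    return [list(col) for col in zip(*rows)]

def format_rows(rows):
    """Render rows as tab-separated lines, one row per line."""
    return "\n".join("\t".join(row) for row in rows)

if __name__ == "__main__":
    matrix = [line.split() for line in ["1 2 3", "4 5 6"]]
    print(format_rows(transpose(matrix)))
```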
DATA ANALYSIS PROGRAMS:
anova       multi-factor analysis of variance
calc        interactive algebraic modeling calculator
contab      contingency tables and chi-square
desc        descriptions, histograms, frequency tables
dprime      signal detection d' and beta calculations
oneway      one-way anova and t-test with error-bar plots
pair        paired data statistics, regression, plots
regress     multivariate linear regression and correlation
stats       report summary statistics
ts          time series analysis and plots
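
For readers unfamiliar with the signal detection measures named above, the
following Python sketch computes d' and beta from a hit rate and a false alarm
rate using the usual textbook formulas for the equal-variance Gaussian model.
It is an illustration only; whether the dprime program computes them in
exactly this way is an assumption, and the function name dprime_beta is
hypothetical.

```python
import math
from statistics import NormalDist

def dprime_beta(hit_rate, false_alarm_rate):
    """Return (d', beta) under the equal-variance Gaussian signal detection model."""
    if not (0.0 < hit_rate < 1.0 and 0.0 < false_alarm_rate < 1.0):
        # z-scores are infinite at 0 and 1, so such rates need correcting first.
        raise ValueError("rates must lie strictly between 0 and 1")
    z = NormalDist().inv_cdf           # inverse of the standard normal CDF
    z_hit = z(hit_rate)
    z_fa = z(false_alarm_rate)
    d_prime = z_hit - z_fa             # sensitivity: distance between the two means
    # beta is the likelihood ratio at the criterion: pdf(z_hit) / pdf(z_fa).
    beta = math.exp((z_fa ** 2 - z_hit ** 2) / 2.0)
    return d_prime, beta

if __name__ == "__main__":
    d, b = dprime_beta(0.8, 0.2)
    print(f"d'={d:.3f} beta={b:.3f}")
```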
FEATURES:
easy to use programs (negligible training period)
simple input formats (free format field oriented)
used in pipelines with other utilities (sort, print)
flexible data manipulation
several simple lineprinter plotting options
data validation (range and type checking)
documentation includes online manual entries, Handbook
consistent option conventions with online help
runs on any UNIX System (V6, V7, 2.8BSD, 4BSD, System V, etc.)
runs on MSDOS 2.0 and 3.0 with 96K (IBM, Wang, AT&T, etc.)
usually less than a few seconds per analysis
liberal copyright (but can't be distributed for gain)
in use at hundreds of MSDOS and UNIX sites since 1980
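
To give a feel for what range and type checking of free format, field oriented
data can mean, here is a small Python sketch. It is not the validata program
and makes no claim about its options or output; the function names and the
ranges argument (a mapping from 0-based column index to a low/high pair) are
hypothetical.

```python
def is_number(field):
    """Return True if the field parses as a floating point number."""
    try:
        float(field)
        return True
    except ValueError:
        return False

def validate(lines, ranges=None):
    """Return a list of problems found in whitespace-separated data lines."""
    ranges = ranges or {}
    problems = []
    expected_width = None
    column_numeric = {}
    for number, line in enumerate(lines, start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if expected_width is None:
            expected_width = len(fields)  # the first data line sets the width
        elif len(fields) != expected_width:
            problems.append(
                f"line {number}: {len(fields)} fields, expected {expected_width}")
        for col, field in enumerate(fields):
            numeric = is_number(field)
            # The first value seen in a column sets that column's type.
            if column_numeric.setdefault(col, numeric) != numeric:
                problems.append(f"line {number}: column {col + 1} changes type")
            if numeric and col in ranges:
                low, high = ranges[col]
                if not low <= float(field) <= high:
                    problems.append(
                        f"line {number}: column {col + 1} value {field} "
                        f"outside [{low}, {high}]")
    return problems

if __name__ == "__main__":
    data = ["s1 10 0.5", "s2 12 0.7", "s3 x 0.9 extra", "s4 99 0.1"]
    for problem in validate(data, {1: (0, 50)}):
        print(problem)
```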
WARRANTY AND DISTRIBUTION CONDITIONS:
Carefully read the following warranty and conditions. If you do not
find them acceptable, you should not order or use UNIX|STAT.
Warranty. UNIX|STAT is provided ``as is'' and without any warranty
expressed or implied. There is no claim that UNIX|STAT is suited to your
needs, nor that it will run on your particular hardware/software
configuration. The availability of, and the programs in, UNIX|STAT may change
without notice.
Copyright Conditions. You may make copies of any tangible forms of
UNIX|STAT, provided that there is no material gain involved, and provided
that the information in this flyer accompanies every copy. You may not
modify the source code for any purposes other than getting the programs to
work on your system. You may not distribute any modified source code to
any sites other than your own. Any costs in compiling or porting UNIX|STAT
to your system are yours alone, and not any other party's.
Use At Your Own Risk. All risk of loss of any kind due to use of
UNIX|STAT is with you, the user. You are responsible for all mishaps, even
if the programs prove to be defective. You must be trained at your expense
to learn to use the programs. In particular, you are hereby given fair
warning that data analysis programs make many assumptions about data. These
assumptions affect the validity of conclusions you may make based on the
programs. References to appropriate statistical sources are made in the
UNIX|STAT Handbook and in the manual entries for specific programs.
ORDERING INFORMATION:
Carefully read the instructions below. Orders not following them may
be returned or even discarded. All prices include delivery and should
be prepaid to G. Perlman. Checks must be in US funds, drawn on a US bank.
Orders that include any terms or conditions other than those in this notice
may be returned or discarded. Company purchase orders should be avoided in
favor of a personal check, with a request for a receipt if needed. Orders
must include a delivery mailing label acceptable to the post office, and
international orders must include the country name on the label. Orders
without mailing labels may not be deliverable or returnable.
UNIX Version of UNIX|STAT:
Price: $20
Contents: Programs (C language) & Preformatted Manual Files
Space: 450K bytes for source and manuals, 450K bytes for binaries
Format: 600 foot half inch 9 track mag tape, 1600 bpi tar format
MSDOS Version of UNIX|STAT:
Price: $15
Contents: Preformatted Manuals and Executables
Space: 700K for binaries and online manuals
Format: 2S/2D MSDOS 5.25 inch floppy diskettes
Handbook:
Price: $10
Contents: Examples, Reference Materials, CALC & DM Manuals, Manual Entries
Format: Typeset Manual (over 100 pages, 3 hole, shrink wrapped)
Note: This handbook incorporates the previously separately printed manuals
CONTACT:
Gary Perlman
Wang Institute of Graduate Studies
Tyng Road
Tyngsboro, MA 01879 USA
UUCP: decvax!wanginst!perlman.uucp CSNET: perlman at wanginst.csnet
NOTES:
UNIX is a trademark of AT&T Bell Laboratories.
MSDOS is a trademark of Microsoft.
UNIX|STAT is NOT a product of any company or organization.
----------------------------------------------------------------------
Changes for Release 5.0 March 1985
General
reworked to increase portability, reliability, usability
most commands now use standard option parser (getopt)
all calculations now done in double precision
improved error messages
better approximations for F-ratios
efficiency of I/O improved
most programs ported to MSDOS
improved random number seeding on UNIX (perm, dm)
standard exit status (0) on successful runs
version control added
Specific Programs
regress: NEW partial correlation analysis
colex: added as alternative to dm
trans: added as alternative to dm
Changes for Release 5.1 November 1985
General
several minor bugs removed
full package ported to MSDOS
Specific Programs
calc: some bugs fixed, ported to MSDOS
stats: NEW program for simple statistics
trans: no longer distributed (dm)
Changes for Release 5.2 January 1986
General
on-line help in most programs (-LOV options)
UNIX|STAT Handbook and new manual entries
on-line manuals on MSDOS
Specific Programs
probdist: 5 probability distributions with random number generation
pof: deleted from distribution (probdist)
chisq: deleted from distribution (probdist)
contab: NEW crosstabs and chi-square program
pair: plotting options added
dataplot: deleted from distribution (pair)
anova: program more robust
oneway: error bar plots, unweighted means solution
regress: better support for residual plotting
vincent: no longer distributed (use ts -l option)
Changes for Release 5.2 October 1986 (Some changes not yet on MSDOS)
General
Second Edition of Handbook (with manual entries)
Handbook examples now online
Manual entries no longer distributed separately from Handbook
for infinite F ratios, 9999 is used
Specific Programs
cat: NEW program added for MSDOS compatibility
colex: formatted output of columns added
dm: some new operators added, bugs fixed
dm: random seed can now follow R[AND] operator
dm: no longer checks for non-numerical inputs (use number(si))
dsort: NEW program for sorting data files by columns
ff: NEW program for pagination, simple text formatting
fpack: NEW program for packing files into plain archives
perm: sorting options added
regress: improved matrix calculations
--
Gary Perlman Wang Institute Tyngsboro, MA 01879 (617) 649-9731
UUCP: decvax!wanginst!perlman CSNET: perlman at wanginst