XLSTAT User Guide

Published February 2, 2021 (author: 味同嚼蜡)

XLSTAT

Generalities

Installation

Running XLSTAT - XLSTAT Manager

XLSTAT direct access

General remarks

Dialog boxes

Numerical efficiency

Missing data / Numerical and categorical variables

Error messages

Shortcut keys

References

Help files

Statistical tools

6D Plots

ANOVA / ANCOVA

AxesZoomer

Categorical sorting

Clustering 1 & 2

Correlations / Principal Component Analysis (PCA)

Correspondence analysis

Crossed sorting / Flat sorting

DataFlagger

Descriptive statistics

Discretize data

Discriminant analysis

Easy labels (two clicks to add labels on a plot)

Extract a sample of rows from a dataset

Factor analysis

Fit data to density functions and test fit

Histograms

Kruskal-Wallis test

MicroMover

MinMax Search

Models for binary response data (Logit, Probit, ...)

MotriMax

Multidimensional Scaling (MDS)

Multiple correspondence analysis

Non-linear fitting (GenFit)

Odds ratio

Similar rows detection

Plot Transformer

Regression

Test for comparing two proportions

Tests for comparing two samples (Student, Wilcoxon, Fisher, ...)

Tests on contingency tables (Chi-square and Exact tests)

Transposition

Microsoft Excel and Statistics

License conditions

Generalities

Installation

Required software : Excel 5.0 or above

Required hardware : XLSTAT is compatible with all PCs that can run Microsoft Excel 5.0. However, for optimal performance, a 486DX33 PC with 8 MB of RAM or a PowerPC-based Macintosh is recommended.

To install XLSTAT, copy all the files from the XLSTAT disk to the hard disk. You load XLSTAT by opening its file with the Open command in the File menu of Excel or, better, by using the add-in manager in the Tools menu. Double-clicking the file name or icon in Windows Explorer or the File Manager also works.

The professional version of XLSTAT is exactly the same as the shareware one. The only differences are that it is not time limited and that a free update service is provided if the author is contacted at fahmy@.

XLSTAT website is :

Running XLSTAT - XLSTAT Manager

To run XLSTAT, load the file by opening it with the Excel File|Open command or by using the Tools|Add-in Manager menu.

Once XLSTAT is opened, the available commands can be accessed directly from the menu bar, from the XLSTAT Manager, or from the XLSTAT toolbar.

Opening tools through the XLSTAT Manager also makes them available in the XLSTAT toolbar, because they are added to it after selection. To find out which tool corresponds to which toolbar icon, leave the mouse cursor on the icon and read the text that appears. When using the XLSTAT Manager to select tools, you can also save the configuration. The next time XLSTAT loads, it automatically opens the tools you saved in the configuration. When you click on save, the options corresponding to the “O” button of the toolbar are also saved for the next sessions.

You can also activate the XLSTAT Manager at any time, using the XLSTAT Manager command in the XLSTAT menu.

To edit the XLSTAT standard ordering sheet, you can open it from XLSTAT or from your XLSTAT directory. After you have registered, you can enter your personal license number, given by the author or the distributor. Your copy will then immediately become XLSTAT-Pro!

XLSTAT direct access

If you want to open XLSTAT directly at any time by just pressing a button, you only need to open the direct access macro once in Excel. It adds a blue button to the standard toolbar of Excel. You can move or delete this button like any other Excel button.

General remarks

In the options, you can choose to store the plots in standard chart sheets. If you don't select this option, the plots are stored on the same sheet as the output tables, in the upper left corner of the sheet.

XLSTAT sometimes creates intermediate sheets that are stored in the results workbooks. You can view them by selecting the corresponding option.

Each time you load XLSTAT, the reference option is set to A1 if it was on R1C1. This is necessary for XLSTAT to work properly. If you change that option while working, don't forget to change it back before using XLSTAT again.

Each XLSTAT copy is protected by law, and can only be used on one computer at a time. Every single shareware release of XLSTAT is time limited.

To obtain the latest version (updates are released frequently), please contact Thierry FAHMY at the following e-mail address : fahmy@

Don't hack this software and, more importantly, don't spread illegal copies. Respect other people's work, or shareware will disappear. Don't forget that if you buy XLSTAT you will get free updates for a year!

The Author disclaims any responsibility for damages or subsequent loss caused by using XLSTAT.

Dialog boxes

Dialog boxes have been created to be as straightforward and uniform as possible.

The output range entry lets you select on an Excel sheet the cell that will be the upper left corner of the results tables.

When you select data, variables always need to be stored in columns. The column labels correspond to the labels or names of the explanatory variables. If you select the variable labels within the data range, you must tick the corresponding option. If you select the observation labels within the data range, you must also tick the corresponding option.

NB: The output range does not need to be on the same sheet or in the same workbook as the input table.

Numerical efficiency

XLSTAT is efficient for most methods, but it can be slow when there are a lot of variables and rows. Computation times depend on your computer's processor and RAM. This is because XLSTAT is written entirely in VBA, and Excel macros are not compiled. On the other hand, because XLSTAT is only VBA, you can use it on any computer and any system that runs Excel.

Missing data - Numerical and categorical data

XLSTAT is able to detect missing data for the following tools : PCA / Discriminant Analysis / Classification / CA / MCA / Histograms / Descriptive Statistics / Factor analysis.

Missing data must be coded with empty cells.

Categorical variables can be coded with any symbol except empty cells (the only way to code missing data).

Numerical data must be numbers.

Error messages

There are very few error messages, and they are explicit enough for you to easily understand what is wrong.

The most frequent message appears because you did not specify whether variable or observation labels are included, or because you forgot to select the upper left corner of the output tables.

It can also occur because there is some non-numerical data in a column that should correspond to a numerical variable.

References

To create XLSTAT, many references have been consulted, among which :

a (1991), Probabilités, Analyse de données et statistiques, Technip, Paris

one (1993), Biométrie, Masson, Paris

A. Agresti (1990), Categorical Data Analysis, Wiley Interscience

Shortcut Keys

Ctrl +M : activates the XLSTAT Manager

Help files

On PC and Macintosh computers, the XLSTAT help topics can be accessed either from the XLSTAT menu or from each dialog box. On PCs, the help file can also be accessed from the file manager.

Users who would like a printed version of the help file can print the Word6 version of the help file.

Ordering XLSTAT

To know how to order XLSTAT, please visit the XLSTAT website and go to the order page, or open the ordering sheet, which is distributed with XLSTAT modules. For any further information, please contact Thierry Fahmy at fahmy@

6D Plots

This tool is one of the simplest in XLSTAT in terms of the mathematical background it requires from the user, but it helps create striking charts that will impress many people who do not know XLSTAT. With this tool you can represent up to 6 dimensions at a time (or even seven if you use the labels to display information), 4 of which can be numerical and 2 categorical. It can easily be used to plot 2 dimensions while differentiating group membership (a third dimension), or to plot four dimensions to distinguish the financial results over two years for various geographies and products. The only compulsory entries for this tool are the first two numerical dimensions and the output location (it can be on an Excel spreadsheet or on a separate chart sheet). You can also simply use this tool to draw two-dimensional plots with object formats other than the traditional Excel ones (instead of using small circles to represent data, you can use any image or object you want!).

Dialog box entries

Select Xs : select in this entry box the data on the Excel sheet that correspond to the first dimension (numerical data only). This entry is compulsory and is represented on the abscissa axis.

Select Ys : select in this entry box the data on the Excel sheet that correspond to the second dimension (numerical data only). This entry is compulsory and is represented on the ordinate axis.

Select 3 : select in this box the data on the Excel sheet that correspond to the third dimension (optional). The third dimension is represented by the size of the object.

Select 4 : select in this box the data on the Excel sheet that correspond to the fourth dimension (optional). The fourth dimension is represented by the size of the object.

Select Groups 1 : select in this box the data on the Excel sheet that correspond to the fifth dimension. Whatever the data type in this selection, XLSTAT treats them as categorical data. The fifth dimension is represented by the color of the object.

Select Groups 2 : select in this box the data on the Excel sheet that correspond to the sixth dimension. Whatever the data type in this selection, XLSTAT treats them as categorical data. You can select data in this zone even if there is no Groups 1 selection. The sixth dimension is represented by the type of the object.

Data labels included : if this option is selected, you must have included and selected the data labels when selecting the Xs data. The labels must be on the left side of the Xs.

Variable labels included : if this option is selected, you must have included and selected a name for each dimension.

Outputs : the output is the chart itself. However, you can decide to put it on a special area of a spreadsheet (then select a cell of your choice on this spreadsheet) or on a separate chart sheet.

Autocolor : deselect this option if you want to control the color and the shape of the objects that will be used instead of the usual Excel formats (dots, circles, squares). You can then change the default colors suggested by XLSTAT. Although XLSTAT suggests only four shapes (circles, squares, triangles and diamonds), any kind of shape or image can be added. With Excel97 (and later versions) you can take full advantage of the shapes and textures that are available.

Example

This example shows a simple 3-dimension plot where the first two dimensions are represented on the X and Y axes, and the third dimension is represented using the object size. Other examples are also available.

Table 1 : data on Excel sheet

Figure 1 : dialog box as it should be filled in

To produce the plot below, the autocolor option has been deselected. Then the default object suggested by XLSTAT was replaced, and the formats of the axes and of the background were manually changed by the user. The MicroMover tool of XLSTAT has been used to move all labels a little bit up, all at the same time.

Figure 2 : 3D representation of the data

AxesZoomer

A simple but time-saving utility. Select a plot, then run this tool, and you can modify the min and max of the axes until you are satisfied.

Categorical sorting

This tool makes both increasing and decreasing sorting (with the usual alphabetical conventions) of several categorical variables easy to achieve, with decreasing priorities from the first to the last variable. The sorting is done first on the first variable, then on the second variable within the categories of the first variable, and so on until the last variable.

Note : if you want to sort numerical data, it is better to use the classical Excel sorting tool.
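As an illustration only (this is not XLSTAT's own VBA code), the priority rule described above can be sketched in Python. The sample rows and the function name `categorical_sort` are hypothetical. Note that Python compares strings by character code, which is case-sensitive and may differ from the alphabetical conventions Excel uses.

```python
# Hypothetical sample rows: (region, product, status). All values are illustrative.
rows = [
    ("North", "Tea", "open"),
    ("South", "Coffee", "closed"),
    ("North", "Coffee", "open"),
    ("South", "Coffee", "open"),
    ("North", "Coffee", "closed"),
]


def categorical_sort(rows, descending=False):
    """Sort on the first column, then on the second within the first, and so on."""
    # Tuples compare element by element, so the first variable has the
    # highest priority and the last variable the lowest.
    return sorted(rows, key=lambda row: tuple(str(v) for v in row), reverse=descending)


ascending = categorical_sort(rows)
descending = categorical_sort(rows, descending=True)

for row in ascending:
    print(row)
```

Every value is converted to text before comparison, which mirrors the idea that the tool treats all selected columns as categories rather than numbers.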

Cluster analysis 1 & 2

Cluster analysis is a data analysis tool that allows you to classify into groups a set of observations described by numerical variables. You can specify the number of groups you want to create. XLSTAT offers you two different clustering techniques. It is often a good idea to try both and compare the results.

NB: If you want to classify observations described by categorical variables, it is advised to first run a Multiple Correspondence Analysis and then use the coordinates of the observations on the factorial axes (in the last results table of MCA) as numerical variables to classify the observations.

XLSTAT now offers the possibility to normalize and weight the variables. These options are very important for the two methods suggested here, which are based on Euclidean distances : the scale of a variable might have a major effect on the final result. One can cancel the scale effect by normalizing the variables. Weighting the variables enables you to give more or less importance to the variables, depending on how much you want them to influence the final result of the clustering.
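The scale effect mentioned above can be seen in a small Python sketch. It assumes that normalizing means centering each variable and dividing by its standard deviation, and that a weight multiplies the squared difference of a variable. Both are assumptions made for illustration, not a description of what XLSTAT computes. The data and function names are hypothetical.

```python
import math
import statistics


def standardize(columns):
    """Center each column and divide it by its (population) standard deviation."""
    result = []
    for col in columns:
        mean = statistics.fmean(col)
        sd = statistics.pstdev(col)  # a constant column would give sd == 0
        result.append([(x - mean) / sd for x in col])
    return result


def weighted_distance(a, b, weights):
    """Euclidean distance where each squared difference is multiplied by a weight."""
    return math.sqrt(sum(w * (x - y) ** 2 for x, y, w in zip(a, b, weights)))


# Hypothetical data: height in metres and income in currency units.
height = [1.60, 1.70, 1.80]
income = [20000.0, 30000.0, 40000.0]

raw_rows = list(zip(height, income))
std_rows = list(zip(*standardize([height, income])))

# Without normalization the income column dominates the distance entirely.
raw = weighted_distance(raw_rows[0], raw_rows[2], [1.0, 1.0])
# After normalization both variables contribute equally.
std = weighted_distance(std_rows[0], std_rows[2], [1.0, 1.0])
# A weight below 1 reduces the influence of the second variable.
down_weighted = weighted_distance(std_rows[0], std_rows[2], [1.0, 0.25])

print(raw, std, down_weighted)
```

On the raw data the distance is essentially the income difference alone, which is why normalizing matters for methods based on Euclidean distances.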

Cluster analysis 1

Classification using the centroids method (also known as k-means). It is a quick and powerful method, but it does not give any help in finding the ideal number of clusters.

In the dialog box, you can set the number of iterations. Note that the method might give different results each time (because the starting point is selected at random). Only the best classification (in the sense of inertia) is presented in the results tables.

Note : the sum of the between-groups and within-groups inertia is constant and equal to the total inertia. To choose between the different iterations, this tool selects the one with the highest between/within ratio.
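The selection rule in the note above can be sketched in Python as follows. This is a minimal, generic k-means with random restarts, written for illustration and not taken from XLSTAT. The function names, the number of restarts and the sample points are all assumptions.

```python
import random


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def centroid(points):
    return tuple(sum(c) / len(points) for c in zip(*points))


def kmeans_once(points, k, rng, max_iter=100):
    """One k-means run from a random start; returns labels and within-groups inertia."""
    centers = rng.sample(points, k)
    labels = None
    for _ in range(max_iter):
        new_labels = [min(range(k), key=lambda j: sq_dist(p, centers[j])) for p in points]
        if new_labels == labels:
            break  # assignments are stable, so centers are the group centroids
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the previous center if a group becomes empty
                centers[j] = centroid(members)
    within = sum(sq_dist(p, centers[lab]) for p, lab in zip(points, labels))
    return labels, within


def best_kmeans(points, k, restarts=10, seed=0):
    """Repeat k-means and keep the run with the highest between/within ratio."""
    rng = random.Random(seed)
    total = sum(sq_dist(p, centroid(points)) for p in points)
    best = None
    for _ in range(restarts):
        labels, within = kmeans_once(points, k, rng)
        between = total - within  # total inertia = between + within
        ratio = between / within if within > 0 else float("inf")
        if best is None or ratio > best[0]:
            best = (ratio, labels, between, within)
    return best, total


# Hypothetical data: two well separated groups of three points.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
(ratio, labels, between, within), total = best_kmeans(points, 2)
print(labels, round(between, 3), round(within, 3), round(total, 3))
```

Because the total inertia is fixed by the data, maximizing the between/within ratio is the same as minimizing the within-groups inertia.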

Cluster analysis 2

Ascendant hierarchical cluster analysis using Ward's clustering technique. If there are over 85 entries, it is not possible to see the dendrogram.

If you want to use a dissimilarity measure other than the Euclidean distance (Ward's method), you may give XLSTAT the dissimilarity matrix as input instead of the raw data. You can also select a correlation matrix as the input data. In that case, the dissimilarities are automatically computed using the following formula : dissimilarity(i,j) = SquareRoot[2 x (1 - correlation(i,j))], which places the variables on a sphere with radius 1.
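The conversion formula above is easy to check by hand. The Python sketch below applies it to a small correlation matrix whose values are hypothetical. The function name is also an assumption.

```python
import math


def correlation_to_dissimilarity(corr):
    """Apply d(i,j) = sqrt(2 * (1 - r(i,j))) to every cell of a correlation matrix."""
    # max(0.0, ...) guards against tiny negative values caused by rounding.
    return [[math.sqrt(max(0.0, 2.0 * (1.0 - r))) for r in row] for row in corr]


# Hypothetical 3 x 3 correlation matrix.
corr = [
    [1.0, 0.5, 0.0],
    [0.5, 1.0, -0.5],
    [0.0, -0.5, 1.0],
]

diss = correlation_to_dissimilarity(corr)
for row in diss:
    print([round(d, 4) for d in row])
```

A correlation of 1 gives a dissimilarity of 0, a correlation of 0 gives the square root of 2, and a correlation of -1 gives 2, the diameter of a sphere with radius 1.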

Results :

With one option, XLSTAT does not build groups and only prints the abscissa (on dendrogram) table and the node description table, and then creates the level histogram and the dendrogram. If there are too many entries, XLSTAT stops to ask you how many groups you want to build. If you don't know, put 1 and follow the advice that will be written below one of the tables.

With another option, XLSTAT produces the results described above, but with two more tables containing the data classification results.

A further option relies on an inertia criterion.

Correlations / Principal Component Analysis

Introduction

Correlation coefficients make it possible to analyze relations between numerical or ordinal categorical variables. They are a way to check whether two variables evolve in the same way or in opposite ways, or whether they are independent. If you have N variables, the result of this analysis is an NxN symmetric table with correlation coefficients for each pair of variables. The correlation coefficients matrix is the most common basis for starting a Principal Component Analysis, whose most important results are the construction of N independent factors (linear combinations of the initial variables that have null correlation coefficients with one another) and an optimized representation of all the data on a P-dimensional plot, where P is often 2 in practice, although it can be from 1 to 3. The quality of the plot is given by the percentage of the N-dimensional variability it has been able to show.
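To make the link between the correlation matrix, the factors and the percentage of variability concrete, here is a minimal Python sketch. It diagonalizes a small correlation matrix with cyclic Jacobi rotations, a generic textbook method chosen here for illustration. It is not claimed to be what XLSTAT uses, and the matrix values and function names are hypothetical.

```python
import math


def jacobi_eigen(matrix, sweeps=50, tol=1e-20):
    """Eigenvalues and eigenvectors of a small symmetric matrix (cyclic Jacobi rotations)."""
    n = len(matrix)
    a = [row[:] for row in matrix]
    v = [[float(i == j) for j in range(n)] for i in range(n)]
    for _ in range(sweeps):
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off < tol:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if abs(a[p][q]) < 1e-15:
                    continue
                # Rotation angle that zeroes a[p][q].
                theta = 0.5 * math.atan2(2.0 * a[p][q], a[q][q] - a[p][p])
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):  # rotate columns p and q of a and v
                    a[k][p], a[k][q] = c * a[k][p] - s * a[k][q], s * a[k][p] + c * a[k][q]
                    v[k][p], v[k][q] = c * v[k][p] - s * v[k][q], s * v[k][p] + c * v[k][q]
                for k in range(n):  # rotate rows p and q of a
                    a[p][k], a[q][k] = c * a[p][k] - s * a[q][k], s * a[p][k] + c * a[q][k]
    order = sorted(range(n), key=lambda i: a[i][i], reverse=True)
    values = [a[i][i] for i in order]
    vectors = [[v[k][i] for k in range(n)] for i in order]
    return values, vectors


def explained_variability(values):
    """Percentage of total variability carried by each factor, plus the running total."""
    total = sum(values)
    shares = [100.0 * val / total for val in values]
    cumulative = [sum(shares[: i + 1]) for i in range(len(shares))]
    return shares, cumulative


# Hypothetical correlation matrix for N = 3 variables.
corr = [
    [1.0, 0.6, 0.0],
    [0.6, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]

values, vectors = jacobi_eigen(corr)
shares, cumulative = explained_variability(values)
print([round(x, 4) for x in values])
print([round(x, 2) for x in cumulative])
```

Each eigenvector gives the coefficients of one factor as a linear combination of the standardized variables, and the eigenvalues of a correlation matrix sum to N. In this example the first two factors carry about 86.67% of the variability, which is the quality figure a 2-dimensional plot would report.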

Dialog box entries

Reference on sheet : select in this box the table that contains the data, with or without the column and row labels (if necessary select the labels options). The data must be numerical variables. If you want to do a full PCA, this table must include raw data (several individuals, in rows, described by several variables, in columns). But if all you have is a correlation or covariance matrix, you can analyze its structure and see the correlations circle.

Data must be numerical data. If there are missing data in your dataset (coded with empty cells), you can ask XLSTAT to remove the corresponding rows for all the calculations or only when the missing data are involved in the calculations. If the PCA option is selected, the rows containing missing data are deleted. For the correlations, you can choose whether you want to keep the rows with missing values or not.

Output range : select the cell that will correspond to the upper left corner of the results printed by XLSTAT.

Number of additional rows : if there are some data that you don't want to use for the computations but only to plot afterwards, you only need to include them at the bottom of the selected table and specify the number of rows that need to be used only at the end.

Number of additional variables : if there are some variables that you don't want to use for the computations but only to plot afterwards, you only need to include them at the right of the selected table and specify the number of variables that need to be used only after the computations.

NB : the classical correlation coefficient is usually calculated for continuous numerical data (integer or real). Be very careful before using it with ordinal numerical data : Kendall's and Spearman's rank correlation coefficients are much better for ordinal data.

Results : Correlation coefficients / Covariances

If you want to use the covariance matrix for the PCA and see the covariance matrix within the results, please select the corresponding option. The correlation matrix is usually preferred, because it removes scale effects (the scale of the values of the various variables is not taken into account, which makes their trends easy to compare).

This tool allows you to calculate the correlation coefficients matrices. You can either calculate the classical parametric coefficients, or Spearman's or Kendall's non-parametric rank correlation coefficients. Spearman's and Kendall's coefficients can be useful if you want to do PCA with ordinal data.
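To illustrate the difference between the classical coefficient and a rank coefficient, the Python sketch below computes Spearman's coefficient as the classical (Pearson) coefficient applied to ranks, with tied values sharing their average rank. This is the standard textbook definition, given here as an assumption about what a rank correlation tool computes. The data are hypothetical.

```python
import statistics


def ranks(values):
    """Average ranks (1-based); tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average
        i = j + 1
    return result


def pearson(x, y):
    """Classical parametric correlation coefficient."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den


def spearman(x, y):
    """Rank correlation: the classical coefficient computed on the ranks."""
    return pearson(ranks(x), ranks(y))


# Hypothetical data: y always increases with x, but not linearly.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [1.0, 4.0, 9.0, 16.0, 100.0]

r_classical = pearson(x, y)
r_rank = spearman(x, y)
tied = ranks([10, 20, 20, 30])
print(round(r_classical, 4), round(r_rank, 4), tied)
```

The rank coefficient equals 1 here because the ordering of the two variables is identical, while the classical coefficient is lower because the relation is not linear.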

If the PCA option of the dialog box isn't activated, you will only get the correlation matrix as a result. Two calculation techniques are possible. If you choose to remove the rows with missing values, all rows containing missing values are deleted. If not, they are deleted only when the variable corresponding to the missing value is involved in a calculation. If you think it is worth it, this allows you to keep as much information as possible.

If you don't want to keep the rows with missing values, then whenever XLSTAT comes across missing data in a row, this row is ignored for all the calculations. If there are missing values that you haven't seen and you require a PCA, XLSTAT automatically detects and removes the corresponding rows.
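The two ways of handling missing values described above can be compared with a short Python sketch. Missing cells are represented by `None`, and the function names and data are hypothetical. It only shows which rows take part in each calculation, not how XLSTAT implements it.

```python
def complete_rows(rows):
    """First technique: drop every row that contains at least one missing value."""
    return [r for r in rows if all(v is not None for v in r)]


def rows_for_pair(rows, i, j):
    """Second technique: for variables i and j, drop a row only if i or j is missing."""
    return [(r[i], r[j]) for r in rows if r[i] is not None and r[j] is not None]


# Hypothetical dataset: 4 observations, 3 variables, None marks an empty cell.
rows = [
    (1.0, 2.0, 3.0),
    (2.0, None, 1.0),
    (3.0, 5.0, None),
    (4.0, 6.0, 2.0),
]

kept_everywhere = complete_rows(rows)
pair_01 = rows_for_pair(rows, 0, 1)
pair_02 = rows_for_pair(rows, 0, 2)
pair_12 = rows_for_pair(rows, 1, 2)
print(len(kept_everywhere), len(pair_01), len(pair_02), len(pair_12))
```

With the first technique only 2 of the 4 rows are used for every coefficient. With the second, the pairs (0,1) and (0,2) each use 3 rows, so more information is kept, at the cost of coefficients computed on different subsets of rows.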

To make the correlation matrix easier to read, you can choose to highlight the significant correlations, which are determined using a test that compares the absolute values to 1/sqrt(n-1), where n is the number of variables. For Kendall's and Spearman's coefficients, to find out whether a coefficient is significantly different from zero for the significance level you choose, you need to look in the corresponding tables, which can be found at the end of any book on statistics.

To help you read the correlation matrix, you can select in the dialog box a range within which all values will appear in a light green color. If you want to see all values, enter nothing or two equal values.

Suggestion : you should use the DataFlagger tool of XLSTAT to make particular values more visible in large correlation coefficients tables.

Results : Principal component analysis (PCA)

After the calculation of the correlation matrix, if the PCA option of the dialog box is activated, the Principal Component Analysis calculations start. At the end you will be able to see the plot of the data. Because you are projecting n-dimensional data on a 2-dimensional plot, as much variability as possible has been saved. The percentage calculated from the eigenvalues gives you an idea of the global variability that is represented when using the axes of interest.

If you want to view another two-dimensional representation of the data, select other axis numbers in the dialog box or change the series references on the chart. It is of course advised to always start with axes 1 and 2.

The correlations circle is a 2-dimensional plot that is helpful for interpreting the correlations between the initial variables and the new variables, which are linear combinations of the initial ones : the closer a variable is to an axis and to the circle, the higher its correlation with the corresponding factor. It is also a way to plot the correlations between the initial variables : if two variables are in opposite quarters, they are negatively correlated ; if two variables are perpendicular, the correlation is null ; if two variables are close, the correlation is close to 1.

A biplot is also plotted. It is not exactly a mix of the two previous plots, as the coordinates of the variables are modified to take into account the representation power of each axis and the scale of the plot of the data.

To avoid any mistaken interpretation of the positions of the data in the new representation space, one can use the squared cosines printed in the last table : the closer a squared cosine is to one, the closer the data is to the corresponding axis.
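The squared cosines can be understood with a short Python sketch. It uses the usual definition, assumed here for illustration : for one observation, the squared cosine on an axis is its squared coordinate on that axis divided by the sum of its squared coordinates on all axes. The coordinates below are hypothetical.

```python
def squared_cosines(coords):
    """Share of an observation's squared distance to the origin carried by each axis."""
    total = sum(c * c for c in coords)  # an observation at the origin would give total == 0
    return [c * c / total for c in coords]


# Hypothetical factor coordinates of one observation on three axes.
cos2 = squared_cosines([3.0, 4.0, 0.0])
print([round(c, 2) for c in cos2])

# Quality of the representation on the plane formed by axes 1 and 2.
plane_quality = cos2[0] + cos2[1]
```

The squared cosines of an observation sum to 1 over all axes, so the sum over the two plotted axes tells how faithfully that observation is shown on the plot. A point with a low sum may look close to others on the map while being far from them in the full space.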

Example

Here is a simple example with 8 observations described by 6 variables. PCA is used here to visualize the observations on a map to quickly see which years have been comparable and which have not.

Table 1 : data on Excel sheet

Figure 1 : the dialog box as it must be filled in

The first result displayed by XLSTAT is the correlation coefficients matrix. Here the DataFlagger (see the XLSTAT - Excel Utilities folder) has been used to highlight the strong positive and negative correlations.

Table 2 : correlations matrix

Among the 3 plots displayed by XLSTAT, the biplot is the nicest one when there aren't too many observations and variables (otherwise it can be messy), as it shows the mapping of the data simultaneously with the mapping of the initial variables on the factorial axes.

Figure 2 : biplot of variables and observations

For example, using this chart one can say that years 1965, 1966 and 1968 are close because production of citrus fruits was high and production of rice was low.

Correspondence Analysis

Introduction

Correspondence analysis is generally useful for analyzing survey results when two questions have been asked with several possible answers. It can of course be used for the analysis of any two-way contingency table. For example, this tool makes it possible to study the relation between parents' jobs and children's studies. A plot is generated to make the interpretation of the numerical results easier.

Dialog box

Contingency table : select in this box the table that contains the data, with or without the column and row labels (if necessary select the labels options). The data must be numbers. A contingency table is a table for which a cell (i;j) corresponds to the frequency observed for simultaneous characteristics: category


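This article was updated on 2021-02-02 10:52. It was provided by the author and does not represent the position of this website. When reposting, please cite the source: https://www.bjmy2z.cn/gaokao/599153.html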
