-
毕业论文(设计)
外文资料:
Information
management system
Wiliam n
U.S.A
Abstract:
An
information
storage,
searching
and
retrieval
system
for
large
(gigabytes)
domains
of
archived textual dam. The
system includes multiple query generation
processes, a search process,
and a
presentation of search results that is sorted by
category or type and that may be customized
based
on
the
professional
discipline(or
analogous
personal
characteristic
of
the
user),
thereby
reducing the amount
of time and cost required to retrieve relevant
results.
Keyword:
Information
management
Retrieval system
Object-Oriented
UCTION
This
invention
relates
to
an
information
storage,
searching
and
retrieval
system
that
incorporates
a
novel
organization
for
presentation
of
search
results
from
large
(gigabytes)
domains of
archived textual data.
OUDN
OF THE INVENTION
On-line
information
retrieval
systems
are
utilized
for searching and
retrieving
many kinds
of
information. Most systems
used
today work
in essentially the
same
manner; that
is,
users
log
on
(through
a
computer
terminal
or
personal
microcomputer,
and
typically
from
a
remote
location), select a
source of
information (i.e., a
particular database)
which
is
usually something
less than the complete domain,
formulate a query,
launch
the search, and then review the search
results displayed on the
terminal or
microcomputer,
typically
with documents
(or summaries of
documents)
displayed
in
reverse
chronological
order.
This
process
must
be
repeated
each
time
another source
(database) or group of sources
is
selected (which
is
frequently
necessary
in order
to
insure all
relevant
documents
have been
found).Additionally, this process
places on
the
user
the burden of organizing and
assimilating
the
multiple
results
generated
from the
launch o
f
the
same query
in each of the
multiple sources (databases) that
the
user
needs
(or wants)
to search.
Present
systems
that
allow
searching
of
large
domains
require
persons
seeking
information
in
these
domains
to
attempt
to
modify
their
queries
to
reduce
the
search
results
to
a
size
that
the
user can assimilate by browsing through
them (thus, potentially eliminating relevant
results).
In
many
cases
end
users
have
been
forced
to
use
an
intermediary
(i.e.,
a
professional
searcher)
because
the
current
collections
of
sources
are
both
complex
and
extensive,
and
effective
search
strategies
often
vary
significantly
from
one
source
to
another.
Even
with
such
guidance,
potential
relevant
answers
are
missed
because
all
potentially
relevant
databases
or
information sources are not searched on
every query. Much effort has been expended on
refining
and
improving
source selection by
grouping sources or
database
files together. Significant
effort
1
毕业论文(设计)
has
also
been
expended
on
query
formulation
through
the
use
of
knowledge
bases
and
natural
language processing.
However, as the
groupings of
sources become
larger, and the
responses to
more
comprehensive
search
queries
become
more
complete,
the
person
seeking
information
is
often faced with the
daunting task of sifting through large unorganized
answer sets in an attempt
to find the
most relevant documents or information.
Y OF THE INVENTION
The
invention
provides
an
information
storage,
searching
and
retrieval
system
for
a
large
domain
of
archived
data
of
various
types,
in
which
the
results
of
a
search
are
organized
into
discrete
types
of
documents
and
groups
of
document
types
so
that
users
may
easily
identify
relevant
information
more efficiently and
more
conveniently than systems currently
in
use.
The
system
of the
invention
includes
means
for storing
a
large domain of
data contained
in
multiple
source records, at
least some of
the source
records being comprised of
individual
documents of
multiple
document
types;
means
for
searching
substantially
all
of
the
domain
with
a
single
search
query
to
identify
documents
responsive
to
the
query;
and
means
for
categorizing
documents
responsive
to
the
query
based
on
document
type,
including
means
for
generating
a
summary
of
the
number
of
documents
responsive
to
the
query
which
fall
within
various
predetermined
categories of document types.
The query
generation process
may contain a knowledge base
including a thesaurus that
has
predetermined
and
embedded
complex
search
queries,
or
use
natural
language
processing,
or
fuzzy
logic,
or
tree
structures,
or
hierarchical
relationship
or
a
set
of
commands
that
allow
persons seeking information to
formulate their queries.
The search process can
utilize any
index and search
engine techniques
including Boolean,
vector, and probabilistic as
long as a substantial portion of the
entire domain of archived textual
data
is searched for each query and all documents found
are returned to the organizing process.
The
sorting/categorization
process
prepares
the
search
results
for
presentation
by
assembling
the
various document types retrieved by the
search engine
and then
arranging these
basic
document
types
into
sometimes
broader
categories
that
are
readily
understood
by
and
relevant
to
the
search
results
are
then
presented
to
the
user
and
arranged
by
category
along
with
an
indication
as
to
the
number
of
relevant
documents
found
in
each
category.
The
user
may then examine search
results
in
multiple
formats, allowing
the
user to
view as
much of
the document as the
user deems necessary.
DESCRIPTION OF THE DRAWINGS
FIG
. 1 is a block diagram
illustrating an information retrieval system of
the invention;
2
毕业论文(设计)
FIG
. 2 is a diagram
illustrating a query formulation and search
process utilized in the invention;
3
毕业论文(设计)
FIG
. 3 is a diagram
illustrating a sorting process for organizing and
presenting search results.
MODE FOR CARRYING OUT THE INVENTION
As
is
illustrated
in
the
block
diagram
of
FIG.
1
,
the
information
retrieval
system
of
the
invention
includes
an
input/output
process
,a
query
generation
process,
a
search
process
that
involves a
large domain of
textual data
(typically
in the
multiple
gigabyte range), an
organizing
process, presentation of the
information to the user, and a process to identify
and characterize the
types of documents
contained in the large domain of data.
Turning
now to FIG. 2, the
query
generation process preferably
includes a knowledge base
containing
a
thesaurus
and
a
note
pad,
and
preferably
utilizes
embedded
predefined
complex
Boolean
strategies.
Such
a
system
allows
the
user
to
enter
their
description
of
the
information
needed
using
simple
words/phrases
made
up of
language and
to
rely on
the system
to
assist
in
generating
the
full
search
query,
which
would
include,
e.g.,
synonyms
and
alternate
phraseology. The user can then request,
by a command such as
document selected
from the list, giving, in this case, complete
information about the identity and
credentials of the expert.
FIG. 3 illustrates how five typical
sources of information (i.e., source records) can
be sorted
into
many
document
types
and
then
subsequently
into
categories.
For
example,
a
typical
trade
magazine
may contain
several
types of
information
such as
editorials, regular columns,
feature
articles,
news, product
announcements, and a calendar of events. Thus,
the trade
magazine (i.e.,
the source record) may be sorted into
these various document types, and these document
types in
4
-
-
-
-
-
-
-
-
-
上一篇:欧美说说短句经典
下一篇:O2O商业模式及发展策略(英文)