- Trending Categories
Data Structure
Networking
RDBMS
Operating System
Java
iOS
HTML
CSS
Android
Python
C Programming
C++
C#
MongoDB
MySQL
Javascript
PHP
Physics
Chemistry
Biology
Mathematics
English
Economics
Psychology
Social Studies
Fashion Studies
Legal Studies
- Selected Reading
- UPSC IAS Exams Notes
- Developer's Best Practices
- Questions and Answers
- Effective Resume Writing
- HR Interview Questions
- Computer Glossary
- Who is Who
How is this statistical information useful for query answering?
The statistical parameters can be used in a top-down, grid-based approaches as follows. First, a layer within the hierarchical architecture is decided from which the query-answering procedure is to start.
This layer generally includes a small number of cells. For every cell in the current layer, it can compute the confidence interval (or estimated range of probability) reflecting the cell’s relevancy to the given query.
The statistical parameters of higher-level cells can simply be calculated from the parameters of the lower-level cells. These parameters contain the following − the attribute-independent parameter, count, and the attribute-dependent parameters, mean, stdev (standard deviation), min (minimum), max (maximum); and the type of distribution that the attribute value in the cell follows, including normal, uniform, exponential, or none (if the distribution is anonymous).
The irrelevant cells are removed from further consideration. Processing of the following lower level tests only the remaining relevant cells. This phase is repeated until the bottom layer is acquired. If the query description is met, the areas of relevant cells that use the query are restored.
STING offers several advantages which are as follows −
The grid-based calculation is query-independent, because the statistical data saved in each cell defines the summary records of the data in the grid cell, separate of the query.
The grid architecture supports parallel processing and incremental refreshing.
The technique efficiency is a major benefit. STING goes through the database because it can calculate the numerical parameters of the cells, and therefore the time complexity of generating clusters is O(n), where n is the total number of objects.
After making the hierarchical architecture, the query processing time is O(g), where g is the total number of grid cells at the lowest level, which is generally smaller than n.
Because STING need a multiresolution method to cluster analysis, the quality of STING clustering based on the granularity of the lowest level of the grid architecture. If the granularity is very fine, the value of processing will improve substantially; however, if the bottom level of the grid architecture is too rude, it can decrease the quality of cluster analysis.
STING does not treated the spatial relationship among the children and their neighboring cells for the development of a parent cell. As a result, the shapes of the outcoming clusters are isothetic; i.e., some cluster boundaries are horizontal or vertical, and no diagonal boundary is discovered. This can lower the quality and certainty of the clusters despite the quick processing time of the technique.
- Related Articles
- How is Java Useful for Testers?
- How cow dung is useful for agriculture?
- How is clayey soil useful for crops?
- How is sulphur useful in agriculture? How is sulphur useful in medicine?
- Why is wavelet transformation useful for clustering?
- MongoDB - interpret information on the query plan for the db.collection.find() method
- What is fermentation? How is it useful?
- How Can Augmented Reality be useful for Modern Education?
- What is the purpose of OPTIMIZE FOR ROWS in DB2 SQLs? How is it useful?
- What is carbon monoxide? How is it useful?
- How is Earthworm useful to the farmer?
- How is plastic useful in healthcare industry?
- Statistical Mechanics
- Statistical Physics
- Statistical Series
