- Trending Categories
- Data Structure
- Operating System
- MS Excel
- C Programming
- Social Studies
- Fashion Studies
- Legal Studies
- Selected Reading
- UPSC IAS Exams Notes
- Developer's Best Practices
- Questions and Answers
- Effective Resume Writing
- HR Interview Questions
- Computer Glossary
- Who is Who
What is Multirelational clustering?
Multirelational clustering is the phase of partitioning data objects into a group of clusters depends on their similarity, using data in multiple relations. CrossClus represents Cross-relational Clustering with user guidance. It is an algorithm for multirelational clustering that analyse how to use user guidance in clustering and tuple ID propagation to prevent physical joins.
The main challenge in multirelational clustering is that there are several attributes in multiple relations, and generally only a small area of them are relevant to a definite clustering task.
It can cluster students, attributes cover several elements of information, including courses taken by students, publications of students, advisors and research teams of students, etc.
A user is generally interested in clustering students using a specific element of data (e.g., clustering students by their research areas). Users can have a better grasp of their applications needed and data semantics. Thus, a user’s guidance in the form of a simple query, can be used to enhance the effectiveness and quality of highdimensional multirelational clustering.
CrossClus accepts user queries that include a target relation and one or more pertinent attributes, which define the clustering aim of the user. In the multirelational clustering process, CrossClus required to search pertinent attributes across several relations.
CrossClus should address two major challenges in the searching phase. First, the target relation, Rt, can generally join with each non-target relation, R, via several join paths, and each attribute in R can be used as a multirelational attribute.
It is inaccessible to implement any type of exhaustive search in this large search space. Second, between the large number of attributes, some are relevant to the user query whereas some are irrelevant (e.g., a student’s classmates’ personal data).
CrossClus should confine the search phase. It can be treated the relational schema as a graph, with associations being nodes and joins being edges. It adopts a heuristic methods, which begins search from the user-defined attribute, and then repeatedly searches for beneficial attributes in the neighborhood of current attributes. In this method, it gradually develop the search scope to connected relations, but will not go far into random directions.
CrossClus views at how attributes cluster target tuples. The pertinent attributes are selected depends on their relationships to the user-defined attributes. If two attributes cluster tuples very separately, their similarity is low and they are improbable to be associated. If they cluster tuples in a same way, they must be considered related.
- Related Articles
- What is Clustering?
- What is Conceptual Clustering?
- What is K-means clustering?
- What is Agglomerative Hierarchical Clustering?
- What is Prototype-Based Clustering?
- What is model-based clustering?
- What is Multi-relational Clustering?
- What is Document Clustering Analysis?
- What is clustering Index in DBMS?
- What is an Agglomerative Clustering Algorithm?
- What is STING grid-based clustering?
- What is scipy cluster hierarchy? How to cut hierarchical clustering into flat clustering?
- What are the applications of clustering?
- What are the methods of clustering?
- What are the elements in Hierarchical clustering?