Categorization is the process of organizing pieces of information into topical categories. Usually, the categories are arranged in a hierarchical tree, with the most general topics at the top and the most specific at the bottom.
A search engine may categorize the documents in its index based on similarity between documents, matching rules, or programmatic rules.
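As an illustration of the matching-rule approach, the following is a minimal sketch, not the method of any particular search engine. The `RULES` table, its category paths, its keywords, and the `categorize` function are all hypothetical names chosen for the example. Each rule maps a path in the category tree to a set of keywords, and a document is assigned to every category whose keywords it contains.

```python
# Hypothetical matching rules: category path (general -> specific)
# mapped to the keywords that trigger it. Illustrative values only.
RULES = {
    ("Science", "Physics"): {"quantum", "particle"},
    ("Science", "Biology"): {"cell", "genome"},
    ("Sports", "Football"): {"goal", "league"},
}


def categorize(text):
    """Return, in sorted order, every category path with a keyword in the text."""
    # Naive tokenization: lowercase and split on whitespace.
    words = set(text.lower().split())
    return sorted(path for path, keywords in RULES.items() if words & keywords)


if __name__ == "__main__":
    print(categorize("Quantum effects in a cell"))
```

A real system would likely use proper tokenization and stemming instead of a whitespace split; the sketch only shows how a rule ties a document to a node in the category tree.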
A categorization scheme, or taxonomy, defines how the documents in the corpus are categorized. The resulting groups of categories form facets that can be used for navigation.
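To show how categories can act as navigation facets, here is a small sketch under stated assumptions: the `DOCS` list, its category paths, and the functions `facet_counts` and `drill_down` are hypothetical and exist only for this example. Counting documents per category gives the numbers typically shown next to each facet, and filtering by a category prefix corresponds to a user selecting that facet.

```python
from collections import Counter

# Hypothetical corpus: each document carries one category path
# from the taxonomy, ordered from general to specific.
DOCS = [
    {"id": 1, "category": ("Science", "Physics")},
    {"id": 2, "category": ("Science", "Biology")},
    {"id": 3, "category": ("Sports", "Football")},
    {"id": 4, "category": ("Science", "Physics")},
]


def facet_counts(docs, depth):
    """Count documents per category, truncating each path to `depth` levels."""
    return Counter(doc["category"][:depth] for doc in docs)


def drill_down(docs, prefix):
    """Keep only the documents whose category path starts with `prefix`."""
    return [doc for doc in docs if doc["category"][:len(prefix)] == prefix]


if __name__ == "__main__":
    print(facet_counts(DOCS, 1))
    print([doc["id"] for doc in drill_down(DOCS, ("Science", "Physics"))])
```

Truncating the path to a given depth lets the same data serve both a top-level facet list and a more specific one after the user has narrowed the results.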