Explicit Feature Extraction Node

The Explicit Feature Extraction node is built using the feature extraction algorithm called Explicit Semantic Analysis (ESA). ESA is a vectorial representation of text, which can be individual words or entire documents. The algorithm uses a document corpus as the knowledge base. In ESA, a word is represented as a column vector in the tf–idf matrix of the text corpus and a document is represented as the centroid of the vectors representing its words. Oracle Data Mining provides a prebuilt ESA model based on Wikipedia. You can import the model to Oracle Data Miner for mining purposes.

You can use the Explicit Semantic Analysis node for the following purposes: