Text Analytics Toolbox Model for fastText English 16 Billion Token Word Embedding

by MathWorks Text Analytics Toolbox Team

Pre-trained English Word Embedding Model for Machine Learning and Deep Learning with Text

981 Downloads

Updated 11 Sep 2024

This Add-on provides a pre-trained word embedding and sentence classification model using FastText for use in machine learning and deep learning algorithms. FastText is an open-source library which provides efficient and scalable libraries for text analytics. For more information on the pre-trained word vector model see : https://fasttext.cc/docs/en/english-vectors.html

Opening the fasttext.mlpkginstall file from your operating system or from within MATLAB will initiate the installation process for the release you have.
This mlpkginstall file is functional for R2018a and beyond.
Usage Example:
% Load the trained model
emb = fastTextWordEmbedding;

% Find the top 10 closest words to “impedance” according to this word embedding
impedanceVec = word2vec(emb,"impedance");
vec2word(emb, impedanceVec,10)

ans =

10×1 string array

"impedance"
"impedances"
"capacitance"
"Impedance"
"resistor"
"impedence"
"inductance"
"voltage"
"S-parameters"
"ohms"

MATLAB Release Compatibility

Created with R2018a

Compatible with R2018a to R2024b

Platform Compatibility

Windows macOS (Apple silicon) macOS (Intel) Linux

Tags Add Tags

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Text Analytics Toolbox Model for fastText English 16 Billion Token Word Embedding

Requires

MATLAB Release Compatibility

Platform Compatibility

Categories

Tags Add Tags

Community Treasure Hunt

Discover Live Editor