Sri Krishnamurthy, QuantUniversity
Building and deploying NLP applications involves multiple steps, including data ingesting, pre-processing, labeling, model building, model selection, and deployment.
While data scientists are typically involved in prototyping end-to-end applications, deploying robust NLP applications in production requires building enterprise-grade pipelines and designing each stage in the pipeline to accomplish a particular task. This workshop presents QUSandbox, an enterprise platform for prototyping, designing, and scaling production-worthy machine learning pipelines. The platform and language agnostic platform enables integrating multiple tools to design coherent and production-grade pipelines that are auditable, replicable, and scalable. This master class will demonstrate the use of natural language processing techniques to analyze EDGAR call earnings transcripts that could be used to generate sentiment analysis scores using the Amazon Comprehend, IBM Watson, Google, and Azure APIs (application programming interfaces) to train your own model built in MATLAB. We will then illustrate how the various steps can be streamlined through a QuSandbox pipeline to enable building scalable machine learning applications in production.