Industry Track

The Industry Track of CoDS-COMAD '19 will cover the application of recent research advances to problems of real-world interest. It aims to bring together leading industry and academia practitioners to share their insights, expertise and experiences related to deployments of solutions in industry and government that address real-world challenges and highlight new and important research directions. The industry track comprises of four invited talks, eight oral presentations in two sessions, and a panel discussion.

Schedule - PDF | Interactive Program Committee

Nitendra Rajput, Info Edge India

Slides

Thursday, January 3, 2019, 2:00pm - 2:30pm

Nitendra Rajput

Executive Vice President, Info Edge India

Abstract: In this talk, we will demonstrate that deep learning has a very strong connection to the psychology of human decision making. Several behavioural economists and psychologists have researched on the hidden parameters through which we learn and make decisions, especially under uncertainty. We will talk about few such important life decisions that go beyond choosing a movie or doing eCommence. These decisions are around choosing a job, a life partner and a house. At Info Edge, we use deep learning in different businesses of the company, where the businesses are a job-portal (India's largest), matrimony (aka dating) site and a real estate site.
Bio: Nitendra Rajput works as Executive Vice President and Head of Data Science at Info Edge where he defines and leads the data science work for various Info Edge businesses such as naukri, jeevansathi, 99acres. Prior to this, Nitendra was at IBM Research for 18 years, managing the mobile agenda for IBM Research in India. Nitendra has authored over 80 publications at top international ACM and IEEE venues, owing to which he has been recognized as an ACM Distinguished Scientist and a senior IEEE Member. He has delivered several tutorials and conducted workshops in AI and speech areas at top ACM venues (MobileHCI, IUI, CSCW, CHI). With over 50 patents to his name, Nitendra was an IBM Master Inventor and was also an IBM Academy of Technology member. In 2012, he coauthored a book titled "Speech in Mobile and Pervasive Environments" that was published by John Wiley & Sons.

Arijit Biswas, Senior Machine Learning Scientist, Amazon

Video Slides

Friday, January 4, 2019, 2:00pm - 2:30pm

Arijit Biswas

Senior Machine Learning Scientist, Amazon

Abstract: E-commerce websites such as Amazon, Alibaba, and Walmart typically process billions of orders every year. Semantic representation and understanding of these orders is extremely critical for an eCommerce company. Each order can be represented as a tuple of . Exploring the space of all plausible orders could help us better understand the relationships between the various entities in an e-commerce ecosystem, namely the customers and the products they purchase. In this paper, we propose a Generative Adversarial Network (GAN) for orders made in e-commerce websites. Once trained, the generator in the GAN could generate any number of plausible orders. Our contributions include: (a) creating a dense and low-dimensional representation of e-commerce orders, (b) train an ecommerceGAN (ecGAN) with real orders to show the feasibility of the proposed paradigm, and (c) train an ecommerce-conditional-GAN (ec^2GAN) to generate the plausible orders involving a particular product. We propose several qualitative methods to evaluate ecGAN and demonstrate its effectiveness.
Bio: Arijit Biswas is currently a Senior Machine Learning Scientist at the India machine learning team in Amazon, Bangalore. His research interests are mainly in deep learning, machine learning and computer vision. Earlier he was a research scientist at Xerox Research Centre India (XRCI) from June, 2014 to July, 2016. He received his PhD in Computer Science from University of Maryland, College Park in April 2014. His PhD thesis was on Semi-supervised and Active Learning Methods for Image Clustering. His thesis advisor was David Jacobs and he closely collaborated with Devi Parikh and Peter Belhumeur during his stay at UMD. While doing his PhD, Arijit also did internships at Xerox PARC and Toyota Technological Institute at Chicago (TTIC). He has published papers in CVPR, ECCV, ACM-MM, BMVC, IJCV and CVIU. Arijit has a Bachelor's degree in Electronics and Telecommunication Engineering from Jadavpur University, Kolkata. Arijit is also a recipient of the MIT Technology Review Innovators under 35 award from India in 2016.

Vivek Tyagi, LinkedIn

Video Slides

Friday, January 4, 2019, 2:30pm - 3:00pm

Vivek Tyagi

Research Manager, LinkedIn

Abstract: Convolutional Neural Network (CNN) based text classifiers have been very successful in several text classification tasks. This talk will provide further empirical insights into sequence learning mechanism of a CNN Text classifier. We hypothesize that multiple convolution filters which convolve over embeddings of K words (K={1,2,3,4,5), get their weights trained by SGD to act somewhat as "matched filters". It is worth emphasizing that unlike lexical features (n-grams) in linear models, a single convolution filter responds (high activation) to multiple subsequences of K tokens which may be semantically similar and not just one specific subsequence (n-gram). This mechanism is empirically shown to be somewhat similar to "matched filtering" in linear system theory which detects a signal in presence of noise. Secondly, we also provide a comparison of our CNN text classifier with FastText and LSTM classifiers on a two public domain text classification tasks. Unlike the conventional wisdom, we show that for the text classification tasks, CNN with its fixed but controllable memory, performs better than RNN or LSTM based text classifiers. While theoretically LSTM can have infinite memory, still it is not clear upto what tokens extent does a LSTM text classifier's memory gets learned. We argue that a lower capacity CNN classifier with its fixed sized memory (filter width upto 5 tokens in our experiments) offers a better tradeoff. Our experiments indicate that the CNN classifier learns discriminative sequence patterns over the underlying dense word embeddings of the K adjacent words. This also helps the CNN classifier have the most pertinent K words long memory as compared to the bag of words (BoW) classifiers such as Linear SVM and FastText.
Bio: Vivek Tyagi is currently manager of the Text AI team at LinkedIn Bangalore where he and his team is developing novel word embeddings and Deep Learning classifiers to detect Spam and Low Quality content from the LinkedIn Feed. Prior to that, Vivek was a Research Director at American Express Risk Management Division, Gurgaon where he and his team developed Distributed GBM library -- AXGBoost -- by leveraging the open source XGboost library and developing six major Amex Proprietary algorithms into it. AXGBoost has been a major success at Amex Risk Management and is powering several Fraud and Credit Risk use cases and driving benefits upto hundred million dollars. Prior to that, Vivek has led several successful R&D programs in Speech Recognition and Speech Analytics at Xerox Research India, Interactive Intelligence USA and IBM Research India. Vivek obtained his BTech in Electrical Engineering from IIT Kanpur 2001, and his PhD in Computer Science from Swiss Federal Institute of Technology, Lausanne (EPFL) in 2006. He is a Senior Member of IEEE for contributions to speech recognition and has published about 40 papers in IEEE Transactions on Intelligent Transportation, Elsevier Speech Communication, and conferences proceedings such IEEE ICASSP, IEEE Machine Learning in Signal Processing (MLSP), Interspeech, IEEE CVPR Workshop in Biometrics and IEEE ICPR.

Monojit Choudhury, Microsoft Research Lab

Video Slides

Friday, January 4, 2019, 3:00pm - 3:30pm

Monojit Choudhury

Researcher, Microsoft Research

Abstract: Code-mixing or Code-switching is the linguistic phenomenon of mixing more than one language in a single conversation, sometime even in a single sentence or utterance. Such fluid alternation between languages is frequently observed in multilingual societies across the world, primarily in casual speech, but these days also in online user generated text. Processing of code-mixed text is challenging because of absence of adequate datasets compounded by the combinatorial explosion of language varieties and mixing patterns. In this talk, I will give a brief overview of these challenges, and present a few techniques that can effectively address these hurdles. I will also touch upon some of our research on social and pragmatic aspects of code-mixing.
Bio: Dr. Monojit Choudhury is a researcher in Microsoft Research Lab in India since 2007. His research spans many areas of Artificial Intelligence, cognitive science and linguistics. In particular, Dr. Choudhury has been working on technologies for low resource languages, code-switching (mixing of multiple languages in a single conversation), computational sociolinguistics and conversational AI. He has more than 100 publications in international conferences and refereed journals. Dr. Choudhury is an adjunct faculty at International Institute of Technology Hyderabad. He also organizes the Panini Linguistics Olympiad for high school children in India, and is the founding chair of the Asia-Pacific Linguistics Olympiad. Dr. Choudhury holds a B.Tech and PhD degree in Computer Science and Engineering from IIT Kharagpur.

Semantic Harmonisation of Numeric Data from Open Government Data
Asha Subramanian, Pavan Kumar Raichur Rajoli, Manikanta Vikkurthi and Pier Luigi Buttigieg
Intelligent Correction Predictor
Sudhir Verma and Uma Rani T M
Catch-phrase based Document Representation for Improved Prior Art Search
Nidhi Saraswat, Ishan Verma and Viren Gupta

Video

Fast Online 'Next Best Offers' using Deep Learning
Rekha Singhal, Gautam Shroff, Mukund Kumar, Sharod Roy Choudhury, Sanket Kadarkar, Rupinder Virk, Siddharth Verma and Vartika Tewari
Identifying Multi-dimensional Information from Microblogs During Epidemics
Shalmoli Ghosh, Koustav Rudra, Saptarshi Ghosh, Niloy Ganguly, Sanjay Podder, Naveen Balani and Neville Dubash
Unsupervised Multi-task Learning Dialogue Management with Recurrent Neural Networks
Sushravya G M and Shubhashis Sengupta
A Hierarchical Approach to Skin Lesion Classification
Rahul Kulhalli, Chinmay Savadikar and Bhushan Garware
Goal-based Ontology Creation for NLQ in SAP-ERP Platform
Neelamadhav Gantayat, Diptikalyan Saha, Jaydeep Sen and Senthil Mani

Video

The panel will explore all the real world issues and challenges faced by AI/ML scientists in developing and deploying AI solutions at scale for real world use cases. We will discuss the impact of topics like the following:

Lack of data
Noisy data
Lofty expectations
Speed and scale
Annotations/Cost of development
Talent/skill availability
etc.

Dr. Aruna Rajan

Senior Director, Data Sciences and Intelligent Software Products, Myntra

Dr. Karthik Sankaranarayanan

Senior Research Scientist and Research Manager - Artificial Intelligence, IBM Research

Mr. Krishnamurthy Vaidyanathan

Senior Managing Director, Goldman Sachs

Dr. Subhajit Sanyal

Manager, Machine Learning, Amazon

Dr. Vishwa Vinay,

Computer Scientist, Adobe Research

Moderator: Dr. Nanda Kambhatla

VP, Enterprise AI, Symphony AI and Ex IBM Distinguished Engineer

The Psychology of Deep Learning in Decision Making

Nitendra Rajput, Info Edge India

Thursday, January 3, 2019, 2:00pm - 2:30pm

Nitendra Rajput

Representation Learning for E-commerce

Arijit Biswas, Senior Machine Learning Scientist, Amazon

Friday, January 4, 2019, 2:00pm - 2:30pm

Arijit Biswas

Design and Analysis of High Performance Convolutional Neural Network Text Classifier and its Discriminative Sequence Learning Capability

Vivek Tyagi, LinkedIn

Friday, January 4, 2019, 2:30pm - 3:00pm

Vivek Tyagi

Processing Code-mixed Language

Monojit Choudhury, Microsoft Research Lab

Friday, January 4, 2019, 3:00pm - 3:30pm

Monojit Choudhury

Session 1 - Thursday, January 3, 2019, 2:30pm - 3:30pm

Session 2 - Friday, January 4, 2019, 11:00am - 12:30pm

Challenges in developing and deploying AI/ML solutions at scale in industry

Panelists