In this role, we are looking for candidates with relevant experience in Text Mining. The Text Mining Scientist (TMS) is expected to play a pivotal bridging role between enterprise database teams and business/functional resources. At a broad level, the TMS will leverage their solutioning expertise to translate the customer’s business need into a techno-analytic problem and work with database teams to bring large-scale text analytic solutions to fruition. The right candidate should have prior experience in developing text mining and NLP solutions using open-source tools.
Responsibilities
- Develop transformative AI/ML solutions to address our clients' business requirements and challenges
- Project Delivery: successfully deliver projects involving data pre-processing, model training and evaluation, and parameter tuning
- Manage Stakeholder/Customer Expectations
- Project Blueprinting and Project Documentation
- Creating Project Plan
- Understand and research cutting-edge industrial and academic developments in AI/ML with NLP/NLU applications in diverse industries such as CPG, Finance, etc.
- Conceptualize, design, build, and develop solution algorithms that demonstrate the minimum required functionality within tight timelines
- Interact with clients to collect, synthesize, and propose requirements and create an effective analytics/text mining roadmap
- Work with digital development teams to integrate and transform these algorithms into production-quality applications
- Conduct applied research on a wide array of text analytics and machine learning projects, file patents, and publish papers
Qualifications we seek in you!
Minimum Qualifications / Skills
- in Computer Science, Information Systems, Computer Engineering, or Systems Engineering, with relevant experience in Text Mining / Natural Language Processing (NLP) tools, data science, Big Data, and algorithms
- in MBA and Undergraduate degree in any engineering discipline, preferably Computer Science with relevant experience
- Full-cycle experience desirable in at least one large-scale Text Mining/NLP project, from creating a business use case through text analytics assessment/roadmap, technology and analytic solutioning, and implementation and change management; considerable experience in Hadoop, including development in the MapReduce framework
Technology
- Open Source Text Mining paradigms such as NLTK, OpenNLP, OpenCalais, StanfordNLP, GATE, UIMA, Lucene, and cloud-based NLU tools such as DialogFlow, MS LUIS
- Exposure to Statistical Toolkits such as R, Weka, S-Plus, Matlab, SAS-Text Miner
- Core Java experience in large-scale product development and functional knowledge of RDBMS
- on to programming in the Hadoop ecosystem, and concepts in distributed computing
- good Python/R programming skills; Java programming skills a plus
Methodology
- years of experience in solutioning and consulting in verticals such as BFSI and CPG, with hands-on delivery of text analytics on large structured and unstructured data
- solid foundation in AI Methodologies like ML, DL, NLP, Neural Networks, Information Retrieval and Extraction, NLG, NLU
- Exposure to concepts in Natural Language Processing & Statistics, especially in applications such as Sentiment Analysis, Contextual NLP, Dependency Parsing, Parsing, Chunking, Summarization, etc.
- ability to Conduct look-ahead client research with focus on supplementing and strengthening the client’s analytics agenda with newer tools and techniques
Preferred Qualifications/ Skills
Technology
- level of understanding of NLP, NLU and Machine learning/Deep learning methods
- OpenCalais, StanfordNLP, GATE, UIMA, Lucene, NoSQL
- development paradigms that would enable Text Mining Insights Visualization, e.g., Adobe Flex Builder, HTML5, CSS3
- Windows, GPU Experience
- Scala for distributed computing
- Deep learning frameworks such as TensorFlow, Keras, Torch, Theano
Methodology
- Network modeling paradigms, tools & techniques
- Analytics using Natural Language Processing tools such as Support Vector Machines and Social Network Analysis
- experience with text analytics implementations, using open-source packages and/or SAS-Text Miner
- Ability to prioritize, consultative mindset, and time management skills
The approximate annual base compensation range for this position is $100,000 to $125,000. The actual offer, reflecting the total compensation package plus benefits, will be determined by a number of factors which include but are not limited to the applicant’s experience, knowledge, skills, and abilities; geographic location; and internal equity.
Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values diversity and inclusion, respect and integrity, customer focus, and innovation. Get to know us at genpact.com and on LinkedIn, X, YouTube, and Facebook.
Furthermore, please note that Genpact does not charge fees to process job applications, and applicants are not required to pay to participate in our hiring process in any other way. Examples of recruitment scams include purchasing a 'starter kit,' paying to apply, or purchasing equipment or training.