Shanuj Shekhar

Stony Brook Graduate Student

About Me

I’m currently pursuing my Masters in Computer Science from Stony Brook University, New York. I completed my Bachelors in Computer Science from India, which also is my home country. Competitive Programming is one thing that I am very passionate about. I find the domains of Machine Learning and Data Science to be very interesting and love working on projects related to the same. In fact, most of the projects that I have worked in the span of last few years of my education in some form have helped me attribute my learning in these domains.

Skills

  • Java

  • Python

  • C#

  • Machine Learning Tools

  • HTML/CSS/Javascript

  • C++

  • C

  • Data Analysis Tools

  • Projects

    Smart IoT Climate Control System

    January 2020 - Ongoing

    We are developing a smart IoT climate control system and would like to leverage machine learning techniques for occupancy prediction (based on incoming sensor data) and for damper actuation (based on incoming sensor data and temperature target, when should heating/cooling be turned on and which heating/cooling dampers should be actuated).

    Topic: Machine Learning

    Languages and Tools Used: Python Deep Neural Networks PyTorch

    D3 Visualization of COVID-19 Pandemic

    https://www.youtube.com/watch?v=3fKE13DtB70&feature=youtu.be

    March 2020

    I have created a dashboard for visualizing COVID-19 cases in the USA, how the disease spread and how it affected the country’s unemployment rates. I have created two modes for our dashboard - one for the unemployment data and one for the COVID-19 case data to show the spread in different states and counties.

    Topic: Data Visualization, Front-end Development

    Languages and Tools Used: Python D3.js HTML5 CSS3 Javascript Flask

    Detect Heavy Drinking Episodes

    February 2020

    I have used Random Forest Classifier to identify intoxicated individuals according to their TAC labels and detect drinking episodes using accelerometer samples from their mobile devices.

    Topic: Machine Learning

    Languages and Tools Used: Python scikit sklearn

    Augmented Reality Video Game using Unity Tool

    January 2020

    I have designed a game in which a user can interactively build an augmented 3D scene on a planar surface in the real world. My application has two modes: a build mode and a game mode and application starts with a menu of two buttons displaying the two modes.

    Topic: Virtual Reality

    Languages and Tools Used: C# Unity Tool Blender 3.0 Vuforia

    Generating entity descriptors (Post-Modifiers) based on context

    November 2019

    The task of post modifier generation requires to automatically generate a post modifier phrase describing the target entity (an entity essentially refers to a noun but here we only consider people) that contextually fits in the input sentence.

    Topic: Natural Language Processing, Recurrent Neural Networks

    Languages and Tools Used: Python scikit-learn numpy

    Emotion Recognition for Real-Time Feedback

    June 2019

    This project titled “Emotion Recognition for Real-Time Feedback” performs facial expression analysis in near real-time from a live webcam feed. It classifies human expressions into 8 different classes (Happy, Sad, Angry, Contempt, Disgust, Fear, Surprise, Neutral) with an accuracy of 67% .

    Topic: Machine Learning, Support Vector Machine

    Languages and Tools Used: Python scikit-learn numpy OpenCV

    Chat-Application

    June 2016

    Server-Client side scripting of Chat Application

    This application allows multiple users to chat with each other. This is a very basic implemention of node.js and socket.io javascript modules. With the help of this application, users can chat irrespective of their locations.

    Topic: Web Development

    Languages and Tools Used: node.js nodemon ngrok HTML5 CSS3 Javascript

    Visual Data Analytics

    https://www.youtube.com/watch?v=tTgrkyfKcsM&t=3s

    Febraury 2020

    Performing common visual data analytic tasks using Python and D3.js.

    Implemented Random Sampling and Stratified Sampling. Performed K-means Clustering for Stratified Sampling using Elbow Method and Dimension Reduction using PCA and MDS Analysis.

    Topic: Data Visualization, Front-end Development

    Languages and Tools Used: Python D3.js HTML5 CSS3 Javascript Flask

    Term Frequency per Inverse Document Frequency

    Febraury 2017

    A numerical statistic that is intended to reflect how important a word is to a document in a collection or corpus

    Calculated the term frequency for terms present in 2000 documents. (Dataset- http://mlg.ucd.ie/files/datasets/bbc-fulltext.zip). Found the top 10 most important terms for each document.

    Topic: Virtual Reality

    Languages and Tools Used: C# Unity Tool Blender 3.0 Vuforia

    Interactive Space Exploration Video Game in Virtual Reality using Unity Tool

    March 2020

    Designed Virtual Scene of Solar System consisting of Sun, Moon, Stars, cargo-ship, spaceship, etc. (Modelled using Blender 2.8).

    Implemented features like gravity, switching cameras of game, revolving and rotational motion of planets, driving spaceship. Spaceship is equipped with the state-of-the-art selection ray which can pick-up and move objects using VR controllers. Designed a greenhouse on a Planet where I made the player do some random tasks like aligning cargo boxes, etc. Designed teleportation tube to teleport into the greenhouse.

    Topic: Virtual Reality

    Languages and Tools Used: C# Unity Tool Blender 3.0 Oculus Quest (HMD, 64GB)

    Experience

    Mozilla Fix-The-Internet Open Lab (Spring)

    Builder/Developer @ Open Lab (Internship)

    April 2020 – June 2020

    Website: https://help-for-all.herokuapp.com/

    Developed an online platform for matching donation related resources like food, clothing etc.using relevant tweets. The website lists dona-tion/request tweets location wise, based on search.

    Implemented Naive Bayes Classifier for classification of tweets (Donation/Non-Donation, Donor/Requestor & Resource Type classification), with an accuracy of 80%, after parsing them using standard NLP techniques.

    Languages and Tools Used: Python HTML5 CSS3 Javascript Flask Heroku

    Cadence Design Systems, Inc.

    Summer Intern

    June 2018

    Text Detection in Images

    In this project, text detected from Microprocessor pin diagrams using Tesseract Tool (Tesseract is an optical character recognition engine for various operating systems) is used for improving the text detection accuracy. Extracted text from Microprocessor Pin Diagram images acts as input for generation of Bayesian Network (Dlib library). Using Posterior Probability concept text accuracy was improved.

    Languages and Tools Used: C++ Microsoft Visual Studio 2017 Tesseract

    National University of Technology, Delhi

    http://www.nsit.ac.in/

    Research Intern

    June 2017

    Reusable Hybrid Test Automation Framework for Web Based Scrum Project

    Project done on Amazon, Flipkart e-commerce websites. Performed automation testing using Selenium Tool. The primary focus of this framework is to ensure reusability and maintainability of test scripts so as to speed up the testing process of web applications.

    Research Paper Published: Publication Link

    Languages and Tools Used: Java Eclipse IDE Selenium 2.0

    Education

    State University of New York at Stony Brook

    August 2019 - December 2020 (Expected)

    Masters in Computer and Information Science

    GPA: 3.61 / 4.00

    Graduate Coursework: Machine Learning; Visualization; Analysis of Algorithms; Natural Language Processing; Operating Systems; Virtual Reality

    Dr. B.R.Ambedkar National Institute of Technology, Jalandhar

    August 2015 - May 2019

    Bachelor of Technology in Computer Science and Engineering

    GPA: 8.53 / 10.00

    Undergraduate Coursework: Data Mining; Data Structures and Algorithms; Artificial Intelligence; Advanced Programming Concepts Using Java; Agile Software Development