Shylendra Sai Bangaru

Senior Data Scientist

Murfreesboro, Tennessee, United States

View resume

Skills

Programming Languages : Python (Pandas, NumPy, SciPy, Matplotlib, Scikit-Learn, TensorFlow, PyTorch), R, SQL, JavaScript, Scala, Java
Data Analytics/Modeling : Jupyter Notebook, Google Colab, Apache Zeppelin, SAS, SPSS, MATLAB
Data Visualization : Tableau, Power BI, Looker, D3.js, Seaborn, ggplot2, Matplotlib, Plotly
Big Data Tools : Apache Hadoop, Apache Spark, Apache Flink, Databricks, Apache Kafka, Apache Hive, Presto
ETL Tools : Apache Nifi, Apache Airflow, Talend, Informatica PowerCenter, AWS Glue
OLAP Tools : Apache Kylin, Microsoft Analysis Services, Oracle OLAP, Amazon Redshift
Databases : PostgreSQL, MySQL, MongoDB, Apache Cassandra, Oracle, Microsoft SQL Server, GoogleBigQuery, Amazon DynamoDB, Snowflake
Cloud Platforms : AWS (S3, EC2, RDS, Lambda), Google Cloud Platform (BigQuery, Dataflow, AI Platform), Microsoft Azure (Synapse, HDInsight)
DevOps Tools : Docker, Kubernetes, Jenkins, Git, Terraform, Ansible
Project Methodologies : Agile, Scrum, Kanban, Waterfall
Machine Learning Libraries : TensorFlow, PyTorch, Keras, XGBoost, LightGBM, H2O.ai
AI & Deep Learning : TensorFlow, Keras, PyTorch, FastAI, OpenCV
Data Warehousing Tools : Snowflake, Redshift, Teradata, Google BigQuery, Azure Synapse

About

Experienced Data Architect and Analyst with over 9 years of expertise in data architecture, modeling, and engineering across both OLTP and OLAP environments. Adept at designing scalable data solutions using modern architectures, including star and snowflake schemas, and managing complex data ecosystems with proficiency in dimensional and relational modeling.

Skilled in leveraging tools like Erwin, Power Designer, and ER Studio for building conceptual, logical, and physical models, with hands-on experience across Big Data platforms such as Hadoop, Spark, Hive, Pig, and PySpark. Proven ability to implement data governance, profiling, cleansing, and ETL strategies using Informatica, PowerCenter, SSIS, and custom automation via UNIX and Perl scripting.

Well-versed in cloud data services on AWS (Redshift, S3, EMR) and Azure (Data Lake), with strong command of SQL and procedural languages across Oracle, Teradata, Netezza, and DB2. Passionate about delivering high-quality, actionable insights using predictive analytics and machine learning models (Random Forest, SVM, Logistic Regression, Neural Networks) in Python and R.

Experienced in managing complete data science project lifecycles—from acquisition and transformation to modeling and visualization—using tools like Tableau and libraries like scikit-learn, caret, dplyr, and ggplot2. Strong advocate of SDLC, Agile, and industry best practices for building high-impact, reliable data systems that support business growth and innovation.