Open in app

Sign In

Write

Sign In

Ben Weber
Ben Weber

8.5K Followers

Home

About

Published in Towards Data Science

·Dec 13, 2021

10 Technologies I Explored as an Applied Data Scientist in 2021

ML for ad technology and diving into deep learning — 2021 was a continuation of technical skill development for my data science career, focused on building real-time machine learning systems. While I started building production-grade data products last year, as outlined in my 2020 lookback, this year I focused much more on machine learning, with an emphasis on using deep…

Data Science

9 min read

10 Technologies I Explored as an Applied Data Scientist in 2021
10 Technologies I Explored as an Applied Data Scientist in 2021
Data Science

9 min read


Published in Towards Data Science

·Sep 7, 2021

Approaches for Building Real-Time ML Systems

Responding to Prediction Requests in Milliseconds — As an applied data scientist at Zynga, I’ve started getting hands on with building and deploying data products. As I’ve explored more and more use cases for machine learning, there’s been an increasing need for real-time machine learning (ML) systems, where the system performs feature engineering and model inference to…

Machine Learning

11 min read

Approaches for Building Real-Time ML Systems
Approaches for Building Real-Time ML Systems
Machine Learning

11 min read


Published in Towards Data Science

·Jun 14, 2021

Technologies for Applied Data Science

Tools for building real-time ML applications — Building and deploying data products that use machine learning to personalize products typically involves a variety of disciplines including data science, product management, and engineering. I’ve worked in a few organizations where data science and engineering teams are somewhat siloed, and the following approaches are used to put models into…

Machine Learning

11 min read

Technologies for Applied Data Science
Technologies for Applied Data Science
Machine Learning

11 min read


Published in Towards Data Science

·Dec 28, 2020

8 New Tools I Learned as a Data Scientist in 2020

Making the move from Docker to Live Deployments — While 2020 has been a challenging year, I was able to use the transition to remote work to explore new tools to expand my data science skill set. It was the year that I made the transition from data scientist to applied scientist, where I was responsible for not only…

Data Science

7 min read

8 New Tools I Learned as a Data Scientist in 2020
8 New Tools I Learned as a Data Scientist in 2020
Data Science

7 min read


Published in Towards Data Science

·Nov 9, 2020

Data Science in a Serverless World

Building Data Products with Managed Services — In large companies, there are typically separate teams for training machine learning models and putting these models into production. A data science team may be responsible for feature engineering, model selection, and hyperparameter tuning, while a machine learning engineering team is responsible for building and maintaining the infrastructure required to…

Python

7 min read

Data Science in a Serverless World
Data Science in a Serverless World
Python

7 min read


Published in Towards Data Science

·Sep 7, 2020

NoSQL for Real-Time Feature Engineering and ML Models

Building User Profiles with Streaming Data — For the majority of my data science career, I’ve built machine learning models using data fetched from a data warehouse or lake. With this approach, you can create a feature vector for each user by applying SQL commands to transform several events into a user summary. However, one of the…

Data Science

11 min read

NoSQL for Real-Time Feature Engineering and ML Models
NoSQL for Real-Time Feature Engineering and ML Models
Data Science

11 min read


Published in Towards Data Science

·Sep 3, 2020

Democratizing PySpark for Mobile Game Publishing

Zynga Analytics at Spark Summit 2020 — Over the past two years, analytics at Zynga has been increasingly using PySpark, which is the Python interface to the Spark big data platform. We have central and embedded analytics teams that use PySpark to support mobile publishing operations including analytics and reporting, experimentation, personalization services, and marketing optimization. I…

Data Science

18 min read

Democratizing PySpark for Mobile Game Publishing
Democratizing PySpark for Mobile Game Publishing
Data Science

18 min read


Published in Towards Data Science

·Aug 15, 2020

When to use Java as a Data Scientist

While Python and R provide rich ecosystems for data scientists to handle a wide range of problems, there are situations in which other programming languages, including Java and Go, should be explored. I have found that hands-on experience with Java has been increasingly useful as I shift my focus from batch…

Data Science

4 min read

When to use Java as a Data Scientist
When to use Java as a Data Scientist
Data Science

4 min read


Published in Towards Data Science

·Apr 6, 2020

Securing ML Services on the Web

HTTPS and Access Control — If you’re looking to host a machine learning service over the web, then it’s usually necessary to lock down the endpoint so that calls to the service are secure and only authorized users are able to access the service. In order to make sure that sensitive information is not exposed…

Data Science

15 min read

Securing ML Services on the Web
Securing ML Services on the Web
Data Science

15 min read


Published in Towards Data Science

·Feb 17, 2020

DevOps for Data Science with GCP

Deploying Production-Grade Containers for Model Serving — One of the functions of data science teams is building machine learning (ML) models that provide predictive signals for products and personalization. While DevOps has not always been considered a core responsibility of data science teams, it is becoming increasingly important as these teams start to take more ownership of…

Data Science

14 min read

DevOps for Data Science with GCP
DevOps for Data Science with GCP
Data Science

14 min read

Ben Weber

Ben Weber

8.5K Followers

Director of Applied Data Science at Zynga @bgweber

Following
  • Jeff Hale

    Jeff Hale

  • Weszt Hart

    Weszt Hart

  • NYC Media Lab

    NYC Media Lab

  • Caitlin Kindig

    Caitlin Kindig

  • Geoff Keighley

    Geoff Keighley

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech