This week in Wayfair Data Science’s Explainer Series, Data Science Tech Lead Peter B. Golbus discusses machine learning from a theoretical computer science perspective. In this video, we describe multiclass classification as an encoding task, i.e., a process for building compression schemes that convert large "files" (feature vectors) into small ones (labels). By framing classification this way, we can use the powerful tools of information theory to produce actionable insight. In particular, we show how classification accuracy is bounded from above by the mutual information between your features and your labels, and how information theory explains why ensembling and feature selection are such effective tools for machine learning.
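
To make the bound concrete, here is a minimal sketch (not from the video) that uses Fano's inequality, the standard information-theoretic result behind statements of this kind, on a synthetic dataset: labels are uniform over K classes and the single feature is a noisy copy of the label. The class count K, the noise rate eps, and the NumPy-based estimation are all illustrative assumptions.

```python
# A minimal sketch (not from the video) of the accuracy bound, using a loose
# form of Fano's inequality:  P(error) >= (H(Y|X) - 1) / log2(K), i.e.
# accuracy <= 1 - (H(Y) - I(X;Y) - 1) / log2(K).  The synthetic data below
# (K classes, a feature that is a noisy copy of the label) is an illustrative
# assumption, not a dataset from the talk.
import numpy as np

K, eps, n = 8, 0.3, 200_000   # classes, label-noise rate, sample size
rng = np.random.default_rng(0)

y = rng.integers(K, size=n)                        # uniform labels
flip = rng.random(n) < eps                         # which samples get corrupted
other = (y + rng.integers(1, K, size=n)) % K       # a uniformly random *other* class
x = np.where(flip, other, y)                       # feature = noisy copy of the label

# Empirical joint distribution and mutual information I(X; Y), in bits.
joint = np.zeros((K, K))
np.add.at(joint, (x, y), 1)
joint /= n
px = joint.sum(axis=1, keepdims=True)              # marginal of X
py = joint.sum(axis=0, keepdims=True)              # marginal of Y
nz = joint > 0
mi = np.sum(joint[nz] * np.log2(joint[nz] / (px @ py)[nz]))

h_y = -np.sum(py[py > 0] * np.log2(py[py > 0]))    # H(Y)
h_y_given_x = h_y - mi                             # H(Y|X) = H(Y) - I(X;Y)

# Fano's bound on the accuracy of *any* classifier built from X.
fano_accuracy_bound = 1 - (h_y_given_x - 1) / np.log2(K)

# The best possible classifier here just predicts the observed feature.
bayes_accuracy = np.mean(x == y)

print(f"I(X;Y) ~ {mi:.3f} bits, H(Y|X) ~ {h_y_given_x:.3f} bits")
print(f"Fano upper bound on accuracy: {fano_accuracy_bound:.3f}")
print(f"Best achievable accuracy:     {bayes_accuracy:.3f}")
```

The bound is computed from I(X; Y) and H(Y) alone, yet it caps the accuracy of every classifier you could build from these features, including the Bayes-optimal one that simply predicts the observed feature.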