Open in app

Sign in

Write

Sign in

Devin Petersohn
Devin Petersohn

178 Followers

Home

About

Published in

riselab

·Aug 12, 2021

So you want to build an open source tool/library as a grad student

This is a collection of experiences and recommendations for building an open source community as a grad student — Many grad students and professors have asked me for suggestions on how to build a functioning and thriving open source community while in grad school. This blog post appears as a chapter in my thesis, but ultimately I decided to extract those contents and put them here for easier retrieval…

Open Source

10 min read

So you want to build an open source tool/library as a grad student
So you want to build an open source tool/library as a grad student
Open Source

10 min read


Published in

Towards Data Science

·Apr 7, 2021

We Don’t Need Data Engineers, We Need Better Tools for Data Scientists

The role of Data Engineer exists as we know it because of a lack of adequate tooling for Data Scientists — In most companies, Data Engineers support the Data Scientists in various ways. Often this means translating or productionizing the notebooks and scripts that a Data Scientist has written. …

Data Science

4 min read

We don’t need Data Engineers, we need better tools for Data Scientists
We don’t need Data Engineers, we need better tools for Data Scientists
Data Science

4 min read


Published in

riselab

·Oct 7, 2020

How to ensure a data scientist is never productive

We need to start placing a higher value on data scientists’ time than we do on machine time — While data science tools are being optimized to perform well on microbenchmarks, they are becoming more and more difficult to use. Is the benchmark performance worth the human time cost it takes to get there? …

Data Science

5 min read

How to ensure a data scientist is never productive
How to ensure a data scientist is never productive
Data Science

5 min read


Published in

Towards Data Science

·Jul 7, 2020

The Modin view of Scaling Pandas

Comparing Modin with Dask, Ray, Vaex, and RAPIDS — Recently, a blog post was written that compared a variety of tools in a set of head to heads. I wanted to take the opportunity to talk about our vision with Modin and where we’d like to take the field of data science. Modin (https://github.com/modin-project/modin) takes a different view…

Pandas

6 min read

The Modin view of Scaling Pandas
The Modin view of Scaling Pandas
Pandas

6 min read


Published in

Towards Data Science

·Jan 14, 2020

Preventing the Death of the Dataframe

Dataframes are losing their statistical computing and machine learning roots — Dataframes emerged from a specific need, but because so many diverse systems now call themselves dataframes, the term is on the verge of meaning nothing. In an effort to preserve the dataframe, we formalized the definition based on the original data model in our recent preprint[2]. Before we get into…

Data Science

6 min read

Preventing the Death of the Dataframe
Preventing the Death of the Dataframe
Data Science

6 min read


Published in

riselab

·May 8, 2019

Two missing links in Serverless Computing: Stateful Computation and Placement Control

by Ion Stoica and Devin Petersohn — Serverless computing is rapidly gaining in popularity due to its ease of programmability and management. Many see it as the next general purpose computing platform for the cloud [4]. However, while existing serverless platforms have been successful in supporting several popular applications such as event processing and simple ETL, they…

Serverless

10 min read

Two missing links in Serverless Computing: Stateful Computation and Placement Control
Two missing links in Serverless Computing: Stateful Computation and Placement Control
Serverless

10 min read

Devin Petersohn

Devin Petersohn

178 Followers
Following
  • Aditya Parameswaran

    Aditya Parameswaran

  • Michael Galarnyk

    Michael Galarnyk

  • Josh Patterson

    Josh Patterson

  • Stephanie Wang

    Stephanie Wang

  • Sarah Wooders

    Sarah Wooders

See all (28)

Help

Status

About

Careers

Blog

Privacy

Terms

Text to speech

Teams