Pascal Bugnion @pbugnion pbugnion
ASI Data Science is a London-based data science consultancy
One of the most common languages for building large data science applications
But there is no good data visualization library
At a conservative two hours per commit, it would take a single person 23 years to build
POST /v1/plot
{
"data" : {
"x": ["2016-02-22 18:40", "2016-02-22 19:14", ... ],
"y": [732.83, 729.65, ... ]
},
"figure": {
"title": "Stock price ($)",
...
}
}
import co.theasi.plotly
import util.Random
// Generate uniformly distributed x
val xs = (0 until 100)
// Generate random y
val ys = xs.map { x => x + 5.0 * Random.nextDouble }
val p = Plot().withScatter(xs, ys)
draw(p, "basic-scatter", writer.FileOptions(overwrite=true))
// returns PlotFile(pbugnion:173,basic-scatter)
import co.theasi.plotly
import util.Random
// Generate uniformly distributed x
val xs = (0 until 100)
// Generate random y
val ys = xs.map { x => x + 5.0 * Random.nextDouble }
val p = Plot().withScatter(xs, ys)
draw(p, "basic-scatter", writer.FileOptions(overwrite=true))
// returns PlotFile(pbugnion:173,basic-scatter)
Come and contribute!
But for every popular repository, there are hundreds of unsucessful ones
But for every popular repository, there are hundreds of thousands of unsucessful ones
But for every popular repository, there are hundreds of thousands of unsucessful ones
The top 28% of repositories hold 72% of the total stars
The top 28% of repositories hold 72% of the total stars
The distribution of stars has a Gini coefficient of 0.6
World income distribution Gini coefficient is 0.8
OECD income distribution Gini coefficients are 0.4-0.6
So why do some repositories become popular?
The 98th percentile repositories written in a:
If I'm in the top 2% of repos for language X, I have Y stars:
Language | # stars |
---|---|
Go | 93.1 |
Scala | 36.6 |
JS | 22.4 |
Average | 19.4 |
Language | # stars |
---|---|
Python | 19.3 |
C++ | 19.1 |
PHP | 17.7 |
Ruby | 15.8 |
C# | 12.8 |
Java | 10.2 |
Rule #3: hard work pays off
For repos of at least 1000 commits written in...
Language | P( $\geqslant$ 50 stars) |
---|---|
Go | 52% |
Scala | 36% |
JavaScript | 13% |
Python | 12% |
But some languages provide better ROI
Make it approachable
Built with