Hacker News

Hah, my friend and I did nearly the exact same project in college, though minus the publication. We had an open ended project for an intro to machine learning class we were taking.

We ended up using the Million Song Dataset (I'm not sure Spotify gave out this kind of data six years ago), which includes various info about roughly a million songs: artist, length, and supposedly Echonest API results for things like "dancyness". We then merged this with a list of something like 250k play counts. We then found out the Echonest data was quite literally all just set to null, so I went to their API, signed up for a developer key, and spent six days querying to fill out our dataset.

We were massive novices at machine learning, so we were basically just script-kiddying it, and pretty much none of the models we made over a 24ish-hour period (because we were dumb college students doing things last minute) had any significant accuracy. Finally we made a random forest model that was able to predict, with 80% accuracy, the "magnitude" of plays, i.e. roughly whether a song would get a million plays or a thousand.
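That "magnitude" target can be sketched as a simple order-of-magnitude bucketing of play counts. This is my own illustration, not the original code; the function name and the exact bucketing scheme are assumptions:

```python
def play_count_magnitude(plays: int) -> int:
    """Bucket a raw play count by order of magnitude:
    1_000 -> 3, 1_000_000 -> 6, so the model predicts a
    handful of coarse classes instead of an exact count."""
    return len(str(max(plays, 1))) - 1

# A random forest would then be trained to predict this bucket
# from the song features (artist, length, "dancyness", ...).
```

Predicting the bucket turns a noisy play-count regression into a small classification problem, which is a friendlier target for a quick random forest.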

When we broke it down (model explainability is an awesome feature), we found that out of everything interesting we had done with feature investigation, data cleaning, etc., the model was about 90% based on which artist made the song. In retrospect, that makes sense, in a sort of cynical way: even a great song by an unknown artist rarely makes it big. The moral of the story, I guess, is that machine learning isn't magic.
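That kind of break-down can be illustrated with a tiny permutation-importance toy: score the model, scramble one feature column, and see how far accuracy falls. Everything below (the field names, the deterministic `shuffle_fn` stand-in, the toy model) is my own sketch, not the original project's code; real code would use a random shuffle or a library's built-in feature importances:

```python
def accuracy(model, rows, labels):
    """Fraction of rows the model labels correctly."""
    return sum(model(r) == y for r, y in zip(rows, labels)) / len(rows)

def permutation_importance(model, rows, labels, col, shuffle_fn):
    """Drop in accuracy when one feature column is permuted.
    A large drop means the model leans heavily on that column."""
    base = accuracy(model, rows, labels)
    permuted = shuffle_fn([r[col] for r in rows])
    permuted_rows = [{**r, col: v} for r, v in zip(rows, permuted)]
    return base - accuracy(model, permuted_rows, labels)

# Toy data where the label is entirely determined by the artist.
rows = [
    {"artist": "A", "tempo": 120}, {"artist": "A", "tempo": 90},
    {"artist": "B", "tempo": 120}, {"artist": "B", "tempo": 90},
]
labels = [1, 1, 0, 0]
model = lambda r: 1 if r["artist"] == "A" else 0

reverse = lambda vals: vals[::-1]  # deterministic stand-in for a shuffle
artist_importance = permutation_importance(model, rows, labels, "artist", reverse)
tempo_importance = permutation_importance(model, rows, labels, "tempo", reverse)
# artist_importance -> 1.0 (model collapses), tempo_importance -> 0.0 (ignored)
```

Scrambling the artist column destroys the toy model's accuracy while scrambling tempo changes nothing, which is the same shape of result described above.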

I still have all the data, and I've been meaning to revisit it now that I actually have a better understanding of the field. It's on my list of things to revisit/do; a very long list.



Ha. I did this exact same thing for a project in college using Echonest and linear regression. In the end, we were unable to find a single statistically significant coefficient. We ended up having to change our project completely. Kudos to your team for finding something there.


I also did something similar in college, but due to the similar issues noted, pivoted to genre classification with extracted audio features. With that, though, I was actually able to get a pretty accurate classifier going.


First experiment I would do is remove the artist field from the input :)


I also wanted to do this for the "eventual" revisit. We also wanted to try more computationally intensive training approaches, possibly including neural networks, but my poor 4th-gen i5 just could not cope. I'm waiting for AMD-accelerated training to be mostly trivial, so possibly forever.


And maybe eliminate the top 100 artists?


"90% based on which artist made the song"

Doesn't that demonstrate that the actual business case (producers discovering new artists) doesn't even factor into what the model does, which is predicting which new songs will actually be hits?

It seems like the former is much harder than the latter in this case.


I think what you will find is that the process of an artist being discovered is basically the same as getting into YC.

It's about the artist and whether they have star quality and are saleable. There are plenty of songwriters to write the actual songs.

Classic case is someone like Sia Furler who has written a ton of hits for other artists.

https://time.com/4209769/sia-best-songs-written-for-other-ar...


Keep in mind that our results and findings, while making "sense" post hoc, are predicated on our process and procedure being correct, which, honestly, as complete newbies to the whole field, was roughly a coin flip.


Not only that, but sales figures and marketing spend are also not included.



