Crazy idea: what if you used a dimensionality reduction technique like t-SNE instead of learning a vector representation? Would you expect similar results?
I personally found that vector representations performed significantly better than other approaches.
The results will also be a lot better once I ship a better model (the current one can definitely be improved).