Are the training data in order?
User 32327’s buying order: phones → fashion → home decor → …?
Are the training data in order?
User 32327’s buying order: phones → fashion → home decor → …?
No, the training data has no preference order.
I read the page, still don’t understand the dataset.
1 MRR this metric emphasises the order of your predictions. You have to predict 3 categories in decreasing order of probability.
2 Train_data.csv have multiple transactions for individual user_id. But in one row you have been given only one transaction ( one product category and one aov).
3 Your submission won’t fit in the format as demonstrated in the sample submission. So please predict only the top 3.
I hope it resolves your doubts.
I have submitted few submissions nearly 6 in past few days I observe when ever I submit I find both MRR and precision to be same and I doubt that can anyone clarify that
Same happened with me. I am waiting for the leaderboard. I will again start doing it when it is up.
Hi @nasir.afroze
How do we have to use ‘training data target’ ? Does it only have future assumptions? Because aov is negative for every instance.
Hi @nasir.afroze,
Would it be possible for you to share the evaluation metric code which is being used to evaluate our submissions? This will help to validate the model locally before making a submission.
hey can you explain aov. Is it cost or value of the product to the user.
Evaluation metric code can not be shared but you can easily code it by yourself. The evaluation metrics are explained in a good enough manner.
A hint- Try to write a find in string function.
AOV - The amount of money a customer pays to buy a product.
Just for clarification , so the max value of precision can be 3 and max value of MRR will be 1 , please correct if I am wrong .
The maximum value of precision can be 1 only. Precision is for checking whether any of your predictions match the correct category.
Yes the maximum value of MRR can only be 1.
It is the correct category which should be predicted for training data set . You can validate your model based on the results you get while using it as the correct category dataset for evaluating your model.
Hi @nasir.afroze,
I started the hackathon today itself and had a couple of doubts regarding the data:
@nasir.afroze What would be the weights for different metrics (Precision, Recall and MRR) to decide the private leaderboard or would it be solely based on Precision which is currently being used to display the rank on public leaderboard?
I have the same question
Hi @paradocs,
Welcome to the second hackathon of GHF.
1 Out of the 257406 transactions listed in training,151 repeat transactions can be done or not? Giving an open-ended answer I hope you get what I am saying.
2 You are asked to predict the future transaction of a certain number of 16k user_id not given in the train_data_target. You can see that there is no user_id common in test_data and train_data_target.
I hope it helps.
Recall>Precision>MRR. The order used for ranking . same criteria for private leaderboard