DecSys2020

4 years, 5 months ago

Semester Project for Decision Systems 2020/2021 Course

This is a semester project for students enrolled in the Decision System 2020/2021 course at the Faculty of Mathematics, Informatics and Mechanics at the University of Warsaw.

Overview

The goal of the competition is to create an efficient model for predicting whether the total 14-days sales of a particular product, offered by the Fitfood company at one of their FitBoxy locations in Poland, will exceed four pieces.

More detailed competition rules are given in Terms and Conditions.

The description of the data and evaluation metric is in the Task description section.

The submission system will open on Monday, December 28.

Terms & Conditions

Participants of the challenge are obliged to follow the competition rules:

This challenge is organized by Andrzej Janusz (the Organizer) for students enrolled in the Decision System 2020/2021 course at the Faculty of Mathematics, Informatics, and Mechanics at the University of Warsaw.
The provided data sets are the property of the Organizer and the KnowledgePit platform. It is forbidden to share or redistribute provided data sets to any third party without explicit consent from the Organizer.
Participants can work individually or as a team consisting of maximally two persons. The teams need to be formed at the beginning of the challenge. Participants cannot change their teams.
Each team has a limited number of submissions - the limit is set to 100.
The number of submissions per day is limited to 5.
Participants can use data that was made available in the challenge - using any external resources is possible only after receiving explicit consent from the Organizer. Queries regarding the external resources need to be issued through the competition forum.
It is strictly forbidden to hack the provided data or to exploit any unfair data leak that can improve the solution score. All attempts at making predictions for any test instance using information extracted from other test instances will result in disqualification.
The deadline for submitting the solutions is January 27, 2021 (23:59 GMT). Late submissions will not be accepted.
Each team is obliged to provide a short report describing their final solution. The report must contain information such as the name of the team, the names of all team members, and a brief overview of the used approach. The description should explain all data preprocessing steps and model construction steps. It should be submitted in pdf format using our submission system by January 27, 2020 (23:59 GMT).
By enrolling in this competition, you grant the Organizer the right to process your submissions and reports for the purpose of evaluation and post-competition research.
The final project score will depend on the quality of the solution (the score obtained in the final evaluation), and on the quality of the submitted report.

Final results

Rank	Team Name	Is Report		Preliminary Score	Final Score	Submissions
1	Niedecyzyjni	True	True	0.8953	0.894000	15
2	krzpiesiewicz	True	True	0.8951	0.893000	10
3	baseline	True	True	0.8919	0.890000	5
4	Krzysztof Grabiec	True	True	0.8894	0.886600	7
5	Golden Bacon MAIers	True	True	0.8867	0.885600	14
6	Patryk Skwiercz	True	True	0.8795	0.877800	24
7	toporowska	True	True	0.8783	0.876500	16
8	mm	True	True	0.8785	0.876500	5
9	Krzysztof Ostrowski	True	True	0.8731	0.871500	7
10	jakubszymkowiak	True	True	0.8738	0.870800	12
11	Kajetan Fornalik	True	True	0.8671	0.864800	3
12	jjk	True	True	0.8501	0.846600	10
13	baseIine	True	True	0.8197	0.815700	6

Enroll

Please log in to the system!

Task description

Provided data describe a short-term sales history of products at various point of sales (PoS). The target attribute will_it_sell tells if in the following 14 day period the total sales of a given product at a particular location will be at least 4 pcs.

The data is very similar to the one from the second graded task, with the difference that the sets in this challenge do not contain any random probes (which were deliberately added to the data from the second graded task for evaluation purposes).

The data tables are provided as two CSV files with the ';' separator sign. They can be downloaded after the registration for the challenge. Both files (training and test sets) have exactly the same format but all the values from the will_it_sell column in the test set are missing.

The evaluation metric will be AUC. During the challenge, your solutions will be evaluated on a small fraction of the test set, and your best preliminary AUC score will be displayed on the public Leaderboard.

The submission format: the solutions need to be submitted as text files with predictions. The file should have exactly the same number of rows as the test data table. In each row, it should contain exactly one real number expressing the likeliness that the correct target value for the corresponding test set instance is 1.

Data files

Forum

This forum is for all users to discuss matters related to the competition. Good manners apply!

There is no topics in this competition.