Dear participants, we highly appreciate your participation in the Datathon as challengers and we are sure that your contribution will make for another great Datathon. During the event your task would be to develop a data based solution to a chosen case study.
Before the event
Register into the Datathon Platform
Register in Datathon website at: https://www.datasciencesociety.net/datathon/register/. The registration will go through e-mail confirmation. A confirmation email about the registration is received. Keep in mind that there might be delay for receiving it or could be in spam folder.The registration becomes valid after the successful completion of a payment with a valid means of payment. Providing a substitute participant is possible at all times. The participation fee can be found in the registration form of the respective event. Make sure that You have filled-in Your profile, including an avatar (not necessary Your actual photo), Your name, Your data science interest, etc.
Check the local host details
If you have chosen to attend one of our local hubs for the event so that you participate physically, you should check in advance the location and transport to the hub, starting times and working time of the venue, any requirements for security, etc. Please show up at the local Datathon Hub on time on the starting day of the Datathon. You will use your own tools, including hardware, software, communications, and stationery. If you need a computer and certain software, make sure you have it prepared.
Read the presented cases
At the beginning of the registration period the simplified descriptions of the cases are available at the Datathon website. In the week before the datathon the full descriptions will be available, and the data-sets will be available at the start of the Datathon. Review all the provided cases. Look for a case that will challenge and excite you. Form or join a team that you will have fun who are interested in the same case. Read preliminary guidelines for the cases by the mentors, where they would have stated their advice and suggestions to the participants of how to solve the case. These guidelines should be ready about one week before the start of the event.
Participate in the online discussions
We highly recommend to be part of the conversation. One of the most valuable aspects of the Datathon is to be part of the online community. For the global communication with the participants, we use DataChat among other platforms.
The data chat is the main place for online communication among the participants, experts and mentors. You’re familiar with Slack, this one works very similar. It is in essence Rocket Chat (help: https://rocket.chat/docs/user-guides/; App Store: / Google Play) and it also has many cool and free features like Videochat, full customization, etc. If you want to install the Rocket Chat application on your mobile, you must be logged into the DSS platform first. All the communications in the DataChat are conducted in English. If you have any technical problems write in the #issues channel. Have fun, make jokes, support others and build a friendly and safe learning environment. Treat other participants the way you want to be treated. Collaborate and expand your knowledge together.
The mentors would log in about once a day online in the DataChat channel in the week before the start of the Datathon, so there are opportunities to interact and introduce yourself. Also all other participants are going to be there.
There are several groups which we can recommend to join, use search menu to find them:
#general – for general info
#announcements – read-only group for announcements from DSS
#issues – if there are problems also you can check Help -> FAQ menu on DSS site
#cases_…. – these are the groups where the case are going to be discussed with industry experts
#nlp_text_mining, #ml, #sna, #cv, #ai, #bio_inf – Areas where the top mentors are going to be and the most important, you are free to set up your team group and start working in private group
These channels are going to be the main place of where you will ask questions to the mentors during the Datathon.
It is not mandatory to have a team formed in order to participate. You will have a chance to form a team virtually up until the start of the Datathon. The preferable size of the team between 3 to 5 people. There are two ways to get in a team:
- Use the platform to proactively form your team
You will be able to form a team with ease once you have chosen a case to work on. We also advise you to take a look at the profiles of other participants and start using the chat room(s) for the case(s) that you find interesting. This is a good way to find other experts that would like to work on the same case.
- You may also use the option (available at registration) to be matched into a team by our Matchmaker, which would suggest you suitable teammates
If you have chosen that option, the Matchmaking suggestions will be streamed live before the event in #announcement channel. If someone is left out during the teaming up in the #announcement channel he/she should move to #general to pair up with other lonely participants and be proactive.
Once the teams are formed all Datathoners need to register the teams. That is done by posting your team details in a new article (which will contain yor team solution by the end of the event). You need to create new article and add all team mates , team name, case, and mentors. There is a template one which can be used called “Example of Team Article”. The role of the mentor is to read and comment the article (your solution) without editing it, while it is in private mode (Draft status). The one who created the article is by default in the team and he is considered as the author, every next teammate registered will be a co-author of the article. Think of a cool name and put it as a title.
Also create your own team channel in DataChat. Find each other in direct messages and appoint one of you to create the team private channel, with the most strange or funny name of your team.
During the event
The Datathon starts officially at the same time Worldwide at a live streamed opening ceremony. There would be some clarifications and possibly other useful messages given by DSS. At the end of the ceremony the data-sets are put available so that your team could start working on the case.
Start working on the team case solution:
Setup a working environment for the team. Transfer all the data into the team environment and start working. Approach industry experts for clarifications in their virtual rooms #case. Approach mentors they are going to monitor all groups relevant to their expertise and there you can raise your question. If you want higher security you can contact them directly, but be aware that they are not going to be online all the time, so if you are using a group (channel) someone else can answer on your question in a timely manner.
Q&A mentors live session
Normally around Saturday morning there are many questions by the participants of the Datathon. That is why we organize live Q&A session on Saturday morning in which the industry experts and the mentors provide some clarifications about the cases and about the approaches to solve them. Your attention would be very beneficial. There will be live streaming with updates in the general Data Chat.
Social network activities
For sharing your stories, we recommend you to use Twitter and Facebook where we want to capture the best moments of the Datathon. We invite every team to post a short team video on Saturday (up to a minute) and a picture on the social media adding #Datathon (and other official hashtags given by DSS). Let’ s create a story and tag ALL pictures and videos so the world can see them!
Upload your results and publish the article.
Finally when the time comes to be proud of yourselves what you did and how much you learn. We advise you to use the CRISP-DM methodology for your masterpiece. You need to add export from your code in Jupyter Notebook or something similar and your results. You may provide and uploaded video of up to 3 minutes to explain some details.
The processes of posting article – articles have three stances:
Draft – Is the working stance of the article, where only the team members and mentors can see, update, and comment.
Private – the article is only accessible by the current Datathon participants
Published – Is the stance where the article is viewable by everyone in the internet, but only logged in participants can comment on it.
The participants are allowed and encouraged to comment, upvote and ask questions to other articles. We want to create long lasting knowledge base and sense of community.
Saturday evening is a deadline to upload a working version of your article [to qualify for the competition]
At the end of the event
Semi Finals – Jury
The teams will be assigned in groups of 4-5 (working on the same case, and maybe one group with various cases) on Saturday evening. Every group gets to be judged by one mentor, one domain expert (from the company which provides the case) and one DSS expert. The team which gets ranked at the top of their group qualifies for the final. The jury vote will be based mainly on the content of the articles of the team. The mentors will provide you with feedback below your article, their names will be with distinguishing color. Participants are also encouraged to leave comments and questions to other articles, when they are available. You can easily compare different or similar approaches. The finalist teams are going to be announced and preparing for the final round where they need to login and present their solution.
Finals and closing
The finalist teams would have a live 3 minute presentation (over Google hangout) followed by 2 minutes Q&A session by the Jury. The team with the best collective rank wins. There will be streaming for the award ceremony so keep an eye on the website, schedule and the #announcement chat for pinned messages.