Key Information

Register
Submit
The challenge is finished.

Challenge Overview

This project aims at building a Proof of Concept application that would provide a user with an ability to categorize his/her personal activites and important moments and aggregate them by those categories and also time.  The categorization will be done automatically by analyzing the information extracted from his personal data from emails, documents provided by him, social profile (like facebook/twitter) etc. The current contest fouses on the categorization part of this overall concept.

We want to build a component that would provide us a prototype of the above capabality. Specifically, following are the high-level requirements for the component:

1.) Allow the users to create categories.

2.) Read the data from source files.

3.) Parse the  data and categorize it using natural language processing and classification algorithm.

4.) Allow the user to see the summaries of their stored data aggregated by categories or based on time.

Example: Consider that a user creates two categories A and B. The component then reads the source data. Consider that the source data consists of text files containing set of emails of a user. Each email is a data sample. The algorithm parses the text and then runs the classification algorithm to put into appropriate categories. Now, the component will also have the ability to allow user to select and see the information under each category. When he selects either a category A or category B, he should be able to see his mails corresponding to that category. He can also select to see the infomration aggreagated by time.

Some points to be considered here are as follows:

1.) When the user selects the category we just want to show him the summary of the information under that category. For example, he had 50 mails belonging to that category, only a small summary of each mail should be shown with a link to full data. This may require to plan for two different storage  - one for summaries and one for actual data.

2.) The time granularity should be month-wise.

3.) Currently, we do not want the component to actually connect to the social profiles or email accounts.The data source will be simple text files stored locally. Each data sample in the text files will have timestamp associated with it.

4.) Also the UI needs ot be very simple. Allow the user to create/delete categories. Allow the user to select which category they want to see and the time frame for which they want to see. The time frame can have granularity level of year and month.

Data Source:

The exact data sources to be used will be discussed in forums. But for the design, yo ucan consider the input data to be stored in text file. Each data sample will have associated time tag.

Documentation Provided:

The idea for this application with potential algorithm choices was  rolled out through an idea generation contest recently. We have provided the documentation from that contest.



Final Submission Guidelines

The component would work as a stand alone POC application. Both back-end and front-end are covered in this component.

Technology:

As this is a prototype application, the members are allowed to use any algorithm friendly langauge for their use.

Please note that we want to develop a Desktop Application

ELIGIBLE EVENTS:

2014 TopCoder(R) Open

REVIEW STYLE:

Final Review:

Community Review Board

Approval:

User Sign-Off

SHARE:

ID: 30035630