Task: Determine Requirements, Assumptions, and Constraints
• List all requirements of the project, including schedule of completion, comprehensibility and quality of results, and security, as well as legal issues. As part of this output, make sure that you are allowed to use the data.

• List the assumptions made by the project. These may be assumptions about the data that can be verified during data mining, but may also include non-verifiable assumptions about the business related to the project. It is particularly important to list the latter if it will affect the validity of the results.

• List the constraints on the project. These may be constraints on the availability of resources, but may also include technological constraints such as the size of dataset that it is practical to use for modeling.
Purpose

Make an assessment of the liabilities of the project. Making these concerns as explicit as possible will help to avert future problems.

Relationships
Steps
Determine Requirements

The fundamental requirement is the business goal discussed earlier, but consider the following:

  • Are there security and legal restrictions on the data or project results?
  • Is everyone aligned on the project scheduling requirements?
  • Are there requirements on results deployment (for example, publishing to the Web or reading scores into a database)
Clarify Assumptions

Are there economic factors that might affect the project (for example, consulting fees or competitive products)?

Are there data quality assumptions?

How does the project sponsor/management team expect to view the results? In other words, do they want to understand the model itself or simply view the results?

Verify Constraints

Do you have all passwords required for data access?

Have you verified all legal constraints on data usage?

Are all financial constraints covered in the project budget?