Securing Windows Server 2008 R2 : File Classification Infrastructure

9/22/2011 5:34:50 PM

Imagine opening a door within your company’s workspace and seeing a room of pile upon pile of file folders stacked ceiling high and stuffed full with loose pages of information. Some of the file folders may contain related data files while others contain disparate information. Some data is considered pertinent to the company and needs to be kept for many years. Other data may be outdated or stale and in need of purging.

The ceiling-stacked piles stretch the length of the room, and as you continue down the hallway, you notice that the files are distributed ceiling high throughout many rooms. This seems like fiction, but in most IT environments, dealing with file servers in the environment depicts a daily task in an administrator’s reality.

Electronic data has evolved over time to allow users to store all data sensitivity types. Data is most useful to users if it is available when needed. By stacking company data into disorganized piles within a File server infrastructure, data may be hard to come by when it is needed most. This is a problem that has existed since electronic data storage began, but the problem has become exacerbated as the amount of data that needs to be retained in the corporate world has grown. Another challenge that is occurring in most environments is the need to comply with stringent regulatory compliance requirements, which calls for data to be readily available and easily discoverable.

With Windows 2008 R2, Microsoft has introduced a new feature functionality that will begin to help administrators tackle this ever-growing problem. It is called File Classification Infrastructure (FCI) and its purpose is to assist in automatically classifying files in your environment to make them easier to manage and discover. Three methods exist to classify files:

Automatic classification—This is rule driven and files are classified based on content or folder location
Manual classification—Users can configure file properties that influence their classification
Line of Business applications and IT scripts—by utilizing the FCI, API files can have their file properties configured automatically via applications and scripts

By creating file management policies, you can control the way files are classified and then, based on their classifications, tasks can be performed against the files. A good example of this occurs when data is considered stale. In many environments, data purge does not occur on a regular basis. As a result, stale data can accumulate and create unneeded file content that must be maintained. By utilizing policies, you can classify data, and then perform data management tasks such as expiring files on a routine basis. In the next sections, we will explore the concept of FCI in more depth and discuss what you need to do to deploy it successfully.

Planning for FCI

Like many things in IT, FCI is a technology better deployed with a good plan behind it. So before jumping in with both feet, it is a really good idea to come to an agreement as an organization as to what the FCI classification structure will look like. Group files into like categories sound easy, until you start to discuss within the organization what those classifications actually are.

Within a corporation, often times, a Document or Records Management policy already exists. These existing policies can form wonderful springboards when planning for an FCI deployment. Often, the policy will have each distinct file or record type called out along with the named classification and associated retention information document for each type. If you are lucky enough to work in an environment with a formalized and well-documented records management policy, your journey in deploying FCI will have a much more clearly laid-out path.

For those of you not quite as lucky, it is in your best interest to plan and design for the classifications infrastructure before you attempt to build it. Additionally, you will not only want to know how the files will be classified, but what types of actions are to be performed on the various classifications. It may be useful to create a matrix documenting the automated file classifications and the actions for each. Table 1 describes a fictional example. The next step is to map your plan to the functionality within FCI. We will discuss deploying FCI in the next section.

Table 1. File Classification Matrix Example
File Classification Information	Property Value	Associated Actions
Sensitivity Ratings Confidential	Yes or No	Backup and retain for 12 years Discoverable within 2 days Expires on disk after 1 year
Internal Only	Yes or No	Backup and retain for 3 years Discoverable within 2 days Expires on disk after 1 year
Public	Yes or No	Backup and retain for 3 years Discoverable within 1 day Expires on disk after 180 days
Personnel Content	Contains SS#, phone, or home address Does not contain personal info	Move to Personnel file share No action
Business Impact	High	Discoverable within 1 day Expires on disk after 1 year
	Medium	Discoverable within 2 days Expires on disk after 1 year
	Low	Discoverable within 10 days Expires on disk after 180 days

Configuring FCI

To utilize FCI, the server must hold the File Services role. To install FCI open Server Manager, right-click the File Services role and select Add Role Services, as displayed in Figure 1. This will launch the Add Role Services wizard and allow you to select File Server Resource Manager from the list.

Figure 1. Add Role Services.

Once you have completed the installation wizard, you will then have the File Server Resource Manager console available to you on the Administrative Tools menu (see Figure 2 ). We will be reviewing Automatic Classification, and the configuration for each of the different components takes place from within this console.

Figure 2. File Server Resource Manager Console.

The console contains a section called Classification Management. Within Classification Management, you have two subnodes: Classification Properties and Classification Rules. The Classification Properties section is where you will build out your classifications plan into Classification Property Definitions (see Figure 3 ). The Create Classification Property Definition screen will require you to name your property definition and then identify the property type. You have quite a variety of choices to select from and will want to stick to your originally laid-out plan. Also, keep in mind that simplification of the classification structure you build will help to ease administrative burden down the road.

Figure 3. Classification Property Definitions.

Once you have completed building your classification structure, the next step is to create rules. Classification Rules Definitions, displayed in Figure 4 , are what will be used by the system to judge when to assign which property definitions to the various files you scan.

Figure 4. Classification Rules Definitions.

Each rule must contain the directories which are to be classified, and the classification mechanism. The choices for classification mechanism are Folder Classification and Content Classifier. The Folder Classification allows you to specify folder information to be used as the match criteria to tag a file with a particular property. The Content Classifier allows for a more detailed match and can search file content in order to match. Regardless of the selected classification mechanism, you utilize the Advanced option on the Classifications tab to specify the parameters or values used to match (see Figure 5).

Figure 5. Classification Mechanism and Property Settings.

Managing FCI

Ok, now that you have built your Classification Properties and Classification Rules, do not sit back and wait for magic to happen. We have a few more steps to go before the system will start to work for you. First, we must send the rules you have just created out into your file structure to start scanning and tagging documents. You have two choices of how to accomplish this: One method is to run a manual scan for all rules on demand and the second method and preferred choice is to schedule scans to run on a recurring basis. After performing an on-demand scan, a statistical report of the results will be displayed. A portion of a sample report is displayed in Figure 6.

Figure 6. Sample Scan Report.

If you choose to schedule the scan, you will be asked to configure a standard scheduling window with your desired parameters for the scheduled execution. The schedule screen is part of the File Server Resource Manager options and is displayed in Figure 7. Scheduled scan reports are stored in the %systemdrive%\StorageReports\Scheduled directory by default.

Figure 7. Automatic Classification Schedule Creation.

Now you have effectively sent rules out into the file system to tag files with different classifications. Congratz! But ask yourself, what have you really accomplished? At this point, you have a whole pile of tagged files, but you have not really performed any actions on them besides categorization. So, the next step in working with FCI is to decide what to do with these classified files. In the File Server Resource Manager console, there is a section labeled File Management Tasks. File Expiration and Custom are the two file management tasks actions available. File Expiration allows you to configure a directory as a destination for any files that are deemed expired. Custom allows you, as the administrator, to create your own file management tasks which fit the needs of your organization.