. A purist, academic definition of normalization . 1st, 2nd, 3rd, Boyce-Codd, 4th, 5th, and Domain Key Normal Forms . Normalization and referential integrity as expressed by primary and foreign keys What Is Normalization? The academic definition of normalization is the accepted format of Normal Forms definition. I like to label normalization as academic because the precise definitions of Normal Forms are often misunderstood in a commercial environment. In fact, the truth is that language use in the exact definitions for Normal Forms is so very precise and carefully worded that problems are caused. Many database designers do not understand all facets of normalization in other words, how it all really works. Alot of this is a result of such precise use of language. After all, we are now in a global economy. There are a multitude of database architects who do not speak English, have a limited command of the English language, and should not be expected to be well-versed in either respect. In general, normalization removes duplication and minimizes redundant chunks of data. The result is better organization and more effective use of physical space, among other factors. Normalization is not always the best solution. For example, in data warehouses, there is a completely different approach. In short, normalization is not the be-all and end-all of relational database model design. This chapter also describes a brief user-friendly interpretation of Normal Forms. It is just as important to understand Normal Forms from a more academic, more precise but possibly less commercially viable perspective. The problem with the academic approach to normalization is that it seems to insist on always expecting a designer to apply every Normal Form layer in every situation. In my experience, in a commercial environment this is nearly always a mistake. The trouble with the deeper and more precisely refined aspects of normalization is that normalization tends to over-define itself for the sake of simply defining itself further. Before going into the details of normalization, some specifics should be covered briefly, including the concept of anomalies and some rather technical mathematical jargon. The Concept of Anomalies The intention of relational database theory is to eliminate anomalies from occurring in a database. Anomalies can potentially occur during changes to a database. An anomaly is a bad thing because data can become logically corrupted. An anomaly with respect to relational database design is essentially an erroneous change to data, more specifically to a single record. To put this into perspective, data warehouses can add and change millions of records in single transactions, making accounting for anomalies over zealous. In the interests of mathematical precision, explicit definition is required. Why? Mathematics is very precise and anomalies always should be accounted for. That is just the way it is. Consider the following: . Insert anomaly Caused when a record is added to a detail table, with no related record existing in a master table. In other words, adding a new book in Figure 4-1 requires that the author be added first, assuming, of course, that the author does not already exist. 74 Chapter 4
Note: If you are looking for cheap webhost to host and run your apache application check Vision jboss web hosting services