Updated November 27, 2017
Databases are central to data science. A database is where data is collected and stored so that data analysts and scientists can explore it. Structurally, it is a collection of schemas, tables, queries, views, and reports. When someone refers to a database, they mean a specific set of related data and the way it is organized.
A database system allows one to search through a database in an organized fashion to discover patterns and insights in the data. Such systems are known more technically as Database Management Systems (DBMS). They include:
- IBM’s DB2
Using these tools, a data analyst or scientist can explore a specific set of data or the entire database, depending on the settings he or she applies.
Because Database Management Systems and databases are so closely related, they are often casually treated as one and the same. To clarify: a database is the collection of data. A Database Management System allows data analysts to access, explore, understand, and utilize that data for analytical purposes.
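To make the distinction concrete, here is a minimal sketch in Python. It assumes SQLite as the DBMS, chosen only because it ships with Python's standard library; it is not one of the systems named above. The `customers` table and its rows are hypothetical, invented purely for illustration. The data is the database, and the SQLite engine reached through `sqlite3` plays the role of the DBMS.

```python
import sqlite3

# Assumption: SQLite stands in for the DBMS; the data it manages is the database.
conn = sqlite3.connect(":memory:")  # temporary in-memory database

# Hypothetical table and rows, invented purely for illustration.
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, city TEXT)")
conn.executemany(
    "INSERT INTO customers (name, city) VALUES (?, ?)",
    [("Ana", "Lisbon"), ("Ben", "Austin"), ("Chloe", "Lisbon")],
)
conn.commit()

# Explore one specific set of data: customers in a single city.
lisbon_customers = conn.execute(
    "SELECT name FROM customers WHERE city = ? ORDER BY name", ("Lisbon",)
).fetchall()

# Explore the database as a whole: ask the DBMS which tables it holds.
tables = conn.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table'"
).fetchall()

print(lisbon_customers)
print(tables)
conn.close()
```

The first query narrows the view to one set of records, while the second asks the DBMS to describe the database itself. Other systems expose the same two kinds of exploration, though the exact commands differ.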
You can pull data from the database using a query language known as SQL, but let’s save that for the next article:
Click here to learn about SQL Data Extraction