How is data organised for storage and retrieval beyond the relational database model?
Describe non-database methods of organising data, including flat files, hypermedia and free text retrieval, and the storage and retrieval methods used by information systems
A focused answer to the HSC Information Processes and Technology dot point on organising and storing data. Flat files versus databases, hypermedia, free text retrieval, indexing, online versus offline storage, and the traps markers look for.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
What this dot point is asking
NESA wants you to describe the ways an information system organises data for storage and retrieval, and to compare them. You need to know flat file systems, the database approach, hypermedia and free text retrieval, and the physical and logical methods used to store data and bring it back efficiently.
The answer
Why organisation matters
Two of the seven information processes are organising and storing and retrieving. How data is organised decides how easily it can be searched, updated and kept consistent. The right structure depends on the kind of data and how it will be used, so the syllabus asks you to compare several approaches rather than treat the relational database as the only option.
Flat file systems
A flat file stores all data in a single table or file, with every record holding every field. A spreadsheet of customers and their orders is a flat file. It is simple to create and read, and fine for small, single-purpose data. Its weakness is redundancy: a customer who places ten orders has their name and address repeated ten times. That wastes space and creates update anomalies, where changing the address in one row but not the others leaves the data inconsistent.
The database approach
A database stores data in multiple linked tables, with each fact recorded once and connected to others through keys. This removes the redundancy of flat files, enforces consistency, and supports complex queries across related data. The cost is greater design effort and a database management system to maintain the structure. The database approach also separates the data from the programs that use it, so many applications can share one well-organised store.
Hypermedia
Hypermedia organises information as nodes of text, images, audio and video joined by navigable links. The World Wide Web is the largest example: pages connect to other pages through hyperlinks, and the user navigates rather than queries. Hypermedia suits browsing and exploration where the path through the information is not fixed in advance, but it is poor for the precise, structured retrieval a database gives.
Free text retrieval
Free text retrieval searches collections of unstructured documents, such as articles or emails, for records containing particular words or phrases. A search engine is the familiar example. The system builds an index of the words in every document so it can find matches quickly. Free text retrieval handles data that has no fixed fields, but it returns documents rather than precise data values, and can return irrelevant matches.
Storage and retrieval methods
Once organised, data must be stored on physical media and retrieved efficiently.
- Storage media include magnetic (hard disk, tape), optical (DVD, Blu-ray), solid state (SSD, flash) and cloud storage. Each trades off capacity, speed, cost and durability. Tape is cheap and high capacity but slow, suited to backup; solid state is fast but costlier per gigabyte.
- Online storage is immediately accessible by the system; offline storage (a tape in a vault) must be loaded before use. Near-line storage sits between the two.
- Indexes speed retrieval by acting like a book's index: rather than scanning every record, the system looks up the index to jump straight to the data. The trade-off is that indexes take space and must be updated when data changes.
- File organisation methods include sequential (records in order, read start to finish), and direct or random access (jump straight to a record by its key or address). Direct access is far faster for finding one record.
Backup and archiving
Responsible storage includes backup (a second copy for recovery after loss) and archiving (moving old but valuable data to cheaper long-term storage). A backup strategy specifies what is copied, how often, and where the copies are kept, ideally off site.
Exam-style practice questions
Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
2020 HSC3 marksAn online games store is converting its flat-file database to a relational database. Describe the benefits for the games store of converting its flat-file database to a relational database.Show worked answer →
For 3 marks describe three benefits of moving from a flat file to a relational model.
Reduced data redundancy. In the flat file the publisher and developer names repeat in every game row. Splitting the data into related tables stores each publisher and developer once, so the same fact is not duplicated.
Improved data integrity. Because each fact is stored once, an update (for example correcting a publisher's name) is made in one place, avoiding the inconsistencies that occur when duplicated data is updated in some rows but not others.
More flexible querying and less wasted storage. Related tables linked by keys let the store run targeted queries across tables and add new games or publishers without restructuring, while removing repeated data saves space.
Markers reward benefits clearly tied to redundancy, integrity and flexibility, not just "it is better."
2019 HSC1 marksWhich of the following shows the logical organisation of data in a flat-file database? A. character, field, record, file. B. record, field, character, file. C. record, file, character, field. D. character, file, record, field.Show worked answer →
The answer is A, "character, field, record, file."
This is the data storage hierarchy from smallest to largest unit:
A character (a single letter, digit or symbol) is the smallest unit.
A field is a group of characters representing one item of data (for example a surname).
A record is a collection of related fields about one entity (for example all fields for one customer).
A file is a collection of related records.
So the correct increasing order is character, field, record, file. Markers test that you order from the smallest unit up.
2021 HSC1 marksIn a free text search, the symbols ? and * are used to replace characters in a word. Using the search query ?O*, which word could be found? A. NETWORK. B. OPTICAL. C. RADIO. D. TOPICS.Show worked answer →
The answer is C, RADIO.
In free text retrieval, ? is a wildcard for exactly one character and * is a wildcard for any number of characters (including none).
The query ?O* means: exactly one character, then the letter O, then any characters. So the matching word must have O as its second letter. Check each option:
NETWORK - second letter E, no match.
OPTICAL - second letter P (O is first), no match.
RADIO - second letter A, no match.
TOPICS - second letter O, match (T fills the ?, O is the second letter, PICS fills the *).
Only TOPICS has O as its second character, so the answer is D.