Models of information search: a comparative analysis
Department of Information Studies, School of Media, Culture and
Creative Arts, Curtin University, GPO Box U 1987, Perth, Western
Deputy Vice Chancellor and Head of Campus, The University of Notre
Dame Australia 19 Mouat Street, Fremantle, Western Australia, 6959
Business School, University of Western Australia, 35 Stirling Highway,
Crawley, Perth, Western Australia, 6009
The identification and development of information search models has underpinned our understanding of information management for several decades, offering important guidance on how users interact with their information environments. Of particular importance are the seminal models generated by Ellis (1989), Meho and Tibbo (2003) and Marchionini (1995). These three models offered a progressive view of information seeking processes that users employ when searching information sources. With the advent of other forms of information sources, such as electronic document and record management systems, the efficacy of these models needs to be revisited. Do they sufficiently reflect the processes users undertake in searching applied systems or are more specific models required to reflect the context-specific searching that users employ?
Comparing the three seminal information search models with a recently developed search behaviour model we examine the fit between the generic information search stages developed by the previous theorists and this recent model. This illustrates the importance of testing these traditional models against new applications to determine the synergies and differences occurring in different contexts.
We commence with a review of the established models and then introduce the search behaviour model when using electronic document and record management systems. We then examine the degree of fit between these established models and the applied information search behaviour model.
The information seeking behaviour models
The information seeking behaviour models described by Ellis (1989), Meho and Tibbo (2003) and Marchionini (1995) are widely cited as best describing the ways users search for information. They have been applied to a range of disciplines including computer science, information systems and information science, electronic information seeking and library science (for example (Brine and Feather 2010; Choo et al 2000; Ellis et al. 1993; Ellis and Haugan 1997; Komlodi et al. 2007; Marchionini, 1995, 2000, 2008; Marchionini and White 2007; Robins et al. 2002; Salajegheh and Hayati 2009; Shneiderman 1997; Spink et al. 2002a, 2002b; Vakkari 2003; van Deursen and van Dijk 2009; White 2007; Wilson et al.; Ford et al. 2002b).
As an early investigator of information search, Ellis (1989) observed and documented the search behaviour of sixty academic social scientists at the University of Sheffield who were seeking paper-based information in a library setting. He identified six information seeking processes and information seeking activities (see Table 1, Column 1). Ellis (1989) described these processes as behaviour patterns and behavioural characteristics.
|Six common information seeking activities of social scientists (Ellis)||Revised information seeking behaviour model of social scientists (Meho and Tibbo)||Information seeking processes model of electronic system users (Marchionini)||Search behaviour model of knowledge workers working with electronic document and record management systems|
(starting, chaining, browsing, monitoring, differentiating, extracting, networking)
(recognise problem or need, accept problem, define problem, select search system)
|Stage 1: Start
(chaining, extracting, differentiating, verifying, information managing, synthesising, analysing, writing)
|Plan and execution
(select search system, formulate query / determine entry point, execute, examine)
|Stage 2: Formulate search strategy
(metadata search, retrieval, navigation)
|Evaluation and use
(examine, extract, reflect/iterate/stop)
|Stage 3: Execute search|
|Chaining||Ending||Stage 4: Process and evaluate search
(browse, refine, filter, sort)
|Monitoring||Stage 5: Access search results
|Extracting||Stage 6: Decision making about search results
(scan, verify, confirm, extract)
|Closure||Stage 7: End search
(extract, close, stop, seek help, check, retry)
The information seeking activities described in Ellis's model have proven robust, with subsequent mirroring of academics, physicists, chemists and engineers samples (Brine and Feather 2010; Ellis 1993; Ellis et al. 1993; Ellis and Haugan 1997; Salajegheh and Hayati 2009). Ellis's (1989) general information seeking behaviour model was one of the few models used in a joint longitudinal project between scholars in the United Kingdom and the United States of America to investigate users' mediated information search and retrieval processes (Ellis Wilson et al. 2002; Spink et al. 2002a, 2002b; Wilson et al. 2002a; Ford et al. 2002b).
The information seeking behaviour model developed by Ellis (1989, 2005) was based on information seeking in library environments using paper based information sources, not electronic or online library catalogues. Meho and Tibbo (2003) reviewed the applicability of Ellis's model for computing based library environments where library users had access to online library catalogues and the Internet. Whilst their study re-affirmed the applicability of Ellis's model, they found that a fuller description of the social scientists' information seeking behaviour should include a distinction between information seeking processes and information seeking activities. Hence, Meho and Tibbo's (2003) model usefully identified different stages in the information seeking processes and grouped the relevant activities in these categories, as presented in Table 1, Column 2.
The four inter-related stages they identified in library users' information seeking processes are searching, processing, accessing and ending (Meho and Tibbo 2003). Four additional information seeking activities were added to those observed by Ellis (1989): accessing, networking, verifying and information managing (Meho and Tibbo 2003).
In spite of differences in the format or medium of the information sources used, it is interesting to note the broad transferability of Ellis's (1989, 2005) model. Meho and Tibbo (2003) confirmed the relevance of Ellis's model to electronic contexts.
Marchionini (1995) developed a more integrated model entitled the information seeking process model, the better to capture the processes users apply in an electronic search. This model was designed as a generic construct to describe the information seeking processes from any electronic information source. Marchionini's (1995) model focuses specifically on the information search processes of electronic system users, and their actual information seeking activities. Marchionini's (1995) model has been widely cited in studies on search behaviour and search queries (Komlodi et al. 2007; Vakkari 2003; White 2007); particularly in human computer interactions and user studies especially related to the Internet (Choo et al. 2000; Marchionini 2008; Robins et al. 2002; Shneiderman 1997; van Deursen and van Dijk 2009).
Marchionini (1995) describes his model as employing the 'information seeking processes of electronic system users', depicting the information seeking process as being 'composed of a set of subprocesses' (p. 49). He categorises the information seeking process into three classes of sub-processes: 1) understanding; 2) planning and execution; and 3) evaluation and use. As indicated in Table 1, Column 3, he evaluates the information seeking processes of electronic system users as dynamic and action-oriented, explaining that the sub-processes of understanding are mental activities performed by the electronic system user, while the sub-processes of planning and execution, and evaluation and use, are both mental and behavioural (p. 59).
The information seeking behaviour models of Ellis (1989), Meho and Tibbo (2003) and Marchionini's (1995) were selected as they were deemed to closely reflect the search behaviour of users of the electronic document and record management system, as presented in Table 1. In summary, Ellis's (1989) model was selected for its initial portrayal of the information search behaviour in paper based settings that provided the understanding of and the vocabulary to describe the search activities library users engaged in. Meho and Tibbo's (2003) model was selected as it provided further enhancement of Ellis's (1989) model by providing insight into users' information search behaviour in online systems like the internet. It was also selected for identifying that users engaged in additional information search processes beyond those described in Ellis's (1989) identification of information search activities. Marchionini's (1995) model studied and described the information seeking behaviour of computer systems users: a similar focus to our search behaviour model. It is for these reasons, Ellis's (1989), Meho and Tibbo's (2003) and Marchionini's (1995) models were selected as suitable scaffolds to test our research.
Other information search behaviour models were not deemed as suitable for the purposes of this review, as they emphasised different contexts or explored less related elements. For example, Kuhlthau's (1988, 1993, 1999, 2005) longitudinal studies of students' information seeking behaviour focuses on the student's affective or feelings aspects as they proceed with their information search and as they digested their sought information, which was not the emphasis of our study. Likewise, Bates's berrypicking information search model was not selected as it reported how library users berrypick and gather information from different information sources (Bates 1989). The focus of our study was to investigate the search behaviour of users from the time they started to when they ended and/or stopped their search in the one electronic document and record management system.
The search context in electronic document and record management systems
These are specialised information systems that capture and manage the key records of an organization. Their architecture operates in a similar mode to other information systems, in that they draw on metadata to enable descriptive, storage and retrieval functions. Technological enhancements have encouraged devolution of the search processes to organizational users, who can search at point of need for records according to the linked metadata or various identifiable features. Increasingly prevalence of these systems has however, resulted in little consideration of how user-based information search operates in these applied settings. Do they reflect similar behaviour to those identified by the previous researchers or do users employ alternative processes? Our research aimed to investigate the similarity and differences to the three established models, with the goal of further clarifying the main stages and behaviour of information search. The models offered a useful scaffold on which to frame the descriptions of the search behaviour of electronic document and record management systems. Other models (Krikelas 1983; Kuhlthau 1988, 1993, 1999, 2004, 2005; Leckie 2005; Leckie et al. 1996; Wilson 2005) were less focused on the information process and were deemed less helpful in exploring the processes. Also, excluded was Ingwersen's (1996, 2001) research as it focussed more on cognitive aspects based on laboratory experiments.
Table 1 provides a comparison of the hypothesised search behaviour model for electronic document and record management system users with the information seeking behaviour models by Ellis (1989), and Meho and Tibbo (2003), and the information seeking processes outlined by Marchionini (1995). The seven hypothesised search process stages and search activities for this search model are presented in column 4 of Table 1.
An empirical research method was used in this research to investigate the search behaviour of users of electronic document and record management systems. The search behaviour patterns of forty users from four different organizations were mapped. Interviews with each user were conducted and sample searches were reviewed using a protocol analysis (Ericsson and Simon 1993).
One hundred and four flowcharts were developed from the self-reported and observed searches. Thus, the search behaviour were triangulated using a range of investigative techniques. This strategy adhered to Yin's (1984) recommendation that construct validity be supported by the use of multiple data collection methods. In contrast, previous information seeking studies by Ellis (1989), Meho and Tibbo (2003), Branch (2002: 14) and Ingwersen (1982: 173) used only a single research method.
Four Australian government institutions employing three different electronic document and record management system architectures (HP TRIM, e-Docs and Objective) participated in the research. A range of systems was selected to ensure the results were sufficiently robust across different platforms. Each operated with a different user interface based upon a tree and/or virtual database view. The Objective and e-Docs systems were designed with both a tree view of the classification folder structure and virtual database interface view. Of the two HP TRIM systems, one user interface was designed with the virtual database view only, while the other was designed with both the classification folder structure and virtual database view.
The study participants in each organization comprised one records manager and ten users, giving a total of forty users (fifteen males and twenty-five females) and four records managers (three males and one female). The age distribution of both groups is shown in Table 2.
|20 - 29||30 - 39||40 - 49||50 - 59||60 - 69|
Interviews with the four records managers were conducted to explore how the systems had been implemented and the ways in which the records management standards and national practice principles were reflected (see Singh et al. 2008c). The managers then identified ten experienced users of their electronic document and record management system who might be interviewed about their search behaviour.
The search processes employed by users were explored from three perspectives during the interviews. Firstly, each participant was asked to describe the process and the range of strategies they generally employed when searching for a record. Secondly, they were asked to describe their preferred search style that captured the different options they generally employed. This offered a useful comparator to the system design and training foci that were in place within the organization. Their preferred style also provided a scaffold as to the normal protocol they tended to follow when searching. Users were then asked to recall their last simple and difficult searches, and to describe and demonstrate those search processes. Thirdly, using the protocol analysis users also identified recent simple and difficult searches, and replicated these to the best of their ability. These searches were observed and recorded as a series of steps and decision processes. The observation also noted the point at which the search was stopped. The protocol analysis tool tested whether users actually exhibited the behaviour they had described in their preferred search descriptions.
Thirty semi-structured interview questions were grouped into six broad topical segments (Table 3, Column 1) to measure each user's preferred search behaviour when undertaking a typical search.
|Semi-structured Topical Segments||Purpose and Description|
|(a) Usage||- why use the system and what types of information were sought?|
|(b) Searching patterns in the system||- what search methods were used, what were the preferred search methods, how were search results followed, how and when a decision to stop search is made, what are the difficulties when searching?|
|(c) Classification scheme||- is user familiar with and do they understand the scheme used, how do they find the scheme?|
|(d) Situational and time factors||- does time affect searching and is a time limit applied when searching?|
|(e) Training||- what training has user received and how they found it?|
|(f) Design of the system||- what are user's view of the current design and what changes would they like made?|
Four organizations were selected and then visited by a single researcher (Joseph).
On day one, the researcher reviewed the internal information management documentation, observed a demonstration of the electronic document and record management system to determine how the organization's records management program was implemented.
For three days the researcher conducted hour-long interview sessions with each of the ten users of the system in the organization. First, each user was asked to complete a short questionnaire that provided background information about them. Second, face-to-face semi-structured interviews were conducted with each participant. Finally, each participant described and demonstrated how they conducted their last simple and difficult searches. Generally these searches were conducted at their own work area. A protocol analysis was employed to capture the different search behaviour that users recounted and demonstrated during this third phase.
The interviews and protocol analyses were transcribed. Table 4 summarises the steps described to measure the search behaviour of each individual user.
|Measurement of:||Research method used:||Information sources:||Resources developed:|
|Individual search behaviour of 40 users||Semi-structured interview questions for users||Transcribed semi-structured interview data from users||40 self-reported flowcharts|
|Protocol analysis||Transcribed protocol analysis data from users||38 observed flowchart for simple search|
|29 observed flowchart for difficult search|
First, for each of the forty users a flowchart of their description of their preferred individual search behaviour was plotted, using a similar approach to that employed by (Krikelas 1983; Kuhlthau 1988, 1993, 1999, 2005; Leckie et al. 1996; Wilson 2005). These flowcharts are referred to as their self-reported flowcharts whilst the subsequent flowcharts are referred to as observed flowcharts. The flowcharts documented the different stages of search that were executed and mapped the decision paths taken by each individual.
Secondly, for thirty-eight of the forty users, their observed search behaviour when they conducted their last simple search during the protocol analysis was plotted. Two participants reported on difficult searches rather than simple.
Twenty-nine users were able to identify difficult searches during the interviews. Eleven users stated that they had not or did not encounter difficult searches, or that it had been so long since their last difficult search that they were unable to recall the details.
In total, one hundred and seven flowcharts were drawn and analysed to determine the individual search behaviour for each user including their observed simple and difficult search behaviour in most instances.
Once the flowcharts for each user were plotted, an analysis of the flowcharts revealed that the self-reported searches offered a comprehensive view of each participant's individual search behaviour: identifying all the different approaches that they engaged in when seeking information using the system. The search behaviour observed in specific searches matched the preferred styles, but using a sub-set of the options. For example, the user's self-reported flowchart might indicate that the user was able to conduct a search using three different search formulation strategies, but if the task they had to perform in their last simple search was to find a document with a known record number, then the user might formulate the search using just the record number. For reasons like this, the self-reported search flows were taken as the most definitive source when capturing the final aggregated search behaviour of users. They were found to capture the widest repertoire used by each participant, while the observed searches reflected more targeted approaches to address a specific search problem. Therefore, it was decided to aggregate the forty individuals' reported preferred search processes and search activities and then overlaying the responses for each to answer the research question: what is the search behaviour of electronic document and record management systems users?
Aggregation of forty self-reported flowcharts
The self-reported flowcharts for all ten users in the same organization were mapped onto a single flowchart. The aggregated measure was determined by visual scanning to compare each of the ten self-reported flowcharts, observing similarities and differences in search behaviour from one flowchart to the other. These steps were repeated to aggregate the search behaviour model for each of the four organizations. The final aggregated measurement of the search behaviour flowchart drawn for each organization incorporated all the search processes and activities, collated by visual observation of all ten self-reported flowcharts.
Comparison of the search behaviour model of electronic document and record management system users with Ellis and Meho and Tibbo's models
We report on the search behaviour identified from that research and compare the results with earlier information search models. In reviewing the results, it is important to recognise that each search model has been drawn from quite different user populations, information sources and search foci. However, all user groups were focused on addressing their information need, information gap and/or anomalous state of knowledge (Belkin 1980; Dervin 1992; Wilson 2005). The reasons for seeking information were also different. Library users were seeking information to address a research question, with the goal of obtaining a set of responses to be evaluated, whilst users of electronic document and record management systems required a discrete information source either to complete a work task or to fulfil an information need. We were interested then, to see if the information search models remained comparable, despite the different context in which the searches were operating.
A number of similarities and differences between the search behaviour of library and that of electronic document and record management system users were uncovered. The search processes employed by the participants demonstrated a number of features common to those identified by the previous investigators. Ellis (1989) describes starting search behaviour as 'activities characteristic of the initial search for information' by a researcher commencing work on a new topic or area (p. 178). Examples of these starting characteristics include making use of informal contacts for guidance to starter references in the new area, relying on previous experience using library resource materials to make a start in researching a new topic and referring to citations in papers in the new area as well as following up citations to further their research (Ellis, 1989: 179).
Ellis's (1989) concept of starting was evidenced during the interviews and protocol analysis sessions with the users, who reported searching the system for the following reasons:
- they had a task to do and required information from the system to complete the task;
- they needed to action a task by responding to action items through the system's workflow; or
- they required information in the system as reference material or to recollect what was communicated or performed previously on the same or similar subject.
Unlike the library researchers studied by Ellis (1989), users were more aware of how to start their search in the electronic document and record management system. This was primarily because the information required was not in a new topic area but part of their work task roles. Often users already had role-related background information relating to the information being sought. Therefore, when starting a search, users had a better idea of what they were searching for. They were aware of at least one piece of metadata associated with the search or knew where the information was filed. This clarity in their task assessment enabled users to decide on a search strategy based on whether they had authored or filed the item, or knew where the item was stored in the system; and then to navigate to the folder or item. Unlike Ellis's (1989) subjects, users could also consider whether they had conducted the search previously or had saved the search into their favourite shortcuts.
Like Ellis's (1989) participants, who sought assistance from their contacts for starter references for their search, users used their colleagues as contacts to find out background information that would assist in searching for information. These colleagues often were contacted because they had either created or registered the required information into the system. All the organizations studied had nominated Record Focal Points who are trained power users of the system and of the records management program implemented in the organization. Usually secretaries, administrative staff or personal assistants are assigned the additional responsibility of being the Record Focal Point for their department. They were provided with extra training and could be contacted for assistance if required. If assistance from colleagues was not solicited during the starting stage, it was usually assessed at the end stage if users failed to find the information they were seeking.
Ellis describes browsing as 'semi-directed searching in an area of potential interest' (Ellis 1989: 178) such as recently published journals or current contents, contents pages of journals, library shelves, or indexing and abstracting sources. Users exhibited a more contained form of browsing at three stages their search behaviour processes Figure 1:
- Stage 2: Formulate search strategy;
- Stage 4: Process and evaluate search results; and
- Stage 6: Decision making about search results.
At the formulate search strategy stage, users who decided to search by navigating the tree view displayed browsing characteristics as they worked their way through the folder structure. At the process and evaluate search results stage, users browsed the metadata fields presented in the search results displayed. They scanned the contents of the documents to verify the required information at the decision making about search results stage.
Ellis (1989) discusses closure but does not identify it as a search behaviour characteristic, describing it as 'the feeling of confidence that a search had been sufficiently thorough' (Ellis 1989: 185). The marking of sufficient citations signalled the closure of search activities by the library users. Ellis notes that 'sometimes a point was reached where the citations seem to be becoming more and more peripheral to the main subject of interest, or start dealing with increasingly minute aspects of it (Ellis 1989: 184). It was at this point, according to Ellis, that library users decided to close their search, as chasing up citations yielded the same references again or the new information added little value to that already gathered. Thus, in Ellis's research, the user closed their search once sufficient responses were obtained.
The closure search behaviour observed in the current research is described as part of ending the search in Stage 7 of the search model. For these users, the search goal was to achieve a single accurate record capture that matched the task requirements. The users reached a point in their searching that led them to either close their search if they found what they were seeking or to stop the search if they felt they had exhausted all possible search strategies and still could not find the information. Stopping the search for the users did not mean that they ended the search: it marked the stopping of their initial search efforts. Users then sought assistance from their colleagues or the Records Section and either returned to retry the search in the system or found the information elsewhere.
Grouping of search activities into four inter-related stages
Unlike Ellis (1989), Meho and Tibbo (2003) group the different search activities they identify in their research into four inter-related stages (Table 1, column 2). The search activities in bold font (Column 2) represent new characteristics observed by Meho and Tibbo (2003) that were not noted by Ellis (1989).
Meho and Tibbo (2003) report that users followed the four hierarchical search stages. However, there was no hierarchical sequence to the search activities performed by social scientists within each stage.
In this study, as in Meho and Tibbo's (2003) we found that the search behaviour of users was grouped and they consistently followed the seven hierarchical search process stages (Table 1, column 4). Further as in Meho and Tibbo's (2003) model, we observed users engaged in all or a few of the search activities within each search process stage depending on their search task requirements and search results presented by the system.
Hierarchical sequential stages
Unlike Ellis's (1989) and Meho and Tibbo's (2003) research, the search behaviour model reflects a hierarchical sequence of search activities. Within each search process stage, electronic document and record management system users were observed to engage in various search behaviour. Given users were seeking information from a computer system, there was a definite sequence of stages they engaged in until they ended their search in the system. However, the stages users went through were iterative. For instance, from Stage 4 users could return to Stage 2 to refine their search by reformulating their search strategy; or decide to retry their search in Stage 7 and return to Stage 2 to reformulate their search strategy. However, it was not possible to repeat specific search activities once the user moved to the next search process stage. As an example, scanning and verifying content could not be done in Stage 2: Formulating search strategy but only in Stage 6: Decision making about search results. Users were still able to repeat some search behaviour within Stages 4, 6 and 7 in the search model. For instance, in Stage 4 users could switch between search behaviour such as browse search results and sort results, return to browse results, then decide to filter results and browse again.
Ellis describes monitoring as "maintaining awareness of developments in a field through the monitoring of particular sources" (Ellis 1989: 178). Users neither reported nor were observed in any monitoring activity within the system. Instead, they relied on information sources such as their peers or line managers to keep them informed, via emails links to documentation in the system or via meetings, workshops or seminars. In two of the organizations incoming correspondences were transferred to the relevant knowledge workers, providing them with a mechanism to monitor and maintain awareness of what was happening in their line of work. This process partially reflects Ellis's description of monitoring. However, no user actually employed the option.
Some electronic document and record management systems provide automatic monitoring functionality options where users can set automatic notifications concerning information they intend to monitor to be sent to them at both folder and document/record level. However, the versions of the systems studied in this research did not provide automatic notification. Only one of the forty users reported a benefit from the automatic email notification as a way of monitoring new information on work tasks of relevance that sometimes are not readily shared by fellow colleagues. The remaining thirty-nine users stated that they preferred not to clutter their already unmanageable email systems with such notifications. They indicated their current monitoring practices were sufficient.
Chaining was a search behaviour characteristic that users did not exhibit in their self-reported and observed flowcharts, unlike library users in Ellis's (1989) and Meho and Tibbo's (2003) studies. Ellis describes chaining as 'following chains of citations or other forms of referential connection between material' (Ellis 1989: 178). He notes that chaining takes two separate forms; backward chaining and forward chaining. Backward chaining is 'following up references or sources cited in material consulted' (Ellis 1989: 183), and forward chaining is 'identifying citations to material consulted or known' (Ellis, 1989: 183).
There are two possible ways in which chaining can be performed when using an electronic document and record management system. First, backward chaining is possible using the systems related to linking functionality, which enables related information to be grouped together and linked. However, user effort is required to establish links, in order to realise the benefit in backward chaining later.
While users were aware that the electronic document and record management systems had chaining they did not employ this option when searching and it was therefore omitted from the search model. However, it is a possible search behaviour of users and may be observed in future research if users are trained in working with this option.
Differentiating and extracting
Ellis describes differentiating as 'using differences between sources as filters on the nature and quality of the material examined' (Ellis 1989: 178) and extracting as 'systematically working through a particular source to locate material of interest' (Ellis 1989: 178). Both these search behaviour characteristics are relevant to library users who are searching for multiple sources and tend to check a variety of indexing and abstracting tools and other information sources (Ellis 1989; Meho and Tibbo 2003). As these users were interrogating a single information source it is not surprising that they did not exhibit these two characteristics.
Comparison of the search behaviour model of electronic document and record management systems with Marchionini's information seeking processes model
Marchionini's (1995) information seeking processes model contains hierarchical sequential stages similar to the search model developed for this study. However, our search model has seven stages, whilst Marchionini's (1995) model consists of only three: 1) understand; 2) plan and execute; and 3) evaluate and use. A comparison of the similarities and differences between Marchionini's (1995) three information seeking processes and sub-processes and those of the search model (Table 1, Column 3) is presented next.
The first row in (Table 1, Column 3) shows that Marchionini's (1995) information seeking sub-processes occur in the first stage of his model, which is understand. These sub-processes of recognising the problem or need, accepting the problem and defining the problem are similar to what users were observed performing when they conducted their task assessment activity in Stage 1: Start. In the task assessment activity users tried to understand, accept and define their work task, search task and subsequent task knowledge.
Plan and execution
The second row in (Table 1, Column 3) presents the information seeking sub-processes that Marchionini (1995) states occur in the second stage of his model, plan and execute.
In Stage 2, Marchionini (1995) describes his users selecting a search system, formulating their search query, determining their entry point, executing their search and finally examining the search results. Given that the electronic document and record management system is the default system under study in this research, the select search system in Marchionini's second stage is not observed in this model (Marchionini 1995: 59). On the other hand, Marchionini's formulate query and determine entry point is present in Stage 2: Formulate search strategy of the search model. Similarly, Marchionini's execute sub-process appears in Stage 3: execute of the model, and Marchionini's examine sub-process has its parallel in Stage 4: Process and evaluate search results.
Evaluation and use
The last row in (Table 1, Column 3) presents the information seeking sub-processes that Marchionini (1995) states occur in his third stage, evaluation and use. This time Marchionini's examine sub-process has its counterpart in several stages of the search model:
- Stage 4: Process and evaluate search results;
- Stage 5: Access search results; and
- Stage 6: Decision making about search results.
A variety of search activities such as browsing, filtering, sorting and launching information that were performed under each of the above stages in the search model, are not addressed in Marchionini's model.
The extract and stop information seeking sub-process in Marchionini's (1995) model is similar to the extract documents and stop search activities observed in Stage 7: End search in the search model. Likewise, the reflect sub-process in the information seeking process model is similar to the launch documents and scan and verify contents in Stages 5 and 6 respectively of the search model. It is unclear what Marchionini (1995) refers to in the iterate sub-process in his model. We assume he refers to the iteration of all three sub-processes in the evaluation and use stage. This being the case, it would be similar to users iterating their search activities in each of the seven stages or deciding to reiterate from Stage 4 to 2 or from Stage 7 to 2.
Despite these variations, there are more similarities than differences between the search model developed in this research and Marchionini's model. However, the search behaviour model provides a more specific and detailed description of search processes, behaviour and activities of electronic document and record management system users than is possible using Marchionini's (1995) information seeking processes of electronic system users model.
Summary of findings
The research findings supported the hypothesised search behaviour model (Table1, Column 4), thereby indicating the appropriateness of building on information search models by Ellis (1989), Marchionini (1995), and Meho and Tibbo (2003), (Table 1, Columns 1 to 4).
In summary, the model reflected many of the principles previously documented, while offering a more complete encapsulation of users' search behaviour. The research identified similar search activities to those described in the previous models. However, search process stages are only present in Meho and Tibbo's (2003), Marchionini's (1995) and our search behaviour model, with all three models identifying search process stages that are hierarchical and iterative.
Differences were identified as a result of the purpose and levels of control over the relevant information systems and their resultant influence on users' search behaviour. The earlier models reflected a context where trained librarians registered content into library catalogues and online databases. In the case of the electronic document and record management system, content could have been registered by information professionals or by employees. The accuracy of the content at data entry levels impacts on the quality of the search outcome. This in turn influences the search processes and activities employed by users and their related information search behaviour. In the case of the search behaviour model, it was apparent that the quality and accuracy of content registered into the system was a factor that impacted on whether users' search experience was classed as simple or difficult. Users reported that their search was difficult when content registered was: misspelt; titled differently from how user would title or search by; and is filed into electronic files where user would not consider searching from (Joseph, 2009, 2010a). This complexity was not evident in the earlier models, where controls were more carefully managed by expert professionals.
Both Meho and Tibbo's (2003) and Marchionini's (1995) models portray generic search behaviour in electronic systems. In contrast, the search behaviour model is specific to search behaviour in electronic document and record management systems only. However, its depth of exploration and more detailed analysis of information search may provide a model for future explorations of other applied search contexts.
Of particular note in this study was the finding that all users engaged in the same seven stages of a linear search process, although with some variation in the search activities at each stage. Their training in the different search methods and their preferred search styles influenced these variations. It was notable that there were two conclusions to the search: closure, when the search was successfully completed, or stopage, when a new strategy was required. Unlike the earlier models, this research highlighted the reliance on consulting people or other information sources to resolve the problem. Perhaps this reflects the increasing socialisation of knowledge systems in workplaces.
The findings highlight the social interactions occurring in real work contexts including the way in which knowledge communities operate. Users' decisions on whom or what information sources they should next pursue indicated their transient knowledge about their knowledge co-workers and knowledge community. They were observed to be subtly aware of: who knows what; who is involved with current projects; what past projects they worked on; the information sources their peers worked with and were generally aware of their colleague's subject expertise. Equipped with this knowledge they were observed returning to the system to retry their search formulation strategy if they obtained updated or new task knowledge that might assist in retrieving the sought information.
In summary, the models by Ellis (1989), Meho and Tibbo (2003) and Marchionini (1995) were reliable scaffolds to develop the search behaviour model of electronic document and record management system users, despite the different context in which the searches were operating. Apart from the comparison of the similarities and difference discussed in this paper, there are two important differences between the search behaviour model and Ellis's (1989), Meho and Tibbo's (2003) and Marchionini's (1995) models. Firstly, their models do not present what users do after they stopped their search, see Stage 7: End Search (Figure 1.1). Secondly, their models do not differentiate between simple and difficult search behaviour.
In contrast, the search behaviour model addresses these differences, offering a more complex picture of the ways in which users interrogate systems in a more fully integrated work setting. The research offers useful guidance for information professionals seeking to monitor their users' search behaviour and implement strategies to simplify their difficult search encounters (see, for example, Joseph (2009: 39; 2010a). The model in Figure 1 illustrates the common search processes and activities users engaged in when seeking corporate information from the system. However, at several of the search process stages different users choose different search activities. For example, some users opted to search using the metadata fields, some using their favourite shortcuts, whilst others searched by navigating the tree-view folder structure. Our research shows that these different search behaviour are owing to three factors: 1) the design of the electronic document and record management system 2) the specific search task being addressed by the user and the 3) training the user received. These findings suggest that organizations use the search behaviour model as a template to find out about their individual users' search behaviour. We recommend borrowing the vocabulary used in describing search processes and activities in our model for this exercise. Alternatively, users could be encouraged to diagnose their own search behaviour using the template. Information professionals could subsequently use the search behaviour assessment to improve training needs and perhaps review the system design of their electronic document and record management systems.
The findings recommend that training on electronic document and record management systems be targeted on meaningful document titling and on consistent entry of important metadata fields when registering new documents. Further, the findings suggest that information professionals provide training to all and especially new employees on working with and searching in the system. Short but targeted system refresher training for users is recommended to improve their search skills.
Apart from the training implications there are also two system design value implications learnt from the search behaviour model. The first is to design systems that contain metadata fields with controlled pick-lists to ensure the capture of accurate and consistent metadata in such applications. Such design measures would enable consistent and accurate metadata capture when registering new information and importantly when searching the system. Next, the findings recommend the system be designed to offer users with both a virtual database view and a tree-view folder structure view, to meet the search styles of different users and to offer them flexible search options to better suit their search needs. This would enable users who are familiar with where their information is classified to navigate to the information quickly.
Overall the search behaviour model assists in an increased understanding of the detailed search processes and activities knowledge workers undertake when searching in context-specific applied systems in organizations. It offers considerable insight into the decision processes that are undertaken and highlights the challenges that information professionals may encounter as they shift to a user-managed information setting. The overall findings of this study indicate that there is considerable value in working toward a common scaffold that maps information searching across different information systems. This, in turn, offers an important basis to better identify the stages at which user support can best be offered. The research also highlights the value of user education in building a stronger understanding of different search approaches and their application.
About the authors
Pauline Joseph (PhD) is a Lecturer in Records and Archives Management at the Department of Information Studies at Curtin
University. Pauline Joseph is the corresponding author and can be contacted at: email@example.com. Note that Pauline's surname was previously Singh.
Shelda Debowski (PhD) is Deputy Vice Chancellor, The University of Notre Dame and previously, Winthrop Professor of Higher Education Development at the University of Western Australia.
Peter Goldschmidt (PhD) is a Professor in Information Management at the University of Western Australia Business School.