Story image

Enterprise search doesn't diminish the need for repository governance

06 Apr 17

With enough sophistication in analysis, there is an argument to be made that search relevance can be inferred even from very "messy" sets of data.

In enterprise search, this is increasingly the case, with natural language processing and machine-learning technologies helping to mine vast amounts of data from diverse repositories across the enterprise.

However, this does not mean that the search ecosystem does not benefit immensely from underlying governance practices that aim to enforce policies and lifecycles for data within their native repositories.

By doing so, the underlying corpus of data is kept cleansed and prepped for search, which greatly improves performance and outcomes.

Search performance is proportional to data quality

No search tool is powerful enough to entirely filter out dirty or irrelevant data, particularly when that data is unstructured in format and imbued with human-generated meaning.

The axiom repeated ad nauseam in statistics bears true: garbage in, garbage out. With most search tools on the market providing an ecosystem of connectors to access data in their native repositories (as opposed to aggregating and storing the data themselves), the onus is on the enterprise to ensure that the native repositories are operating as sources of truth rather than sources of confounding data.

For federated search tools to achieve optimal performance, attention must still be paid to the governance of the individual content repositories that the search tools are connected to. Therefore, the data that is being searched and accessed is relatively clean and managed to begin with.

Failure to manage these repositories in their natural state diminishes the true potential of enterprise-wide search and discovery by inflating the volume of irrelevant/outdated data, increasing required processing power and cost, and increasing the risk of unauthorised data access.

Adoption of an enterprise search and analysis tool ideally begins with an audit of the systems to be connected; ensuring that governance policies, lifecycles, and controls are already established and being properly executed within each repository.

Another challenge with enterprise search is maintaining security and access control settings that may exist within native content repositories.

This is because search results need to reflect individual user roles and permissions without displaying inappropriate or sensitive results to unqualified users. Housekeeping is required first.

When implementing a search and discovery tool, attention must be paid to the native security and permission settings within the individual data platforms and repositories that the search tool is connected to.

Although, while some search tools have excellent functionality for inheriting and maintaining these controls, the controls themselves must be in place first to pass on. Due diligence is required in this regard because many organizations still struggle to coordinate governance policies across respective systems.

Video conferencing in dire need of simplification, study shows
A Forrester study shows that 84% of companies are using two or more cloud-based video conferencing apps.
Three ways to achieve data security whilst enabling BYOD
"A mobility strategy is now more important than ever before, that said, selecting the right one is often no small task."
Mobile Infrastructure market sees fastest growth since 2014
The report from Dell’Oro shows that while the vendor rankings for the top three vendors remained unchanged with Huawei, Ericsson, and Nokia leading.
HPE unveils AI-driven operations for ProLiant, Synergy and Apollo servers
With global learning and predictive analytics capabilities based on real-world operational data, HPE InfoSight supposedly drives down operating costs.
How IoT and hybrid cloud will change in 2019
"Traditional VPN software solutions are obsolete for the new IT reality of hybrid and multi-cloud."
Enterprises to begin closing their data centres
Dan Hushon predicts next year companies will begin bidding farewell (if they haven't already) to their onsite data centres.
Citrix acquires micro app platform Sapho
Sapho’s micro applications improve employee productivity by consolidating access to tools, activities and tasks in a simple and unified work feed.
HPE expands AI-driven operations
HPE InfoSight extends select predictive analytics and recommendation capabilities to HPE servers, enabling smarter, self-monitoring infrastructure.