FEATURE: MINE Magazine – Discovering new mining deposits with big data

Following in the footsteps of tech firms like Amazon and Google, US mineralogists and scientists are experimenting with machine learning and big data to discover new, potentially lucrative mineral deposits. Heidi Vella speaks to the experts from the Deep Carbon Observatory to ask how their research might one day transform mineral exploration.

Understanding more about mineral formations and physical properties has high value for academics, miners, material scientists and engineers alike. And while mineralogists and geologists have been studying the Earth’s crust for centuries, its sheer size and complexity means there is still much more to discover.

“Much of what we learn through material and atomic scale behaviour is through minerals, so it is important to know and characterise what is out there,” says Dr Shaunna Morrison, a mineralogist and planetary scientist at Carnegie Institution’s Geophysical Laboratory.

“When we have a good representation of the actual mineral inventory it can tell us a lot about the geologic past of our planet.”

To learn more and to learn quicker, experts have been experimenting with applying advanced data science techniques to mineralogy.

Last year, Morrison and her colleagues from the Deep Carbon Observatory – a community of over 1,000 scientists – published a paper that showed how network theory, which has previously been used to analyse the spread of disease, terrorist networks, and Facebook connections, can reveal mineral diversity and distribution worldwide.

The technique uses mathematical theory for organising, connecting and understanding data and can even predict minerals as yet unknown to science. But it’s the potential for finding new deposits that really shines.

Mineral discovery using data science

According to Morrison, the data science method that could be of the most interest to miners is the ongoing work using recommender systems – a mathematical data science algorithm for infinity analysis or ‘market basket’ analysis –  to understand co-concurrent relationships.

This data science technique is routinely used by firms like Amazon to understand items frequently purchased together, information which is then used to suggest additional purchases for customers.

“Just like Amazon-purchasing co-occurs, so do minerals co-occur on Earth’s surface in a very systematic way, so this is something we can easily characterise,” she explains.

The US Geological Survey (USGS), Mindat.org and the Deep Carbon Observatory, among others, have large databases that include mineral evolution data, geospatial information, mineral occurrence frequency, age and more, which scientists can use to generate models of mineral co-occurrence around the world.

From this information, mineralogists can then determine which other locations have the highest probability of having a previously unknown deposit of a mineral or mineral assemblage.

“For example, if we know that a certain combination of mineral species signals that there is likely a deposit of interest, we can determine where that deposit is most likely to exist,” Morrison explains.

“We can even determine the probability, for example, that there is a 2% chance you will find it there and a 96% chance you will find it somewhere else: I think that is really interesting for mining companies and several have already shown interest.”

Data sharing encourages smarter working

This work essentially extends on what is already being done by geologists and mineralogists, but on a much larger scale.

“In reality, we are not just looking at two parameters, but tens of thousands that affect a system – this is something that is really hard for any human brain to see in a spreadsheet or even using standard statistical methods,” says Morrison.

However, for advanced machine learning, using multi-varying techniques, a system can be characterised in an extremely multi-varied and multi-dimensional way.

“No one person can understand all the systems and go through all the data in a blink of an eye,” she adds.

By using these techniques, it is eventually hoped that mining companies will find commercially viable deposits much quicker and easier, and help them to know where they definitely shouldn’t be looking.

While algorithms can determine the probability of where certain deposits might be, they can’t yet determine how big they are.

This is something Morrison says they want to do in the future, but they will require more data.

“The issue is a lack of data on the amount of materials on Earth’s surface right now,” she says. “Mining companies have a lot of information but it is often held privately.”

Data sharing is often considered off-limits within mining firms for competition reasons. Yet Morrison believes if all data was shared it could ‘elevate everyone’ to work smarter.

“Mining companies are still going to have their competitive edge, whether that is buying-power or efficiency in operations or their location,” she adds. “Data is power and if we really want to understand where to explore and where not to we need to quantify everything.”

Read the full feature here