Data analysis and Geostatistics
Data analysis and Geostatistics
2025
Subjects covered in the course
data description:
measures of uncertainty:
missing values:
statistical testing:
regression & correlation:
multivariate techniques:
spatial data analysis:
mean - median - mode, histograms, normality, outliers, modality, box and whiskers plots, stem and leaf diagrams
sources of uncertainty, range, standard deviation, variance, inter-quartile range, error propagation
common problem in geology and generally ignored - real missing values vs. detection limits, and how to deal with missing values
hypotheses, confidence levels, value and rank testing, Z-, t-, Chi-squared, Kolmogorov-Smirnov, Mann-Whitney tests
Scatter diagrams, Pearson & Spearman correlation coefficients, significance of correlation, curve fitting, (non-)linear models
sum of squares methodology, discriminant function analysis, prin-ciple component & factor analysis, cluster analysis
spatial distribution of data, 3D visualization (isolines, bubble plots, trend surfaces), semi-variograms, kriging
All these aspects will be addressed both in the lectures and in practical sessions. However, you should also consult the book as it provides useful background, examples and more in-depth discussion of these subjects.
Copyright: Vincent van Hinsberg & Simon Vriend
Last updated: September 2025
McGill policy statements
Usage of machine learning and AI tools
AI and machine learning tools have the potential to transform (statistical) data analysis, making it faster and more effective, and allowing application of more diverse and advanced methods, thereby gaining a deeper understanding of your data.
However, AI is susceptible to hallucination and it can be hard to understand how it arrived at its answer. Moreover, results may not be robust or meaningful.
AI and machine learning are therefore a valid (and potentially powerful) aid in your data analysis, but you remain responsible for critically evaluating its output. Questions you should ask yourself include: Is the output correct; Is the output meaningful; Are fits and models robust; Do I understand how it arrived at the output/conclusions; Does the analysis meet data and method requirements; Is it the most appropriate method; etc.
Generative AI tools are therefore allowed in this course, including for the assessed Data analysis project, but any results obtained by using such tools must be attributed (e.g. type of tool and prompts used) and output has to be critically assessed (e.g. through method evaluation or test-set verification).
“McGill University values academic integrity. Therefore, all students must understand the meaning and consequences of cheating, plagiarism and other academic offences under the Code of Student Conduct and Disciplinary Procedures (see www.mcgill.ca/students/srr/honest/ for more information)”
“In accord with McGill University’s Charter of Students’ Rights, students in this course have the right to submit in English or in French any written work that is to be graded”
“Work submitted for evaluation as part of this course may be checked with text matching software within myCourses.”
“In the event of extraordinary circumstances beyond the University’s control, the content and/or evaluation scheme in this course is subject to change.”
“Instructor generated course materials (e.g., handouts, notes, summaries, exam questions, etc.) are protected by law and may not be copied or distributed in any form or in any medium without explicit permission of the instructor. Note that infringements of copyright can be subject to follow up by the University under the Code of Student Conduct and Disciplinary Procedures”.
“You are reminded of your responsibility in ensuring that this content and associated material are not reproduced or placed in the public domain. This means that it can be used for your educational purposes, but you cannot allow others to use it by putting it up on the Internet or by giving it or selling it to others who may also copy it and make it available. Please refer to McGill’s Guidelines for Instructors and Students on Remote Teaching and Learning for further information.”