ITx Rutherford 2019 Speakers

Keynotes and Speakers for ITx Rutherford

Check back often - more speakers are being added regularly

Noor Alani

Eastern Institute of Technology

Noor is an academic staff member at the School of Computing – EIT. He obtained Ph.D. from AUT in 2019 specialised in computer vision models to improve traffic and pedestrian safety. His research interests include machine vision, emerging technologies, and data analytics. In 2016 he had a visiting position at Wuhan University, China. He received numerous awards including the Akria Nakamura awards for 2017, and 2018 from AUT, the “Best Student Paper Award” at CAIP 2017 – Springer, and best presentation award from ICCAR 2018 - IEEE.

Improved Feature Selection and Ensemble Learning For Cervical Cancer Assessment

Wednesday 12:00pm - 12:30pm, CITRENZ (

Choosing the right influencing feature is a challenging field in data science due to the presence and complexity of multi-dimensional data. Cervical cancer is an excellent example for such study, as well as impacting individuals and families, presents almost no-symptoms at the early stages of development of this condition. Because multi-factors may be involved, this demands a lot of research and analysis to identify causative or linked features. The researchers have applied and optimised an ensemble learning algorithm as it is the best model for multi-modal medical data when relatively high dimensionality is present. The main objective of this study was to minimize the dependency on data pre-processing techniques, whilst analysing the data (filling/ignoring missing values with the statistical method). Main factors were studied and validated using Root Mean Square Error (RSME) and Mean Absolute Error (MAE).
The classification accuracy for features were obtained by 10-fold cross-validation and test (where 66% is training data and 34% test data). The data was obtained from the UCI machine learning repository. WEKA and MATLAB were used to identify features. SPSS and SAS were used for RMSE and MAE. This approach is generic, and may also be applied to any relevant dataset for other purposes, and for teaching data analytics.
Keywords: Feature selection, ensemble learning, data mining, machine learning, models, HPV, WEKA, MATLAB.