BIG DATA MANAGEMENT ON THE CLOUD

CSE3BGX

2019

Credit points: 15

Subject outline

Companies are acquiring massive amounts of data while also providing internet-based services to millions of people. This is extremely challenging because of the scale of the data involved and the huge number of concurrent user requests. In this subject we will study current state-of-the-art technologies for analysing huge amounts of data and responding to millions of user requests within a second. Currently, the most cost-efficient way of achieving this is to use large-scale cloud-based services offered by vendors such as Amazon, Google, IBM and Microsoft. We will study how to use the cloud services provided by these vendors to meet the big data needs of businesses. In particular, this subject covers the following topics: cloud architectures, parallel database systems, MapReduce, key-value stores, transaction support in the cloud, virtualization, and multi-tenant database systems.

School: School of Engineering and Mathematical Sciences

Credit points: 15

Subject Co-ordinator: Rabei Alhadad

Available to Study Abroad Students: No

Subject year level: Year Level 3 - UG

Exchange Students: No

Subject particulars

Subject rules

Prerequisites: Must be admitted in SBAIO or SBACTO and have passed CSE2DBX or CSE2DCX or CSE1OFX and CSE1IOX

Co-requisites: N/A

Incompatible subjects: N/A

Equivalent subjects: N/A

Special conditions: N/A

Learning resources

Readings

Resource Type | Title | Resource Requirement | Author and Year | Publisher
Readings | Hadoop: The Definitive Guide | Recommended | White T., 2015 | O'Reilly Media
Readings | Hadoop in Action | Recommended | Lam C., Davis M., Gaddam A., 2016 | Manning
Readings | Big Data Management on the Cloud | Recommended | Didasko Digital, 2018 | Didasko

Graduate capabilities & intended learning outcomes

01. Compare and contrast the benefits of using cloud computing over traditional methods of managing big data for clients.

Activities:
Webinar presentation and open discussions on cloud computing. Self-checking open-ended questions to support the activities.

02. Critically evaluate the best type of cloud-based service to use for a particular application scenario.

Activities:
Webinars with open discussion forums on the evaluation of the three types of cloud-based services. Knowledge assessed via online quizzes and self-paced online problems.

03. Design and develop efficient frameworks such as MapReduce to analyse large data sets.

Activities:
Webinars and practical activity on MapReduce programs.

04. Implement cloud-hosted database systems on a cloud computing platform.

Activities:
Webinars and practical activity on setting up cloud-based database systems.

05. Develop efficient programs that query cloud-hosted database systems.

Activities:
Webinars and practical activity on querying cloud-hosted database systems.

Subject options

Online (Didasko), 2019, Study Block 1, Online

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Enrolment information

Subject Instance Co-ordinator: Zhen He

Class requirements

Unscheduled Online Class: Week 02 - 13
One 3.0-hour unscheduled online class per week, on any day including weekends, during the day, from week 02 to week 13, delivered online.

Assessments

Assessment element | Comments | % | ILO*
Online test (30 minutes) (equivalent to 500 words) | Multiple-choice and/or short-answer questions on cloud computing. Test to be conducted in Week 5. | 15 | 01, 02
Practical scenario-based report on processing of big data sets (equivalent to 1500 words) | A practical scenario-based report on processing big data sets using the MapReduce programming model. | 30 | 03
Practical scenario-based report on cloud-hosted database systems (equivalent to 1500 words) | A practical scenario-based report on setting up and querying cloud-based database systems. | 30 | 04, 05
Online subject test (60 minutes) (equivalent to 1000 words) | Multiple-choice and/or short-answer test covering the theoretical knowledge. | 25 | 03, 04, 05

Online (Didasko), 2019, Study Block 2, Online

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Enrolment information

Subject Instance Co-ordinator: Zhen He

Class requirements

Unscheduled Online Class: Week 06 - 17
One 3.0-hour unscheduled online class per week, on any day including weekends, during the day, from week 06 to week 17, delivered online.

Assessments

Assessment element | Comments | % | ILO*
Online test (30 minutes) (equivalent to 500 words) | Multiple-choice and/or short-answer questions on cloud computing. Test to be conducted in Week 5. | 15 | 01, 02
Practical scenario-based report on processing of big data sets (equivalent to 1500 words) | A practical scenario-based report on processing big data sets using the MapReduce programming model. | 30 | 03
Practical scenario-based report on cloud-hosted database systems (equivalent to 1500 words) | A practical scenario-based report on setting up and querying cloud-based database systems. | 30 | 04, 05
Online subject test (60 minutes) (equivalent to 1000 words) | Multiple-choice and/or short-answer test covering the theoretical knowledge. | 25 | 03, 04, 05

Online (Didasko), 2019, Study Block 3, Online

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Enrolment information

Subject Instance Co-ordinator: Zhen He

Class requirements

Unscheduled Online Class: Week 10 - 21
One 3.0-hour unscheduled online class per week, on any day including weekends, during the day, from week 10 to week 21, delivered online.

Assessments

Assessment element | Comments | % | ILO*
Online test (30 minutes) (equivalent to 500 words) | Multiple-choice and/or short-answer questions on cloud computing. Test to be conducted in Week 5. | 15 | 01, 02
Practical scenario-based report on processing of big data sets (equivalent to 1500 words) | A practical scenario-based report on processing big data sets using the MapReduce programming model. | 30 | 03
Practical scenario-based report on cloud-hosted database systems (equivalent to 1500 words) | A practical scenario-based report on setting up and querying cloud-based database systems. | 30 | 04, 05
Online subject test (60 minutes) (equivalent to 1000 words) | Multiple-choice and/or short-answer test covering the theoretical knowledge. | 25 | 03, 04, 05

Online (Didasko), 2019, Study Block 4, Online

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Enrolment information

Subject Instance Co-ordinator: Rabei Alhadad

Class requirements

Unscheduled Online Class: Week 14 - 25
One 3.0-hour unscheduled online class per week, on any day including weekends, during the day, from week 14 to week 25, delivered online.

Assessments

Assessment element | Comments | % | ILO*
Online test (30 minutes) (equivalent to 500 words) | Multiple-choice and/or short-answer questions on cloud computing. Test to be conducted in Week 5. | 15 | 01, 02
Practical scenario-based report on processing of big data sets (equivalent to 1500 words) | A practical scenario-based report on processing big data sets using the MapReduce programming model. | 30 | 03
Practical scenario-based report on cloud-hosted database systems (equivalent to 1500 words) | A practical scenario-based report on setting up and querying cloud-based database systems. | 30 | 04, 05
Online subject test (60 minutes) (equivalent to 1000 words) | Multiple-choice and/or short-answer test covering the theoretical knowledge. | 25 | 03, 04, 05

Online (Didasko), 2019, Study Block 5, Online

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Enrolment information

Subject Instance Co-ordinator: Rabei Alhadad

Class requirements

Unscheduled Online Class: Week 19 - 30
One 3.0-hour unscheduled online class per week, on any day including weekends, during the day, from week 19 to week 30, delivered online.

Assessments

Assessment element | Comments | % | ILO*
Online test (30 minutes) (equivalent to 500 words) | Multiple-choice and/or short-answer questions on cloud computing. Test to be conducted in Week 5. | 15 | 01, 02
Practical scenario-based report on processing of big data sets (equivalent to 1500 words) | A practical scenario-based report on processing big data sets using the MapReduce programming model. | 30 | 03
Practical scenario-based report on cloud-hosted database systems (equivalent to 1500 words) | A practical scenario-based report on setting up and querying cloud-based database systems. | 30 | 04, 05
Online subject test (60 minutes) (equivalent to 1000 words) | Multiple-choice and/or short-answer test covering the theoretical knowledge. | 25 | 03, 04, 05

Online (Didasko), 2019, Study Block 6, Online

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Enrolment information

Subject Instance Co-ordinator: Rabei Alhadad

Class requirements

Unscheduled Online Class: Week 23 - 34
One 3.0-hour unscheduled online class per week, on any day including weekends, during the day, from week 23 to week 34, delivered online.

Assessments

Assessment element | Comments | % | ILO*
Online test (30 minutes) (equivalent to 500 words) | Multiple-choice and/or short-answer questions on cloud computing. Test to be conducted in Week 5. | 15 | 01, 02
Practical scenario-based report on processing of big data sets (equivalent to 1500 words) | A practical scenario-based report on processing big data sets using the MapReduce programming model. | 30 | 03
Practical scenario-based report on cloud-hosted database systems (equivalent to 1500 words) | A practical scenario-based report on setting up and querying cloud-based database systems. | 30 | 04, 05
Online subject test (60 minutes) (equivalent to 1000 words) | Multiple-choice and/or short-answer test covering the theoretical knowledge. | 25 | 03, 04, 05

Online (Didasko), 2019, Study Block 7, Online

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Enrolment information

Subject Instance Co-ordinator: Rabei Alhadad

Class requirements

Unscheduled Online Class: Week 27 - 38
One 3.0-hour unscheduled online class per week, on any day including weekends, during the day, from week 27 to week 38, delivered online.

Assessments

Assessment element | Comments | % | ILO*
Online test (30 minutes) (equivalent to 500 words) | Multiple-choice and/or short-answer questions on cloud computing. Test to be conducted in Week 5. | 15 | 01, 02
Practical scenario-based report on processing of big data sets (equivalent to 1500 words) | A practical scenario-based report on processing big data sets using the MapReduce programming model. | 30 | 03
Practical scenario-based report on cloud-hosted database systems (equivalent to 1500 words) | A practical scenario-based report on setting up and querying cloud-based database systems. | 30 | 04, 05
Online subject test (60 minutes) (equivalent to 1000 words) | Multiple-choice and/or short-answer test covering the theoretical knowledge. | 25 | 03, 04, 05

Online (Didasko), 2019, Study Block 8, Online

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Enrolment information

Subject Instance Co-ordinator: Rabei Alhadad

Class requirements

Unscheduled Online Class: Week 32 - 43
One 3.0-hour unscheduled online class per week, on any day including weekends, during the day, from week 32 to week 43, delivered online.

Assessments

Assessment element | Comments | % | ILO*
Online test (30 minutes) (equivalent to 500 words) | Multiple-choice and/or short-answer questions on cloud computing. Test to be conducted in Week 5. | 15 | 01, 02
Practical scenario-based report on processing of big data sets (equivalent to 1500 words) | A practical scenario-based report on processing big data sets using the MapReduce programming model. | 30 | 03
Practical scenario-based report on cloud-hosted database systems (equivalent to 1500 words) | A practical scenario-based report on setting up and querying cloud-based database systems. | 30 | 04, 05
Online subject test (60 minutes) (equivalent to 1000 words) | Multiple-choice and/or short-answer test covering the theoretical knowledge. | 25 | 03, 04, 05

Online (Didasko), 2019, Study Block 9, Online

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Enrolment information

Subject Instance Co-ordinator: Rabei Alhadad

Class requirements

Unscheduled Online Class: Week 36 - 47
One 3.0-hour unscheduled online class per week, on any day including weekends, during the day, from week 36 to week 47, delivered online.

Assessments

Assessment element | Comments | % | ILO*
Online test (30 minutes) (equivalent to 500 words) | Multiple-choice and/or short-answer questions on cloud computing. Test to be conducted in Week 5. | 15 | 01, 02
Practical scenario-based report on processing of big data sets (equivalent to 1500 words) | A practical scenario-based report on processing big data sets using the MapReduce programming model. | 30 | 03
Practical scenario-based report on cloud-hosted database systems (equivalent to 1500 words) | A practical scenario-based report on setting up and querying cloud-based database systems. | 30 | 04, 05
Online subject test (60 minutes) (equivalent to 1000 words) | Multiple-choice and/or short-answer test covering the theoretical knowledge. | 25 | 03, 04, 05

Online (Didasko), 2019, Study Block 10, Online

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Enrolment information

Subject Instance Co-ordinator: Rabei Alhadad

Class requirements

Unscheduled Online Class: Week 41 - 52
One 3.0-hour unscheduled online class per week, on any day including weekends, during the day, from week 41 to week 52, delivered online.

Assessments

Assessment element | Comments | % | ILO*
Online test (30 minutes) (equivalent to 500 words) | Multiple-choice and/or short-answer questions on cloud computing. Test to be conducted in Week 5. | 15 | 01, 02
Practical scenario-based report on processing of big data sets (equivalent to 1500 words) | A practical scenario-based report on processing big data sets using the MapReduce programming model. | 30 | 03
Practical scenario-based report on cloud-hosted database systems (equivalent to 1500 words) | A practical scenario-based report on setting up and querying cloud-based database systems. | 30 | 04, 05
Online subject test (60 minutes) (equivalent to 1000 words) | Multiple-choice and/or short-answer test covering the theoretical knowledge. | 25 | 03, 04, 05

Online (Didasko), 2019, Study Block 11, Online

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Enrolment information

Subject Instance Co-ordinator: Rabei Alhadad

Class requirements

Unscheduled Online Class: Week 45
One 3.0-hour unscheduled online class per week, on any day including weekends, during the day, in week 45, delivered online.

Assessments

Assessment element | Comments | % | ILO*
Online test (30 minutes) (equivalent to 500 words) | Multiple-choice and/or short-answer questions on cloud computing. Test to be conducted in Week 5. | 15 | 01, 02
Practical scenario-based report on processing of big data sets (equivalent to 1500 words) | A practical scenario-based report on processing big data sets using the MapReduce programming model. | 30 | 03
Practical scenario-based report on cloud-hosted database systems (equivalent to 1500 words) | A practical scenario-based report on setting up and querying cloud-based database systems. | 30 | 04, 05
Online subject test (60 minutes) (equivalent to 1000 words) | Multiple-choice and/or short-answer test covering the theoretical knowledge. | 25 | 03, 04, 05

Online (Didasko), 2019, Study Block 12, Online

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Enrolment information

Subject Instance Co-ordinator: Rabei Alhadad

Class requirements

Unscheduled Online Class: Week 49
One 3.0-hour unscheduled online class per week, on any day including weekends, during the day, in week 49, delivered online.

Assessments

Assessment element | Comments | % | ILO*
Online test (30 minutes) (equivalent to 500 words) | Multiple-choice and/or short-answer questions on cloud computing. Test to be conducted in Week 5. | 15 | 01, 02
Practical scenario-based report on processing of big data sets (equivalent to 1500 words) | A practical scenario-based report on processing big data sets using the MapReduce programming model. | 30 | 03
Practical scenario-based report on cloud-hosted database systems (equivalent to 1500 words) | A practical scenario-based report on setting up and querying cloud-based database systems. | 30 | 04, 05
Online subject test (60 minutes) (equivalent to 1000 words) | Multiple-choice and/or short-answer test covering the theoretical knowledge. | 25 | 03, 04, 05