Sunday, 15 December 2024

Applications of Big Data in Business

In class we learned how businesses big and small utilise Big Data in a business environment.

Amazon, one of the biggest retailers in the world rely on Big Data to promote products. This is done by comparing searches, clicks on similar products as well as using geological information to target region-specific requirements. For example, someone who bought a notebook will most likely require stationary. As for the region, It will promote warm winter clothing in areas with cold climates while it would be less likely to promote shorts or tank tops. This is done through algorithms that compare other users buying habits. There are many factors at play, but the basic ones would be age, sex, occupation, and lifestyle. Reviews are also a big part as well. Poorly rated products are less likely to be promoted. Big Data is used to maximise profits and encourage consumers to buy more.

Facebook also uses Big Data. Likes, comments, and cookies provide valuable data that can be extracted. This alone can be used to build a picture of the type of person is behind the keyboard, even if little external information is given. This can be quite intimidating especially if it is used maliciously, however Facebook mostly uses this data to target advertisements and make recommendations. Furthermore, they use it to analyse trends in user behaviour. 

Google utilises Big Data in Google Maps. The algorithms identify ideal routes to take with the shortest journey time. This is imperfect however as in some cases it leads to roads that have been closed off temporarily.

Smaller businesses can also utilise Big Data, not just the global giants. In fact, they should use it to gain the upper hand against competitors. Whether it is to find out trends in customer spending habits or find ways to better promote their business, it can be an effective tool in gaining the advantage in a business environment where competition is tight. 

References:

https://ilearn.fife.ac.uk/course/view.php?id=9751#section-7

Applications of Big Data in Society

In class we learned about how Big Data is used in Society today.

People in society naturally generate a lot of data, whether it is through social media, e-commerce or the websites they access. This information can be useful to government institutions, allowing them to track citizens behaviour and influence them in different ways.

One application would be the GDELT project (Global Database of Events, Language and Tone), which maps a number of themes on society, particularly media, that spans across the world. This tracks human sentiments and creates a network of events happening on a global scale.

Education would be another factor. Big Data can look at students test results and highlight those who need extra support. This might seem insignificant, however multiple students dropping out of education will contribute to a larger societal issue. By providing extra support highlighted by these algorithms, these numbers will decrease. Furthermore, the same can be applied to the courses themselves, those with low pass rates may indicate the need for further funding and development.

Crime is another application in which Big Data has been implemented. Records of previous crimes and their locations is vital information. Comparing this data on a graph can show key areas where crime rates are higher and more police are required. This is currently a work in progress however, as some criticisms have come forward arguing that predictive policing is seen as racist and targets certain communities. Fraud detection would be another application, as algorithms can be trained to look for certain anomalies online. 

References:

https://ilearn.fife.ac.uk/course/view.php?id=9751#section-9

Applications of Big Data in Science

In class we learned about how Big Data can be used in science. 

Weather forecasting has become more accurate and efficient due to the rise of Big Data. The time between forecasting has been reduced from over a week to just under 3 days. This is because Big Data uses predictive algorithms to learn weather patterns, giving insight into weather forecasts much sooner than alternative methods. It will only get more accurate over time as the algorithms gather more data, particularly on extreme weather patterns which are not as predictable. 

Healthcare has also benefitted from the rise of Big Data. It allows people to receive diagnoses much faster, but also creating new links between symptoms and illness. For example, a new study showed a link between the retina and diabetes. When an algorithm is shown multiple pictures of eyes under a powerful camera, it can distinguish who is most likely to be vulnerable to illnesses like diabetes. How cool is that? It also played a crucial role during the COVID-19 pandemic where it was able to track outbreaks and predict the number of cases in certain regions.

Another application would be in the military, particularly in the development of new technologies and analysis of patterns. DARPA utilises Big Data in managing their defense on cyber attacks, encryption, and in their engineering. It plays a crucial part particularly in the defence of countries, but has many other uses. Data from sattelites can provide vital information on the battlefield if analysed correctly, furthermore Big Data can also analyse social media platforms to judge morale.

References:

DARPA and Data: a Portfolio Overview. (n.d.). Docslib. https://docslib.org/doc/626715/darpa-and-data-a-portfolio-overview
https://ilearn.fife.ac.uk/course/view.php?id=9751#section-8

Wednesday, 4 December 2024

Characteristics of Big Data

In class we learned about the key components of Big Data and what variables apply to it. Each of these measures analyse the effectiveness of data sets, and are essential in having accurate, reliable data.

Big Data can be measured by using 'The 7 V's of Big Data' which include:

  • Volume
  • Velocity
  • Variety
  • Veracity
  • Value
  • Variability
  • Visualisations
Volume refers to the size of the data sample, the scale on which it represents. Data projections have shown a dramatic increase in the volume, growing exponentially every year, which brings us onto the next point:

Velocity is the rate at which new data is being generated. Institutions must constantly upgrade and ensure they have the capacity to store data being put into their systems. Bigger measures for data storage are being used in today's world to accommodate the increasing velocity, with exabytes and zettabytes becoming more common on the large scale.

Variety is the different types of data collected. It can be structured, semi-structured, and unstructured. These refer to how easily data is to analyse, with structured being the easiest and most effective for analysis while unstructured is incredibly difficult to analyse. This can also refer to the sources in which data is acquired. Some examples include but are not limited to science, business, and government statistics.

Veracity is the term that represents the accuracy of data. In other words, its a test of how reliable the data is. This is crucial when analysing data because it isn't accurate, the end result will not be useful whatsoever. Big Data should always use data sets that are as accurate and relevant as possible. After all. if the data cannot be trusted, then why should we use it? No data set is100% accurate, however recent measures have ensured that it is as close to 100% as it has ever been.

Value refers to how useful data is, and how organisations can use the data after its value is extracted. If the data can be used, it automatically has value, however data can be used in different applications and therefore some data may be more valuable than others. Data can be used in many different ways, but a business, for example, could find value in customer data showing what products should be targeted, how to improve products, and providing valuable feedback on certain products, to name a few.

Variability is similar to veracity but slightly different. It looks at the consistency of data and the real meaning behind it. Some data may have a different meaning than what is originally intended. If inconsistencies are not found it can greatly impact the accuracy of results. 

Visualisations refers to the way data can be displayed and represented. This is commonly done through charts and graphs and makes information more readable in contrast to looking at data in a table. The format in which data is displayed makes data easy to comprehend.

We made a poster in class to represent all of these terms in class. How cool is this? (Admittedly it could be a little better)


References:

https://ilearn.fife.ac.uk/course/view.php?id=9751#section-5

Additional Comments

Overall I have really enjoyed learning about Big Data and never realised how important of a role it plays in everyday life. From learning ab...