Amalgamation of Data Science and Cyber Security

 “32% of confirmed data breaches  and 86% of email attacks include phishing attacks and $24.26 billion were lost in 2018 due to card payment fraud.

Image Source: https://www.google.com/amp/s/www.digitalnest.in/blog/how-data-science-plays-an-important-role-in-addressing-cyber-security-risks/amp/

Before starting with anything, let's first talk about

WHAT IS DATA SCIENCE?

Data Science is the interdisciplinary field that deals with the programming, domain expertise, algorithm, scientific methods and systems to extract significant evidence and insights from either structured or unstructured data. The term interdisciplinary is used because it is the confluence of mathematics and statistics, computer science and business knowledge.


Image Source: https://www.techiexpert.com/difference-data-science-machine-learning/

Data is something that cannot be confined in limits, it has a vast category- images, text, videos, voice, speech, and many more.- all are data and are divided as structured, unstructured, and semi-structured. Today’s time, when the number of electrical gadgets is on rise, are throwing out a tremendous amount of data. With the Data Science, it has became feasible to process the data and yank a valuable treatment of the problems. It includes

Data Collection: Data is collected from various sources that are mandatory and related to the problem solving

Data Cleaning: It includes removing the incorrect data, handling NULL values and then normalized to be used.

Data Understanding: The data is need to be comprehended so that a proper analysis and training model could be deployed.

Data Training: It include creation of model with a proper algorithm and trained with the data sets.

Visualization: The outcome of the data is then visualized and the results are organized on basis.


Data Science in CYBERSECURITY

The data and Data Science is found useful in finding out the antidotes of various cyber threats. One such application of Data Science is in PHISHING DETECTION.

Phishing is the most common threat in cyber security where user himself permits the mal-actors to steal sensitive information. Refer to the link below, to know more about phishing.

Phishing- A guide to identification and prevention

Data Science can identify and preventing such phishing scams in a very straightforward way. It requires the collection of data from such previous attacks and flag the message as a scam or not. The labelled data might contain the attack vectors including the link, written message, source IP, data compromised, etc. in the datasets on the basis of these, comparing the message source, links, etc. the phishing domain are determined. It also encompasses the legitimate domain(non-phishing) data. The Logistic Regression model then simply flags the message as phishing or non-phishing. It is similar like that of scam filter used by the mail providers.

Another application of Data Science in cybersecurity is CREDIT CARD FRAUD DETECTION. The data related to credit card usage like,

The location where most of the time card is used

The average amount of money spent on the card

What purpose card is mostly used for

What is the frequency of card use, etc.

Such datasets can identify the card used in context if the card transaction is not followed by some of the data defined. It classifies the transaction as fraudulent or legitimate.

Further, the preventive measures could be applied from the ML model side like blocking the transaction or informing the legitimate user and cyber department about the transaction, its location and all.

It is not as simple as it looks like, the major trouble is collecting the valid data. It become very difficult to collect the post-attack as many hackers destroy the data like deleting the conspiracy site and leave no digital fingerprints.

Technology is playing its role and we have to play ours. Be aware of your surrounding threats and then only you can prevent yourself from such.


Comments

Aditya said…
You have very well described the use of data science in the field of security. Good work Abhay.
Unknown said…
ausm abhay ..Very Useful knowledge
dimple jha said…
Content is very important thanks for sharing.
Rachna said…
Such an useful information. Keep sharing dude!!!!!!
Ajeet Rai said…
Useful content for us..
Thanks for sharing 👍