Rekolha dataset email ho label spam (1) ka ham (0).
Ezeemplu:
| Email | Label |
|-------------------------|-------|
| "Dapatkan diskon 50%" | 1 |
| "Kabar baik untuk Anda" | 0 |
Etapa ba prosesu ne mak hanesan tuir mai nee:
Contoh: "Aku pergi beli!" → ["aku", "pergi", "beli"]
Iha etapa ida nee, ita sura fiture utiliza Bag-of-Words (BoW).
| Kata | Frekuensi |
|------------|-----------|
| diskon | 2 |
| hadiah | 1 |
| beli | 1 |
Ita sei kria modelu ANN ho input layaer, hidden layer, no output layer.
Contoh: Se ita uza fitur 3, mak layer input sei uza 3 neuron.
Ita sei sura output husi neuron ho funsaun aktivasi, hanesan sigmoid:
Output = 1 / (1 + e^(-z))
Iha nebe z = w1*x1 + w2*x2 + w3*x3 + b
Ezemplu Kalkulasaun:
Jika w1 = 0.2, w2 = 0.5, w3 = 0.1, b = -0.3
No x1=1, x2=0, x3=1 (frekuensia liafuan)
z = (0.2*1) + (0.5*0) + (0.1*1) - 0.3
= 0.2 + 0 + 0.1 - 0.3
= 0
Output = 1 / (1 + e^(-0)) = 0.5
Ita sei uza algoritma backpropagation hodi hafoun nia bobot.
Ezemplu:
w_new = w_old + α * (target - output) * output * (1 - output) * input
Dimana α = laju pembelajaran.
Ita uza metrik evaluasaun hanesan akurasi, presisi, dan recall.
Modelu bele implementa hodi detekta email spam.
Contoh: Se modelu detekta spam, mak email refere sei fosai nudar spam.