Umeå University's logo

umu.sePublikasjoner
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Detection of Synthetic Climate Misinformation with Machine Learning Algorithms and Sentence-Level Analysis
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för datavetenskap.
2025 (engelsk)Independent thesis Basic level (degree of Bachelor), 10 poäng / 15 hpOppgave
Abstract [en]

The spread of climate-related misinformation can reduce public support for climate change mitigation policies. A study showed that on social media, people tend to absorb news content without knowing the details of the context. In that case, LLM can be utilised to spread misinformation, subsequently altering people's opinions for malicious purposes. To observe two machine learning algorithms: Support Vector Machine and Logistic Regression's capability to detect LLM-generated misinformation, we created a synthetic dataset, consisting of 300 examples. We have collected 150 climate-related news articles from various well-reputed sources to create the synthetic dataset. Then, we created a five to six-sentence summary based on the original article with the help of GPT-4. Each actual summary is falsified with the help of GPT-4 as well. Moreover, we evaluated each summary example from the synthetic dataset with the FineSure framework to obtain each summary's faithfulness, completeness and conciseness. The results showed that Support Vector Machine achieved an F1-score of 0.839, and Logistic Regression's F1-score was 0.787 on the synthetic dataset. We performed sentence-level analysis with the GUTEK framework on these models' false positive and negative examples. The sentence-level analysis with the GUTEK framework showed that policy-related sentences had the most impact on these models in predicting false positives. On the other hand, factual-related sentences significantly influenced these models to predict false negatives. 

sted, utgiver, år, opplag, sider
2025.
Serie
UMNAD ; 1576
HSV kategori
Identifikatorer
URN: urn:nbn:se:umu:diva-242908OAI: oai:DiVA.org:umu-242908DiVA, id: diva2:1988038
Utdanningsprogram
Bachelor of Science Programme in Computing Science
Examiner
Tilgjengelig fra: 2025-08-11 Laget: 2025-08-10 Sist oppdatert: 2025-08-11bibliografisk kontrollert

Open Access i DiVA

fulltext(2976 kB)144 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 2976 kBChecksum SHA-512
79d718865e25d5c5c7a8ee9afdb3c79c53e54f20d7b8b8d04a91d7f305f87ac7478e352468ad12e1055c8f3330c40f5df7f95edb6986856a53439830bb4fefdf
Type fulltextMimetype application/pdf

Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 144 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

urn-nbn

Altmetric

urn-nbn
Totalt: 751 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf