Photo de Touza Isaac

Touza Isaac

Enseignant, Développeur et Doctorant en Informatique

Web Content Analyzer

Web Content Analyzer

Technologies and Editors Utilized

Xml Java

Web Content Analyzer is an Android application designed to classify web pages by analyzing their textual content. This tool is ideal for users who need to quickly determine the category of a web page based on its content.

App Features:

  • Analyze the Page from its URL: Calculate the number of words and their occurrences, identify the longest sentence, and gather other textual data.
  • Extract Text from the Page: Pull out all textual content from the given web page.
  • Extract All Images from the Page: Retrieve and display all images found on the web page.
  • Extract Links Found on the Page: List all hyperlinks present within the web page.
  • Analyze the Page: Perform a detailed analysis of the web page content.
  • Determine the Category of the Page: Classify the web page into categories such as sports, health, religion, technology, education, and more.
  • Consult the History of Analyzed Pages: Keep track of previously analyzed pages for easy reference.

Screenshots

Changelog

Full Changelog: https://github.com/Touza-Isaac/Web-Content-Analyzer/commits/v1.0-beta

This project showcases the power of combining Java and XML to create a robust application capable of in-depth web content analysis and categorization. Whether you are a developer looking to enhance your skills or someone interested in understanding web page content better, Web Content Analyzer provides the tools you need.

Retour à la liste des projets