Package Torello.HTML

Class MalformedHTMLException

  • All Implemented Interfaces:
    Direct Known Subclasses:

    public class MalformedHTMLException
    extends java.lang.Exception
    This Exception may be thrown by code that checks the validity of an HTML Page Vector.

    This scrape package loads HTML pages into page Vector's, and does not check DOM-Tree styled validity warnings.

    This is a "vectorized" approach to HTML. The good part of loading pages to Vector's is that HTML-trees are really notoriously bad for analysing anything about the content of the page - other than for looking up answers, numbers, or a catch-phrase here or there. This package was developed to translate foreign-news articles, but could easily be used for parsing or reading any HTML-page on the internet. As such, no HTML-trees are built, and therefore validity checking is not performed by this package. This does mean articles are never transformed, nor changed, all the parser does is load tokens to an array-like Vector.

    Loading content to a tree, and checking for validity, and performing suggestions and modifications might come from a later package-development, but for the time being, this type of analysis would only make the project much more difficult to read, and not provide a lot of benefit in the realm of web-sites who don't have "poorly formed HTML" content problems.

    NOTE: I have left this class here because it is used once or twice by class "Elements", and because perhaps a programmer might wish to do validity checking.
    See Also:
    Serialized Form

    • Field Summary

      Serializable ID
      Modifier and Type Field
      static long serialVersionUID
    • Method Summary

      • Methods inherited from class java.lang.Throwable

        addSuppressed, fillInStackTrace, getCause, getLocalizedMessage, getMessage, getStackTrace, getSuppressed, initCause, printStackTrace, printStackTrace, printStackTrace, setStackTrace, toString
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
    • Field Detail

      • serialVersionUID

        public static final long serialVersionUID
        This fulfils the SerialVersion UID requirement for all classes that implement Java's interface Using the Serializable Implementation offered by java is very easy, and can make saving program state when debugging a lot easier. It can also be used in place of more complicated systems like "hibernate" to store data as well.

        Note that Java's java.lang.Exception and java.lang.Error classes implement the Serializable interface, and a warning-free build expects this field be defined here.
        See Also:
        Constant Field Values
        Exact Field Declaration Expression:
         public static final long serialVersionUID = 1;
    • Constructor Detail

      • MalformedHTMLException

        🡅  🡇    
        public MalformedHTMLException​(java.lang.String message)
        Constructs a MalformedHTMLException with the specified detail message.
        message - the detail message.
      • MalformedHTMLException

        🡅  🡇    
        public MalformedHTMLException​(java.lang.String message,
                                      java.lang.Throwable cause)
        Constructs a new exception with the specified detail message and cause.

        NOTE: The detail message associated with cause is not automatically incorporated in this exception's detail message.
        message - The detail message (which is saved for later retrieval by the Throwable.getMessage() method).
        cause - the cause (which is saved for later retrieval by the Throwable.getCause() method). (A null value is permitted, and indicates that the cause is nonexistent or unknown.)
      • MalformedHTMLException

        public MalformedHTMLException​(java.lang.Throwable cause)
        Constructs a new exception with the specified cause and a detail message of (cause==null ? null : cause.toString()) (which typically contains the class and detail message of cause). This constructor is useful for exceptions that are little more than wrappers for other throwables.
        cause - The cause (which is saved for later retrieval by the Throwable.getCause() method). (A null value is permitted, and indicates that the cause is nonexistent or unknown.)