Fix html tags, close tags, repair bad quotes and more

This class can solve many problems coming from user generated html content or to fix html content before making some…

February 3, 2010

This class can solve many problems coming from user generated html content or to fix html content before making some hard work with your bots! (It’s specially usefull for web sites without the Html Tidy module of PHP).

Hre is a quick list of the magic things it can do.

  1. delete closed tags without their opening tag
  2. fix open tag without close, closing them automatically
  3. check bad nesting and fix them (if you have a bold inside a bold… or a paragrah that contains a table…)
  4. fix bad quotes in attributes (open quotes where missing…)
  5. merge different styles attributes in the same tag
  6. remove html comments
  7. remove empty tags and more bad tags

It works ina complex way since it analyzes the html code char by char and search for tags. When a tag is found start the work of cleaning attributes, then store data found in a matrix and search for the closing tags.
The data saved in the matrix are later used to re-build the correct fixed html.

EXAMPLE:
It’s very simple to use, suppose you have a variable with the dirty html:

$a = new HtmlFixer();
$clean = $a->getFixedHtml($dirty_html);

You can download the class from the HTML FIXER page.

Author

I'm a software engineer, an everyday web developer and a maker. I usually build sites with PHP, within or without WordPress. I build Internet of Things with Arduino and ESP8266. I'm the founder of Rockit.it and Dailybest.it and I'm actually the Chief Technical Officer of Better Days web agency.

Comments on “Fix html tags, close tags, repair bad quotes and more”

8 thoughts

  1. Savita says:

    Hi,

    This class is really helpful, just need to point some small issues…
    1) php short code is user <?, in place need to use this debug is false but still it shows all the debugging on the screen.

  2. admin says:

    Thank you Savita, probably I’ve uploaded the wrong file. I’ll fix it today!

  3. Reflexões says:

    Wow!!!

    This works perfeclty!!! fix all HTML tags….
    I lost much time fixing manually wrong HTML codes posted by users at a custom blog…

    Thx

  4. web says:

    I don’t have a php page, is there another way to fix his problem for .html pages?

  5. Lee Tung says:

    It is powerful! This is my need to fix HTML tags in PHP! Thanks admin!

  6. qinkun says:

    If there is a before ,it will delete the string before the end tag().
    e.g.
    something to dor u sure

    What is the result i wnat is :
    something to dor u sure

    What can i do in that case?

  7. Radu says:

    Bug?

    Enter the code below, replacing [ and ] with less than / greater than tags:

    $a = new CTS_HTMLFixer();
    echo $a->getFixedHtml(‘[font style=”color: rgb(153, 255, 51);” size=”4″]test[/font]’);

    ——-

    Result: [font style=”color:;” rgb(153, 255, 51);” size=”4″]test[/font]

    So it replaces the valid attribute style=”color: rgb(153, 255, 51);” with invalid HTML style=”color:;” rgb(153, 255, 51);”

  8. robcaa says:

    How can I disable comment delete?

Comments are closed

Recommended

Unobtrusive javascript to add an overlay banner

If you have to add an overlay to a site, here is a fast way to do it with an…

November 7, 2011

PHP parse url, mailto, and also twitter’s usernames and arguments

This small function receive a text as input and returns an html text with links if the source text contains…

March 10, 2010

OnKeyUp Fix Alphanumerical Chars

When you have an html form and you want only alphanumerical [a-z0-9] chars in your input, you can use this…

November 10, 2009

Highlight text for search results in PHP

Useful code to highlight text occurences in search results or in a text. How to highlight text in a string…

September 2, 2016

How to add rel=”nofollow” to links with preg_replace()

Adding rel="nofollow" to external link is a good SEO practice.

September 22, 2015

Block junk emails, spammers and temporary emails

If you need an Email Validator Function, consider this version that includes also the check against common temporary mail services…

December 3, 2013