Results 1 to 4 of 4

Thread: How to remove HTML tags from Excel Sheet

  1. #1
    Join Date
    Jan 2012

    How to remove HTML tags from Excel Sheet

    I have a worksheet containing text extracted from a web page that still includes html markup tags, or example <BR>, <font etc and all the other normal html markup codes. This is stored inside an EXCEL text cell. The rest of the Worksheet contains normal text, numeric and date data. Does anyone please know a way to automatically strip out all of the html markup tags from the single column that I have within this, whilst leaving the rest of the worksheet unaffected? Many thanks if you can help solve this little problem

  2. #2
    Join Date
    Jul 2011

    Re: How to remove HTML tags from Excel Sheet

    I assume the text is in a normal cell or do you mean a text box. I tried a little test with find and replace and find <*> and in the replace box leave blank /replace all.

  3. #3
    Join Date
    Jan 2012

    Re: How to remove HTML tags from Excel Sheet

    Unfortunately your suggested solution didnt work. The problem is that the text contained in the EXCEL cell (I have tried with both General and with text format) can contain up to 1500 - 2000 characters distributed into several hundred words. The overall EXCEL .xls file is about 10 MB size, and contains about 3000 rows with up to 17 columns. After trying your replace <*> trick just on the column gives me an error message saying 'Formula is too long'. Importing the whole .xls file into ACCESS, and trying the same replace operation on the column in question also gives an ACCESS error: 'There isn't enough free memory to continue the search.

  4. #4
    Join Date
    Aug 2011

    Re: How to remove HTML tags from Excel Sheet

    Just tried it in this way. First copy your values then Mark the spot where you want to add them and then Edit > Paste 4th Values option. Another note, also used more often because: hyperlinks from the text go to the other column:
    Function GetAddress (hyperlink cell As Range) 
    GetAddress = Replace _ 
    (HyperlinkCell.Hyperlinks (1) Address, "mailto:". '") 
    End Function
    Alt + F11 and Insert> Module and paste code. Alt + Q and save. The feature is found under "Custom" in the dialog box (Shift + F3). GetAddress = (A1) Where cell A1 for example has a hyperlink.

Similar Threads

  1. Replies: 2
    Last Post: 23-02-2012, 01:59 PM
  2. How to remove or sort the data order in Excel sheet
    By Ramanujan in forum MS Office Support
    Replies: 2
    Last Post: 15-02-2012, 07:12 PM
  3. Unable to remove smart tags from Excel
    By GurdeepS in forum Windows Software
    Replies: 4
    Last Post: 25-02-2010, 08:13 AM
  4. html tags in windows
    By b.venu in forum Operating Systems
    Replies: 1
    Last Post: 17-09-2009, 12:28 PM
  5. About Hidden tags in HTML
    By OmiWalia in forum Software Development
    Replies: 3
    Last Post: 06-08-2009, 10:45 PM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts