Evgenii Legotckoi
Jan. 13, 2017, 8:15 p.m.

Qt/C++ - Lesson 058. Syntax highlighting of HTML code in QTextEdit

Some time ago, I was engaged in the study of syntax highlighting in QTextEdit and practiced on the syntax hightlighting for HTML code. As a result, able to do a pretty good variant of syntax highlighting of HTML code, but due to the fact that there is a possibility that this code will not be applied by me elsewhere, I decided to lay out the data code example.

Syntax highlighting HTML to QTextEdit will be as follows:


Project structure

The project consists of the following files:

  • HTMLExample.pro - the profile of the project;
  • main.cpp - the start project file;
  • mainwindow.h - header file of the application window;
  • mainwindow.cpp - file source code of the application window;
  • mainwindow.ui - interface file;
  • HTMLHighLighter.h - class header file to highlight HTML code;
  • HTMLHighLighter.cpp - file source code for the class lights HTML code;

main.cpp, HTMLExample.pro - создаются по умолчанию, в mainwindow.ui добавляем только QTextEdit .

mainwindow.h

Here include the header file of the HTMLHighLighter an define its object.

  1. #ifndef MAINWINDOW_H
  2. #define MAINWINDOW_H
  3.  
  4. #include <QMainWindow>
  5. #include "HTMLHighLighter.h"
  6.  
  7. namespace Ui {
  8. class MainWindow;
  9. }
  10.  
  11. class MainWindow : public QMainWindow
  12. {
  13. Q_OBJECT
  14.  
  15. public:
  16. explicit MainWindow(QWidget *parent = 0);
  17. ~MainWindow();
  18.  
  19. private:
  20. Ui::MainWindow *ui;
  21. HtmlHighLighter *m_htmlHightLighter;
  22. };
  23.  
  24. #endif // MAINWINDOW_H

mainwindow.cpp

And in this file simply set HTMLHighLighter object in the document object of QTextEdit.

  1. #include "mainwindow.h"
  2. #include "ui_mainwindow.h"
  3.  
  4. MainWindow::MainWindow(QWidget *parent) :
  5. QMainWindow(parent),
  6. ui(new Ui::MainWindow)
  7. {
  8. ui->setupUi(this);
  9. m_htmlHightLighter = new HtmlHighLighter(ui->textEdit->document());
  10. }
  11.  
  12. MainWindow::~MainWindow()
  13. {
  14. delete ui;
  15. }

HTMLHighLighter.h

The feature code syntax highlighting or just text in the QTextEdit is that QSyntaxtHighLighter class through all text blocks, which are divided text (divided by a newline), and determines how to highlight the current block from the start, depending on the state of the backlight of the previous text block.

Naturally it is necessary to implement. In the class constructor will be initialized rules highlight different parts of the code and templates, which will be determined by the part of the code. And in the method highlightBlock(const QString & text) , will be implemented, the processing logic of the text.

  1. #ifndef HTMLHIGHLIGHTER_H
  2. #define HTMLHIGHLIGHTER_H
  3.  
  4. #include <QSyntaxHighlighter>
  5.  
  6. QT_BEGIN_NAMESPACE
  7. class QTextDocument;
  8. class QTextCharFormat;
  9. QT_END_NAMESPACE
  10.  
  11. class HtmlHighLighter : public QSyntaxHighlighter
  12. {
  13. Q_OBJECT
  14.  
  15. public:
  16. explicit HtmlHighLighter(QTextDocument *parent = 0);
  17.  
  18. protected:
  19. void highlightBlock(const QString &text) Q_DECL_OVERRIDE;
  20.  
  21. private:
  22. // Status highlighting, which is a text box at the time of its closure
  23. enum States {
  24. None, // Without highlighting
  25. Tag, // Подсветка внутри тега
  26. Comment, // Внутри комментария
  27. Quote // Внутри кавычек, которые внутри тега
  28. };
  29.  
  30. struct HighlightingRule
  31. {
  32. QRegExp pattern;
  33. QTextCharFormat format;
  34. };
  35. QVector<HighlightingRule> startTagRules; // Formatting rules for opening tag
  36. QVector<HighlightingRule> endTagRules; // Formatting rules for closing tags
  37.  
  38. QRegExp openTag; // opening tag symbol - "<"
  39. QRegExp closeTag; // closing symbol tag - ">"
  40. QTextCharFormat edgeTagFormat; // character formatting of openTag and closeTag
  41. QTextCharFormat insideTagFormat; // Formatting text inside the tag
  42.  
  43. QRegExp commentStartExpression; // Regular expression of start comment
  44. QRegExp commentEndExpression; // Redular expression of end comment
  45. QTextCharFormat multiLineCommentFormat; // Format text inside a comment
  46.  
  47. QRegExp quotes; // Regular Expression for text in quotes inside the tag
  48. QTextCharFormat quotationFormat; // Formatting text in quotes inside the tag
  49. QTextCharFormat tagsFormat; // Formatting tags themselves
  50. };
  51.  
  52. #endif // HTMLHIGHLIGHTER_H

HTMLHighLighter.cpp

  1. #include "HTMLHighLighter.h"
  2. #include <QTextCharFormat>
  3. #include <QBrush>
  4. #include <QColor>
  5.  
  6.  
  7. HtmlHighLighter::HtmlHighLighter(QTextDocument *parent)
  8. : QSyntaxHighlighter(parent)
  9. {
  10. HighlightingRule rule;
  11.  
  12. edgeTagFormat.setForeground(QBrush(QColor("#32a9dd")));
  13. insideTagFormat.setForeground(Qt::blue);
  14. insideTagFormat.setFontWeight(QFont::Bold);
  15. openTag = QRegExp("<");
  16. closeTag = QRegExp(">");
  17.  
  18. tagsFormat.setForeground(Qt::darkBlue);
  19. tagsFormat.setFontWeight(QFont::Bold);
  20.  
  21. QStringList keywordPatterns;
  22. keywordPatterns << "<\\ba\\b" << "<\\babbr\\b" << "<\\bacronym\\b" << "<\\baddress\\b" << "<\\bapplet\\b"
  23. << "<\\barea\\b" << "<\\barticle\\b" << "<\\baside\\b" << "<\\baudio\\b" << "<\\bb\\b"
  24. << "<\\bbase\\b" << "<\\bbasefont\\b" << "<\\bbdi\\b" << "<\\bbdo\\b" << "<\\bbgsound\\b"
  25. << "<\\bblockquote\\b" << "<\\bbig\\b" << "<\\bbody\\b" << "<\\bblink\\b" << "<\\bbr\\b"
  26. << "<\\bbutton\\b" << "<\\bcanvas\\b" << "<\\bcaption\\b" << "<\\bcenter\\b" << "<\\bcite\\b"
  27. << "<\\bcode\\b" << "<\\bcol\\b" << "<\\bcolgroup\\b" << "<\\bcommand\\b" << "<\\bcomment\\b"
  28. << "<\\bdata\\b" << "<\\bdatalist\\b" << "<\\bdd\\b" << "<\\bdel\\b" << "<\\bdetails\\b"
  29. << "<\\bdfn\\b" << "<\\bdialog\\b" << "<\\bdir\\b" << "<\\bdiv\\b" << "<\\bdl\\b"
  30. << "<\\bdt\\b" << "<\\bem\\b" << "<\\bembed\\b" << "<\\bfieldset\\b" << "<\\bfigcaption\\b"
  31. << "<\\bfigure\\b" << "<\\bfont\\b" << "<\\bfooter\\b" << "<\\bform\\b" << "<\\bframe\\b"
  32. << "<\\bframeset\\b" << "<\\bh1\\b" << "<\\bh2\\b" << "<\\bh3\\b" << "<\\bh4\\b"
  33. << "<\\bh5\\b" << "<\\bh6\\b" << "<\\bhead\\b" << "<\\bheader\\b" << "<\\bhgroup\\b"
  34. << "<\\bhr\\b" << "<\\bhtml\\b" << "<\\bi\\b" << "<\\biframe\\b" << "<\\bimg\\b"
  35. << "<\\binput\\b" << "<\\bins\\b" << "<\\bisindex\\b" << "<\\bkbd\\b" << "<\\bkeygen\\b"
  36. << "<\\blabel\\b" << "<\\blegend\\b" << "<\\bli\\b" << "<\\blink\\b" << "<\\blisting\\b"
  37. << "<\\bmain\\b" << "<\\bmap\\b" << "<\\bmarquee\\b" << "<\\bmark\\b" << "<\\bmenu\\b"
  38. << "<\\bamenuitem\\b" << "<\\bmeta\\b" << "<\\bmeter\\b" << "<\\bmulticol\\b" << "<\\bnav\\b"
  39. << "<\\bnobr\\b" << "<\\bnoembed\\b" << "<\\bnoindex\\b" << "<\\bnoframes\\b" << "<\\bnoscript\\b"
  40. << "<\\bobject\\b" << "<\\bol\\b" << "<\\boptgroup\\b" << "<\\boption\\b" << "<\\boutput\\b"
  41. << "<\\bp\\b" << "<\\bparam\\b" << "<\\bpicture\\b" << "<\\bplaintext\\b" << "<\\bpre\\b"
  42. << "<\\bprogress\\b" << "<\\bq\\b" << "<\\brp\\b" << "<\\brt\\b" << "<\\brtc\\b" << "<\\bruby\\b"
  43. << "<\\bs\\b" << "<\\bsamp\\b" << "<\\bscript\\b" << "<\\bsection\\b" << "<\\bselect\\b"
  44. << "<\\bsmall\\b" << "<\\bsource\\b" << "<\\bspacer\\b" << "<\\bspan\\b" << "<\\bstrike\\b"
  45. << "<\\bstrong\\b" << "<\\bstyle\\b" << "<\\bsub\\b" << "<\\bsummary\\b" << "<\\bsup\\b"
  46. << "<\\btable\\b" << "<\\btbody\\b" << "<\\btd\\b" << "<\\btemplate\\b" << "<\\btextarea\\b"
  47. << "<\\btfoot\\b" << "<\\bth\\b" << "<\\bthead\\b" << "<\\btime\\b" << "<\\btitle\\b"
  48. << "<\\btr\\b" << "<\\btrack\\b" << "<\\btt\\b" << "<\\bu\\b" << "<\\bul\\b" << "<\\bvar\\b"
  49. << "<\\bvideo\\b" << "<\\bwbr\\b" << "<\\bxmp\\b";
  50.  
  51. for (const QString &pattern : keywordPatterns)
  52. {
  53. rule.pattern = QRegExp(pattern);
  54. rule.format = tagsFormat;
  55. startTagRules.append(rule);
  56. }
  57.  
  58. QStringList keywordPatterns_end;
  59. keywordPatterns_end << "<!\\bDOCTYPE\\b" << "</\\ba\\b" << "</\\babbr\\b" << "</\\bacronym\\b" << "</\\baddress\\b" << "</\\bapplet\\b"
  60. << "</\\barea\\b" << "</\\barticle\\b" << "</\\baside\\b" << "</\\baudio\\b" << "</\\bb\\b"
  61. << "</\\bbase\\b" << "</\\bbasefont\\b" << "</\\bbdi\\b" << "</\\bbdo\\b" << "</\\bbgsound\\b"
  62. << "</\\bblockquote\\b" << "</\\bbig\\b" << "</\\bbody\\b" << "</\\bblink\\b" << "</\\bbr\\b"
  63. << "</\\bbutton\\b" << "</\\bcanvas\\b" << "</\\bcaption\\b" << "</\\bcenter\\b" << "</\\bcite\\b"
  64. << "</\\bcode\\b" << "</\\bcol\\b" << "</\\bcolgroup\\b" << "</\\bcommand\\b" << "</\\bcomment\\b"
  65. << "</\\bdata\\b" << "</\\bdatalist\\b" << "</\\bdd\\b" << "</\\bdel\\b" << "</\\bdetails\\b"
  66. << "</\\bdfn\\b" << "</\\bdialog\\b" << "</\\bdir\\b" << "</\\bdiv\\b" << "</\\bdl\\b"
  67. << "</\\bdt\\b" << "</\\bem\\b" << "</\\bembed\\b" << "</\\bfieldset\\b" << "</\\bfigcaption\\b"
  68. << "</\\bfigure\\b" << "</\\bfont\\b" << "</\\bfooter\\b" << "</\\bform\\b" << "</\\bframe\\b"
  69. << "</\\bframeset\\b" << "</\\bh1\\b" << "</\\bh2\\b" << "</\\bh3\\b" << "</\\bh4\\b"
  70. << "</\\bh5\\b" << "</\\bh6\\b" << "</\\bhead\\b" << "</\\bheader\\b" << "</\\bhgroup\\b"
  71. << "</\\bhr\\b" << "</\\bhtml\\b" << "</\\bi\\b" << "</\\biframe\\b" << "</\\bimg\\b"
  72. << "</\\binput\\b" << "</\\bins\\b" << "</\\bisindex\\b" << "</\\bkbd\\b" << "</\\bkeygen\\b"
  73. << "</\\blabel\\b" << "</\\blegend\\b" << "</\\bli\\b" << "</\\blink\\b" << "</\\blisting\\b"
  74. << "</\\bmain\\b" << "</\\bmap\\b" << "</\\bmarquee\\b" << "</\\bmark\\b" << "</\\bmenu\\b"
  75. << "</\\bamenuitem\\b" << "</\\bmeta\\b" << "</\\bmeter\\b" << "</\\bmulticol\\b" << "</\\bnav\\b"
  76. << "</\\bnobr\\b" << "</\\bnoembed\\b" << "</\\bnoindex\\b" << "</\\bnoframes\\b" << "</\\bnoscript\\b"
  77. << "</\\bobject\\b" << "</\\bol\\b" << "</\\boptgroup\\b" << "</\\boption\\b" << "</\\boutput\\b"
  78. << "</\\bp\\b" << "</\\bparam\\b" << "</\\bpicture\\b" << "</\\bplaintext\\b" << "</\\bpre\\b"
  79. << "</\\bprogress\\b" << "</\\bq\\b" << "</\\brp\\b" << "</\\brt\\b" << "</\\brtc\\b" << "</\\bruby\\b"
  80. << "</\\bs\\b" << "</\\bsamp\\b" << "</\\bscript\\b" << "</\\bsection\\b" << "</\\bselect\\b"
  81. << "</\\bsmall\\b" << "</\\bsource\\b" << "</\\bspacer\\b" << "</\\bspan\\b" << "</\\bstrike\\b"
  82. << "</\\bstrong\\b" << "</\\bstyle\\b" << "</\\bsub\\b" << "</\\bsummary\\b" << "</\\bsup\\b"
  83. << "</\\btable\\b" << "</\\btbody\\b" << "</\\btd\\b" << "</\\btemplate\\b" << "</\\btextarea\\b"
  84. << "</\\btfoot\\b" << "</\\bth\\b" << "</\\bthead\\b" << "</\\btime\\b" << "</\\btitle\\b"
  85. << "</\\btr\\b" << "</\\btrack\\b" << "</\\btt\\b" << "</\\bu\\b" << "</\\bul\\b" << "</\\bvar\\b"
  86. << "</\\bvideo\\b" << "</\\bwbr\\b" << "</\\bxmp\\b";
  87.  
  88. for (const QString &pattern : keywordPatterns_end)
  89. {
  90. rule.pattern = QRegExp(pattern);
  91. rule.format = tagsFormat;
  92. endTagRules.append(rule);
  93. }
  94.  
  95. multiLineCommentFormat.setForeground(Qt::darkGray);
  96. commentStartExpression = QRegExp("<!--");
  97. commentEndExpression = QRegExp("-->");
  98.  
  99. quotationFormat.setForeground(Qt::darkGreen);
  100. quotes = QRegExp("\"");
  101. }
  102.  
  103. void HtmlHighLighter::highlightBlock(const QString &text)
  104. {
  105. setCurrentBlockState(HtmlHighLighter::None);
  106.  
  107. // TAG
  108. int startIndex = 0;
  109. // If you're not within a tag,
  110. if (previousBlockState() != HtmlHighLighter::Tag && previousBlockState() != HtmlHighLighter::Quote)
  111. {
  112. // So we try to find the beginning of the next tag
  113. startIndex = openTag.indexIn(text);
  114. }
  115.  
  116. // Taking the state of the previous text block
  117. int subPreviousTag = previousBlockState();
  118. while (startIndex >= 0)
  119. {
  120. // We are looking for an end-tag
  121. int endIndex = closeTag.indexIn(text, startIndex);
  122.  
  123. int tagLength;
  124. // If the end tag is not found, then we set the block state
  125. if (endIndex == -1)
  126. {
  127. setCurrentBlockState(HtmlHighLighter::Tag);
  128. tagLength = text.length() - startIndex;
  129. }
  130. else
  131. {
  132. tagLength = endIndex - startIndex + closeTag.matchedLength();
  133. }
  134.  
  135. // Set the formatting for a tag
  136. if (subPreviousTag != HtmlHighLighter::Tag)
  137. {
  138. // since the beginning of the tag to the end, if the previous status is not equal Tag
  139. setFormat(startIndex, 1, edgeTagFormat);
  140. setFormat(startIndex + 1, tagLength - 1, insideTagFormat);
  141. }
  142. else
  143. {
  144. // If you're already inside the tag from the start block
  145. // and before the end tag
  146. setFormat(startIndex, tagLength, insideTagFormat);
  147. subPreviousTag = HtmlHighLighter::None;
  148. }
  149. // Format the symbol of the end tag
  150. setFormat(endIndex, 1, edgeTagFormat);
  151.  
  152. /// QUOTES ///////////////////////////////////////
  153. int startQuoteIndex = 0;
  154. // If you are not in quotation marks with the previous block
  155. if (previousBlockState() != HtmlHighLighter::Quote)
  156. {
  157. // So we try to find the beginning of the quotes
  158. startQuoteIndex = quotes.indexIn(text, startIndex);
  159. }
  160.  
  161. // Highlights all quotes within the tag
  162. while (startQuoteIndex >= 0 && ((startQuoteIndex < endIndex) || (endIndex == -1)))
  163. {
  164. int endQuoteIndex = quotes.indexIn(text, startQuoteIndex + 1);
  165. int quoteLength;
  166. if (endQuoteIndex == -1)
  167. {
  168. // If a closing quotation mark is found, set the state for the block Quote
  169. setCurrentBlockState(HtmlHighLighter::Quote);
  170. quoteLength = text.length() - startQuoteIndex;
  171. }
  172. else
  173. {
  174. quoteLength = endQuoteIndex - startQuoteIndex + quotes.matchedLength();
  175. }
  176.  
  177. if ((endIndex > endQuoteIndex) || endIndex == -1)
  178. {
  179. setFormat(startQuoteIndex, quoteLength, quotationFormat);
  180. startQuoteIndex = quotes.indexIn(text, startQuoteIndex + quoteLength);
  181. }
  182. else
  183. {
  184. break;
  185. }
  186. }
  187. //////////////////////////////////////////////////
  188. // Again, look for the beginning of the tag
  189. startIndex = openTag.indexIn(text, startIndex + tagLength);
  190. }
  191.  
  192. // EDGES OF TAGS
  193. // Processing of color tags themselves, that is, highlight words div, p, strong etc.
  194. for (const HighlightingRule &rule : startTagRules)
  195. {
  196. QRegExp expression(rule.pattern);
  197. int index = expression.indexIn(text);
  198. while (index >= 0)
  199. {
  200. int length = expression.matchedLength();
  201. setFormat(index + 1, length - 1, rule.format);
  202. index = expression.indexIn(text, index + length);
  203. }
  204. }
  205.  
  206. for (const HighlightingRule &rule : endTagRules)
  207. {
  208. QRegExp expression(rule.pattern);
  209. int index = expression.indexIn(text);
  210. while (index >= 0) {
  211. int length = expression.matchedLength();
  212. setFormat(index + 1, 1, edgeTagFormat);
  213. setFormat(index + 2, length - 2, rule.format);
  214. index = expression.indexIn(text, index + length);
  215. }
  216. }
  217.  
  218. // COMMENT
  219. int startCommentIndex = 0;
  220. // If the tag is not a previous state commentary
  221. if (previousBlockState() != HtmlHighLighter::Comment)
  222. {
  223. // then we try to find the beginning of a comment
  224. startCommentIndex = commentStartExpression.indexIn(text);
  225. }
  226.  
  227. // If a comment is found
  228. while (startCommentIndex >= 0)
  229. {
  230. // We are looking for the end of the comment
  231. int endCommentIndex = commentEndExpression.indexIn(text, startCommentIndex);
  232. int commentLength;
  233.  
  234. // If the end is not found
  235. if (endCommentIndex == -1)
  236. {
  237. // That set the state Comment
  238. // The principle is similar to that for conventional tags
  239. setCurrentBlockState(HtmlHighLighter::Comment);
  240. commentLength = text.length() - startCommentIndex;
  241. }
  242. else
  243. {
  244. commentLength = endCommentIndex - startCommentIndex
  245. + commentEndExpression.matchedLength();
  246. }
  247.  
  248. setFormat(startCommentIndex, commentLength, multiLineCommentFormat);
  249. startCommentIndex = commentStartExpression.indexIn(text, startCommentIndex + commentLength);
  250. }
  251. }

Download project

Video

Do you like it? Share on social networks!

nayk1982
  • Aug. 29, 2018, 12:57 p.m.

Добрый день. Подскажите, как будет выглядеть метод , если вместо используется ?

nayk1982
  • Aug. 29, 2018, 1:01 p.m.

Половина слов из предыдущего комментария куда-то исчезло. Суть: вместо QRegExp используется QRegularExport. Как в этом случае будет выглядеть highlightBlock?

Evgenii Legotckoi
  • Sept. 1, 2018, 1:59 p.m.

Вы хотели сказать QRegularExpression?

Да вроде также должна выглядеть регулярная.
Там ничего особенного не было в регулярном выражении. Всё должно быть совместимо с регулярными перла, которые являются базой для QRegularExpression.
А по поводу ошибки с паркингом комментария, я постараюсь разобраться, когда вернусь из поездки. Спасибо за информацию.



Comments

Only authorized users can post comments.
Please, Log in or Sign up
  • Last comments
  • AK
    April 1, 2025, 11:41 a.m.
    Добрый день. В данный момент работаю над проектом, где необходимо выводить звук из программы в определенное аудиоустройство (колонки, наушники, виртуальный кабель и т.д). Пишу на Qt5.12.12 поско…
  • Evgenii Legotckoi
    March 9, 2025, 9:02 p.m.
    К сожалению, я этого подсказать не могу, поскольку у меня нет необходимости в обходе блокировок и т.д. Поэтому я и не задавался решением этой проблемы. Ну выглядит так, что вам действитель…
  • VP
    March 9, 2025, 4:14 p.m.
    Здравствуйте! Я устанавливал Qt6 из исходников а также Qt Creator по отдельности. Все компоненты, связанные с разработкой для Android, установлены. Кроме одного... Когда пытаюсь скомпилиров…
  • ИМ
    Nov. 22, 2024, 9:51 p.m.
    Добрый вечер Евгений! Я сделал себе авторизацию аналогичную вашей, все работает, кроме возврата к предидущей странице. Редеректит всегда на главную, хотя в логах сервера вижу запросы на правильн…
  • Evgenii Legotckoi
    Oct. 31, 2024, 11:37 p.m.
    Добрый день. Да, можно. Либо через такие же плагины, либо с постобработкой через python библиотеку Beautiful Soup