Author Topic: Pseudo semantic highlighting (Read 36937 times)

Alpha · « **on:** January 16, 2013, 03:25:21 am »

CC already has a fairly decent idea of the what is going on in the source, so I decided to make CC talk to Scintilla. This implementation is fairly crude, but I would appreciate feedback from anyone who would like to see a little more color in their code.

Code

Index: src/plugins/codecompletion/codecompletion.cpp
===================================================================
--- src/plugins/codecompletion/codecompletion.cpp	(revision 8788)
+++ src/plugins/codecompletion/codecompletion.cpp	(working copy)
@@ -3641,4 +3641,39 @@
     TRACE(_T("CodeCompletion::OnEditorActivatedTimer: Starting m_TimerToolbar."));
     m_TimerToolbar.Start(TOOLBAR_REFRESH_DELAY, wxTIMER_ONE_SHOT);
     TRACE(_T("OnEditorActivatedTimer() : Current activated file is %s"), curFile.wx_str());
+
+    cbEditor* ed = Manager::Get()->GetEditorManager()->GetBuiltinEditor(editor);
+    if (!ed || ed->GetControl()->GetLexer() != wxSCI_LEX_CPP)
+        return;
+    TokenIdxSet result;
+    m_NativeParser.GetParser().FindTokensInFile(curFile, result, tkAnyContainer | tkAnyFunction);
+    TokenTree* tree = m_NativeParser.GetParser().GetTokenTree();
+    wxArrayString varList;
+    for (TokenIdxSet::const_iterator it = result.begin(); it != result.end(); ++it)
+    {
+        Token* token = tree->at(*it);
+        if (!token)
+            continue;
+        if (token->m_TokenKind & tkAnyFunction)
+        {
+            if (token->m_ParentIndex == wxNOT_FOUND)
+                continue;
+            else
+                token = tree->at(token->m_ParentIndex);
+        }
+        if (token && token->HasChildren())
+        {
+            for (TokenIdxSet::const_iterator chIt = token->m_Children.begin();
+                 chIt != token->m_Children.end(); ++chIt)
+            {
+                 const Token* chToken = tree->at(*chIt);
+                 if (   chToken && chToken->m_TokenKind == tkVariable
+                     && varList.Index(chToken->m_Name) == wxNOT_FOUND )
+                {
+                    varList.Add(chToken->m_Name);
+                }
+            }
+        }
+    }
+    ed->GetControl()->SetKeyWords(3, GetStringFromArray(varList, wxT(" "), false));
 }

daniloz · « **Reply #1 on:** January 16, 2013, 08:13:20 am »

@Alpha: your patch didn't apply on my working copy, I had to it "by hand". I'm not sure if I have some differences or you have, haven't got the time to check...

However, I like to new colors, thx!

ollydbg · « **Reply #2 on:** January 16, 2013, 11:03:31 am »

Quote from: Alpha on January 16, 2013, 03:25:21 am

CC already has a fairly decent idea of the what is going on in the source, so I decided to make CC talk to Scintilla. This implementation is fairly crude, but I would appreciate feedback from anyone who would like to see a little more color in their code.

I like such feature, thanks.

MortenMacFly · « **Reply #3 on:** January 16, 2013, 04:25:57 pm »

Quote from: ollydbg on January 16, 2013, 11:03:31 am

I like such feature, thanks.

Yeah, its nice. It also "highlights" where work needs to be done. For example, add a new member variable to the class in a header file, then create an inline method to use this member variable (i.e. a getter-method). This variable will be the only one not highlighted like the others until CC scans this file again...

oBFusCATed · « **Reply #4 on:** January 16, 2013, 11:55:41 pm »

How do I test this?
I've applied it and I see no change.

Alpha · « **Reply #5 on:** January 17, 2013, 12:02:49 am »

This patch currently only hooks into the editor activated event, so you need to switch editors/close reopen editors (after initial parsing has finished) for colors to show up.

oBFusCATed · « **Reply #6 on:** January 17, 2013, 01:26:21 am »

OK, but it doesn't work for C code...

Probably you have to look in this topic: http://forums.codeblocks.org/index.php/topic,16249.0.html

Is it possible to extract this code in a separate plugin or in core?

Alpha · « **Reply #7 on:** January 17, 2013, 02:43:55 am »

Quote from: oBFusCATed on January 17, 2013, 01:26:21 am

OK, but it doesn't work for C code...

... hence the "pseudo"

.
The logic used is fairly simplistic:

List functions in the current file
- Collect the classes they are from
List the classes in the current file
Iterate through the member variables of both lists of classes
Put this list of variables in Scintilla's (previously unused by Code::Blocks) keyword set for "Global classes and typedefs"

This makes a lot of assumptions about coding style, but these assumptions generally yield decent results on C++ code. However, searching for a class in C code will fail for obvious reasons.
In C code, what would your expectations be for choosing the highlighted set?

Quote from: oBFusCATed on January 17, 2013, 01:26:21 am

Probably you have to look in this topic: http://forums.codeblocks.org/index.php/topic,16249.0.html

Yes... the plugin there is a lot more ambitious than what I am attempting.

Quote from: oBFusCATed on January 17, 2013, 01:26:21 am

Is it possible to extract this code in a separate plugin or in core?

Probably not; this makes use of the token tree that CC builds to decide on the set of keywords to highlight.

Quote from: MortenMacFly on January 16, 2013, 04:25:57 pm

For example, add a new member variable to the class in a header file, then create an inline method to use this member variable (i.e. a getter-method). This variable will be the only one not highlighted like the others until CC scans this file again...

Although the code is not necessarily expensive, I would prefer it run the fewest number of times necessary. Do you have a recommended selection of events I should attach it to?

Alpha · « **Reply #8 on:** January 17, 2013, 03:19:28 am »

This should look a little nicer on C code (highlight global vars in C), and also deals with inherited members (for C++).

I added a lock on s_TokenTreeMutex (because that is what the rest of the code seems to do when walking through tokens), however, I do not exactly understand the concept of a mutex very well; is it needed here?

MortenMacFly · « **Reply #9 on:** January 17, 2013, 04:19:37 pm »

Quote from: Alpha on January 17, 2013, 12:02:49 am

This patch currently only hooks into the editor activated event, so you need to switch editors/close reopen editors (after initial parsing has finished) for colors to show up.

There is also the drawback, btw: I noticed really massive slow-downs when opening an editor of a large file with many references to highlight. Do you experience the same?

MortenMacFly · « **Reply #10 on:** January 17, 2013, 04:21:36 pm »

Quote from: Alpha on January 17, 2013, 03:19:28 am

however, I do not exactly understand the concept of a mutex very well; is it needed here?

A mutex is used where the tree could be accessed in parallel (i.e. from another thread), to avoid freezes. Usually accessing the token tree always requires a lock, unless it has been set from the caller function already. I'll have a look but later...

Alpha · « **Reply #11 on:** January 17, 2013, 04:45:07 pm »

Quote from: MortenMacFly on January 17, 2013, 04:19:37 pm

There is also the drawback, btw: I noticed really massive slow-downs when opening an editor of a large file with many references to highlight. Do you experience the same?

I have not tried opening anything extremely large yet... is the slow-down constant, or is it a pause when you switch to the tab? (I could probably increase performance by switching to a hash instead of an array to insure unique entries.)

Quote from: Alpha on January 17, 2013, 12:02:49 am

This patch currently only hooks into the editor activated event, so you need to switch editors/close reopen editors (after initial parsing has finished) for colors to show up.

I forgot to mention, this second patch adds one other event: color all open editors when parsing completes.

MortenMacFly · « **Reply #12 on:** January 17, 2013, 06:52:55 pm »

Quote from: Alpha on January 17, 2013, 04:45:07 pm

I have not tried opening anything extremely large yet... is the slow-down constant, or is it a pause when you switch to the tab?

It seems as soon as I switch... I'll report back once I have played the second one.

Alpha · « **Reply #13 on:** January 18, 2013, 04:36:00 am »

Debug timing code attached. Which algorithm yields better performance (especially on larger files where performance actually matters) on your machine?

Keep in mind that the first run on an editor will be skewed because:

Code

        if (token->m_Ancestors.empty())
            tree->RecalcInheritanceChain(token);

will change performance after the first run (so only pay attention to later runs on an editor).

ollydbg · « **Reply #14 on:** January 31, 2013, 04:06:30 pm »

You add four stopwatches, and four type of tokens(keywords) were colourised. Can you tell me what kind of tokens for what stopwatch?
1,Variable?
2,Function?
3,Class?
4,?
I'm totally confused.

When tested, I see that only scan the current files token is NOT necessary, E.g.

Code

class MyFrame: public wxFrame

You can see that "wxFrame" will not be colourised because its token belong to another source file/header.

Code::Blocks Forums

News:

Author Topic: Pseudo semantic highlighting (Read 36937 times)

Alpha

Pseudo semantic highlighting

daniloz

Re: Pseudo semantic highlighting

ollydbg

Re: Pseudo semantic highlighting

MortenMacFly

Re: Pseudo semantic highlighting

oBFusCATed

Re: Pseudo semantic highlighting

Alpha

Re: Pseudo semantic highlighting

oBFusCATed

Re: Pseudo semantic highlighting

Alpha

Re: Pseudo semantic highlighting

Alpha

Re: Pseudo semantic highlighting

MortenMacFly

Re: Pseudo semantic highlighting

MortenMacFly

Re: Pseudo semantic highlighting

Alpha

Re: Pseudo semantic highlighting

MortenMacFly

Re: Pseudo semantic highlighting

Alpha

Re: Pseudo semantic highlighting

ollydbg

Re: Pseudo semantic highlighting