Print Page - Rewrote the DoParse function

Developer forums (C::B DEVELOPMENT STRICTLY!) => Development => CodeCompletion redesign => Topic started by: Loaden on April 19, 2010, 03:55:55 pm

Title: Rewrote the DoParse function
Post by: Loaden on April 19, 2010, 03:55:55 pm

I rewrote the DoParse function, replace "else if" to "switch, case", found that performance has improved.
And now, the DoParse function like this:

Code

void ParserThread::DoParse()
{
    ...
        bool switchHandled = true;
        switch (token.Length())
        {
            case 1:
            if (token == ParserConsts::semicolon)
            {
                ...
            }
            else if (token == ParserConsts::dot
                     || (token == ParserConsts::gt && m_LastToken == ParserConsts::dash))
            {
                ...
            }
            else if (token == ParserConsts::opbrace)
            {
                ...
            }
            else if (token == ParserConsts::clbrace)
            {
                ...
            }
            else if (token == ParserConsts::colon)
            {
               ...
            }
            else if (token == ParserConsts::hash)
            {
                ...
            }
            else
                switchHandled = false;
            break;

            case 2:
            if (token == ParserConsts::kw_if || token == ParserConsts::kw_do)
            {
                ...
            }
            else
                switchHandled = false;
            break;

            case 3:
            if (token == ParserConsts::kw_for)
            {
                ...
            }
            else
                switchHandled = false;
            break;
......

            case 9:
            if (token == ParserConsts::kw_namespace)
            {
                m_Str.Clear();
                HandleNamespace();
            }
            else
                switchHandled = false;
            break;

            default:
                switchHandled = false;
            break;

[attachment deleted by admin]

Title: Re: Rewrote the DoParse function
Post by: MortenMacFly on April 19, 2010, 06:28:06 pm

Hi Loaden,
I fully respect all the work you are doing. Hence please keep in mind that we need to track all your ideas for a proper later integration. Unluckily it comes to a moment where we are in feature freeze. I am hardly trying to keep everything you are proposing but it's getting harder and harder with every patch of yours. So my request would be, can you please track everything in a way that we can have all patches in one place? Probably you can post all of them in an own thread of yours, just a short information:
- what do they do
- what do they modify
- what's the base SVN revision to apply them to
- (what other patches are required for the to apply)
- what are conflicts with other patches
- link to another place in the forums for discussions.
Inform that this thread is a collection of patches and that answers are not welcome (I can even remove discussion).
I realised that you already conflict with a patch of OllyDbg. It will be hard to resolve this in the end and I don't want to take such a risk later on.

Title: Re: Rewrote the DoParse function
Post by: JGM on April 19, 2010, 06:52:42 pm

I think the problem here is that english is hard for him, he needs a mediator, some one that speaks on his native language, discuss with him his work and organize the ideas here, just a suggestion :)

Would be a shame loosing track of his work because of lack in organization (communication problems?)

Title: Re: Rewrote the DoParse function
Post by: Loaden on April 19, 2010, 07:07:15 pm

Sorry, I am not a professional programmer, I'm just a physics teacher.
I can not understand some replies, but one thing. That is i fully agree: After CB official release, discuss whether to apply my patch.
I have not studied English for 10 years. :(

Title: Re: Rewrote the DoParse function
Post by: MortenMacFly on April 19, 2010, 09:08:14 pm

Quote from: Loaden on April 19, 2010, 07:07:15 pm

Sorry, I am not a professional programmer, I'm just a physics teacher.

Interesting combination: Physics teacher advancing CC. ;-)

So to make things simple: Please start an own thread and post there your patch and a short explanation (including base SVN revision) and nothing else. Next time you have a patch, do the same in this very thread (basically "reply"). I will make this topic sticky and move every "replies" to another thread. Thus we have a clear history of your work in one place.

Probably it's already too late, but I would love to see this from the beginning of your work. Probably another CC dev can help?!

Title: Re: Rewrote the DoParse function
Post by: ollydbg on April 20, 2010, 01:48:27 am

Quote from: JGM on April 19, 2010, 06:52:42 pm

I think the problem here is that english is hard for him, he needs a mediator, some one that speaks on his native language, discuss with him his work and organize the ideas here, just a suggestion :)

Would be a shame loosing track of his work because of lack in organization (communication problems?)

I would like to help with the translation between Chinese and English.

Title: Re: Rewrote the DoParse function
Post by: ollydbg on April 20, 2010, 03:24:31 am

About loaden's the patch:

I found there is some code in your patch:

Code

switch (token.Length())

Why do a switch statement depend on the string length??

For me, I would do something like this ( You can find this method in any books teaching about compiler design ):

The low level lexer( or Tokenizer ), the "Token" returned from the Tokenizer should have two fields:
1, token type: identifier, open brace, close brace.... and these types were categories by different enum value.
2, token value: for a "identifier" type token, the value is a wxString, just the "identifier" 's name, for open brace type token, they don't have a "token value" field.

In the parserthtread code, we can do a switch by the "token type".

Title: Re: Rewrote the DoParse function
Post by: Loaden on April 20, 2010, 06:58:01 am

Quote from: MortenMacFly on April 19, 2010, 06:28:06 pm

Hi Loaden,
I fully respect all the work you are doing. Hence please keep in mind that we need to track all your ideas for a proper later integration. Unluckily it comes to a moment where we are in feature freeze. I am hardly trying to keep everything you are proposing but it's getting harder and harder with every patch of yours. So my request would be, can you please track everything in a way that we can have all patches in one place? Probably you can post all of them in an own thread of yours, just a short information:
- what do they do
- what do they modify
- what's the base SVN revision to apply them to
- (what other patches are required for the to apply)
- what are conflicts with other patches
- link to another place in the forums for discussions.
Inform that this thread is a collection of patches and that answers are not welcome (I can even remove discussion).
I realised that you already conflict with a patch of OllyDbg. It will be hard to resolve this in the end and I don't want to take such a risk later on.

OK, I will do it in the next two days.

Title: Re: Rewrote the DoParse function
Post by: Loaden on April 20, 2010, 07:09:14 am

Quote from: ollydbg on April 20, 2010, 01:48:27 am

Quote from: JGM on April 19, 2010, 06:52:42 pm
I think the problem here is that english is hard for him, he needs a mediator, some one that speaks on his native language, discuss with him his work and organize the ideas here, just a suggestion :)

Would be a shame loosing track of his work because of lack in organization (communication problems?)

I would like to help with the translation between Chinese and English.

Thanks lot!

Title: Re: Rewrote the DoParse function
Post by: MortenMacFly on April 20, 2010, 11:55:28 am

Quote from: Loaden on April 19, 2010, 03:55:55 pm

And now, the DoParse function like this:

Ok, I had a look. Besides a clearer look, what I saw in the first place is a GoTo. This is unfortunately a "no-go" for us. Please don't use GoTos in C++. This is very bad style and there is no reason for doing this.

Title: Re: Rewrote the DoParse function
Post by: Loaden on April 20, 2010, 03:57:25 pm

This patch only as a try, here you can change to "return".

Title: Re: Rewrote the DoParse function
Post by: ollydbg on April 27, 2010, 07:21:48 am

Quote from: MortenMacFly on April 19, 2010, 06:28:06 pm

I realised that you already conflict with a patch of OllyDbg. It will be hard to resolve this in the end and I don't want to take such a risk later on.

@MortenMacFly

Yes, Loaden's patch about "handling conditional preprocessor" is totally different with "my patch of handling Preprocessor". We use different methods, let me give a explanation:

My patch released in the "ptest" project, see here: ParserTester for codecompletion plugin (http://forums.codeblocks.org/index.php/topic,12066.0.html) do the main work in the Tokenizer class. That is to say, when a Tokenizer return a wxString, it will never return a wxString stream like "#" "ifdef", because all the handling of conditional preprocesser directives was done in the Tokenizer::DoGetToken().

The aim I implement this was that I want to solve some problems caused by these code:

Code

void function ( int i, #ifdef AAA
                     int j)
#else
)
#endif
{
XXXX
};

You see, when a #if is appeared in the function arguments parentheses, the parser may fail (because the current SVN code just match a closing parenthesis after it see a opening parenthesis) . But I'd say these kind of code happens in a rare situation. :D

Refer to Loaden's patch, he just do the "handling conditioanl preprocessor" in the Parserthread class. So, he didn't change Tokenizer class so much. The Most advantage of Loaden's patch is that he implement a true "expression solver". For example:

Code

//#define AAA

#ifdef AAA
#include <xxx.h>
#else
#include <yyy.h>
#endif

The current SVN code (also the patch supplied by me) only parse the code like below, all the #else branches were skipped whether AAA is defined or not.

Code

//#define AAA
#include <xxx.h>

But with Loaden's patch, the expression value after #if was dynamically evaluated when parsing, and only a true conditioanl preprocessor branch will be parsed.
So, As a conclusion: Loaden's Patch is much better than mine, you should pick his patch.

Code::Blocks Forums

Developer forums (C::B DEVELOPMENT STRICTLY!) => Development => CodeCompletion redesign => Topic started by: Loaden on April 19, 2010, 03:55:55 pm