Developer forums (C::B DEVELOPMENT STRICTLY!) > Development
Revamping Code Completion with little changes
mandrav:
Update:
--- Code: ---#ifndef CBTHREADPOOL_H
#define CBTHREADPOOL_H

#include <queue>
#include <vector>
#include <wx/event.h>
#include <wx/thread.h>

/// Abstract base class for workers assigned to the thread pool object.
/// Derive a class from this and perform your thing in Execute().
/// It's like wxThread...
class cbThreadPoolTask
{
    public:
        virtual ~cbThreadPoolTask(){}
        virtual int Execute() = 0;
};

// forward decl
class PrivateThread;

/// The main thread pool object.
/// If you set concurrent threads to -1, it will use the number of CPUs present ;)
class cbThreadPool : public wxEvtHandler
{
    public:
        cbThreadPool(wxEvtHandler* owner, int id = -1, int concurrentThreads = -1);
        virtual ~cbThreadPool();

        virtual void SetConcurrentThreads(int concurrentThreads);
        virtual bool AddTask(cbThreadPoolTask* worker);
    protected:
        virtual void AllocThreads();
        virtual void FreeThreads();
        virtual void ScheduleTasks();
        virtual void ScheduleNextTask(PrivateThread* thr);
        void OnThreadTaskDone(wxCommandEvent& event);

        wxEvtHandler* m_pOwner;
        int m_ID;
        std::queue<cbThreadPoolTask*> m_Pool;     // pending tasks
        std::vector<PrivateThread*> m_Workers;    // worker threads
        int m_ConcurrentThreads;
        int m_MaxThreads;

        DECLARE_EVENT_TABLE();
};

#endif // CBTHREADPOOL_H
--- End code ---
It's already working in a test app :)
Yiannis.
thomas:
The reason nobody reads the article is maybe that the link is broken (there is a space in front of Brett).
But apart from that article (which is a nice analysis of heavy-duty threading, by the way), there may be good reasons why one would really want two or more pools. I am not saying one must, but one should at least consider it.
Two pools can, for example, run at different priorities: one does the "realtime" jobs, and the other fills the CPU with whatever has to be done while it is idle. It does not really matter if parsing files for code completion takes a second or two longer; the user will never notice. Neither does it matter if something else takes half a second longer.
What the user will notice, however, is if the application does not feel interactive. If you click onto something and it takes half a second before it actually starts running because it is added to the tail of the job queue, then this is something very noticeable. With two pools, this is not a problem at all.
Regarding many threads, it is true that context switches may degrade performance really badly. Running 20 heavy number-crunching threads on a single CPU will certainly get you nowhere.
However, it is simply not true that sleeping (or blocking) threads take away significant amounts of CPU, not even on Windows. We are talking about 4 or 5 threads altogether. My PC has 382 threads running right now, and except while typing, all CPU time (3%) goes into the task manager. If there are 5 more threads, there will be no difference at all. However, creating new threads is really, really expensive compared to that (on Windows, at least).
Even with context switches in mind, having one or two "extra" threads may be an advantage because then one thread can fully utilize the CPU while the other is blocking (waiting until a DMA operation finishes, for example). Since many tasks in an IDE involve disk access, this is likely to happen, too.
Altogether, it is not an easy decision, but I think that since threads are really cheap, one can be generous with them in this case. Rather have 2 pools than one, and rather 5 threads each than one or two. Since we are not going to factorize a 300-digit number or anything like that, context switches should really not matter - most of the time our threads will sleep anyway.
grv575:
--- Quote from: zieQ on August 23, 2005, 11:31:09 am ---The term "optimality" I used previously refers to the article that nobody seems to have read, not a guess at what would be the better way to implement it or at the number of worker threads to use. I'll stop making comments if nobody reads what I say :?
--- End quote ---
I did read the article. It's a good reference, but very academic. A much better intro to multiprocessing and thread performance considerations is http://www.open-mag.com/features/Vol_116/NUMA/NUMA.htm
It gives rules of thumb for this type of stuff (allocate as many intensive, concurrent threads as you have CPUs [or CPU cores]), which is the accepted practice. Analyzing the PDF you linked: the parameter they synthesized and benchmarked at values of 10, 50, 100 gives the amount of idle time each thread had, allowing other threads to be scheduled before the time slice is up. 100 is unrealistic: these threads are allocated and slept right away, which explains the linear scaling. At 50%, they are semi-intensive. And at 10% (90% of the timeslice is used), we only get improvements by using 2 threads in the pool vs. 1. This is because the test system had 1 CPU (I don't remember if hyperthreading was enabled). Apparently scheduling was still more efficient with 2 threads vs. 1, but I suspect it was a hyperthreading CPU (could always check the article ;) ).
Now the issue is that most of the threads in the pool will have something like 3-5% timeslice utilization, except for the compile and link threads, which are CPU-intensive. Therefore, setting each pool to use a number of threads equal to the number of CPUs/cores (accounting for hyperthreading, of course) would work very well. Only the compilation pool would be extremely busy, and this would also keep the total thread count low. After all, how many total CPUs do you have (dual core would mean ~6 threads across 3 pools, for example)?
--- Quote ---As I said,
- no need for many pools, it will degrade performance anyway: if one of the pools is already consuming all the processing power, the other pool will have difficulty running, and otherwise there will be many idle threads which consume (a little, but still) processing power
--- End quote ---
Without multiple pools you run into the task queuing and UI responsiveness issues that thomas summarized adequately.
--- Quote ---- the number of thread should depend on the workload and workload type, not a fixed value which provide good performance for somebody but bad for others
--- End quote ---
That does not seem to be the case. A configuration option where the user can choose how many threads they would like to use is a good idea, and the default should definitely depend on the number of CPUs on the system, because that is optimal.
rickg22:
As long as threads are running in the background, there's no consequence the user could feel (i.e. "my app froze while searching!").
The reason parsing kind of freezes Code::Blocks for a second when you save a file, at least when no wxYield is inserted in the middle, is that it depends on events (which are handled in the MAIN thread) for the further processing. But if we implement the pooling, there won't be any problem with that.
Anyway, I figured out how to avoid the queuing overhead.
You have TWO queues, and a "scheduler thread", dedicated 100% to queuing:
Passive task queue (shared by main thread and scheduling thread) [ pending tasks]
Active task queue (shared by scheduling and worker threads) [about-to-run tasks]
The passive and active task queues use DIFFERENT MUTEXES so they can be handled asynchronously.
The "scheduler thread" would remove the task from the passive queue and keep a copy in memory for itself (if the queues hold pointers rather than objects, so much the better ;-) ). Then it releases the "passive mutex" and can start scheduling. This may take more time when the timeslice is very crowded, but since the passive mutex has been released, the GUI won't have to deal with it. All it knows is that the job is being put in the queue now.
rickg22:
You did *excellent* work with the Code completion!
I have a question tho...
Why keep the name "ParserThread" if it's not a thread anymore? It's a bit confusing... but anyway, my kudos to you; in 48 hours you've done wonders with Code::Blocks! 8)
Congratulations!