From: Richard Frith-Macdonald
Subject: Re: [bug #27823] SQLClient drops connections without sending notifications
Date: Wed, 28 Oct 2009 06:41:30 +0000
On 27 Oct 2009, at 19:42, Robert J. Slover wrote:
Richard,
At work, one part of our application is used to 'scan' data gathered
from various devices, normalize it, and insert it into the
database as measurements. This is very performance-critical, and
for a very large percentage of the cases, we are only ever inserting
rows (new measurements). However, on occasion we need to
'reprocess' the data that was originally gathered (for instance, if
the model used to normalize the data has been modified to correct an
error). In this case, the same measurement rows will be inserted,
although they may contain slightly different information. This
component of the application has no knowledge of whether it is
processing data for the first time or reprocessing it (and no need
to know, either). It first attempts to insert a row and if that
fails due to a duplicate key constraint violation, it will do an
update instead. The overhead of querying the database first to see
if the record is already there would in most cases be completely
wasted, and in the rarer case would not save anything over simply
attempting the insert in the first place. This of course uses
straight C and ODBC, but the principle is the same ... if the ODBC
drivers forced a disconnect on every constraint violation, we would
have significantly worse performance, and would have to opt for the
generally slower approach of querying first, since we can only
commit a group of measurements for an interval on success of the
entire scan (it either all goes in or none of it does).
As I said, I don't mind accepting a patch to allow things not to
disconnect on error, but your example really just reinforces my
assertion that it should not be an issue.
You say that your code does not 'need to know' whether it's adding new
records or replacing existing ones, yet it certainly does since it
must handle the errors which will occur if it tries to insert a
duplicate value. This means that your code is more complex than it
would be if it really didn't need to know.
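In outline, the flow you describe amounts to something like this (just
a sketch; the table and column names here are made up, not your actual
schema):

   -- First attempt the insert ...
   INSERT INTO measurement (device_id, interval_start, value)
   VALUES (42, '2009-10-27 19:00:00', 1.23);

   -- ... and only if that fails with a duplicate key violation,
   -- fall back to updating the existing row:
   UPDATE measurement
      SET value = 1.23
    WHERE device_id = 42
      AND interval_start = '2009-10-27 19:00:00';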
Your code will be performing inconsistently ... sometimes (usually) it
will be fast, but other times it will be slow because of the error
handling. When it's slow, it is presumably still 'good enough' for
your current system, but is unlikely to scale well if you start having
to deal with bigger datasets.
If, instead, it were structured as a transaction which first deletes any
existing records and then inserts the new ones, it might be very
slightly slower in the common case, but more consistent and simpler.
It would never be anything like as slow as the case where you try an
insert, catch the error, and then update ... and if that performance
is acceptable then the performance of the simpler, more consistent way
of doing things must be acceptable too. In fact the delete and insert
model is very efficient ... when no deletion is actually needed, the
delete has the effect of reading index information into memory so it's
available for the insertion and is not wasted effort. When a
deletion is needed, the database server is able to optimise it ...
postgres implements an update as a delete and insert anyway, so the
performance in this case is about the same as in the case where no
deletion is needed, which is also about the same as when you just do
an update!
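Roughly (again just a sketch, using the same made-up names):

   BEGIN;
   -- Delete any existing row for this key; in the common case this
   -- matches nothing, but it still pulls the index pages into memory ...
   DELETE FROM measurement
    WHERE device_id = 42
      AND interval_start = '2009-10-27 19:00:00';
   -- ... then insert unconditionally, with no error handling needed.
   INSERT INTO measurement (device_id, interval_start, value)
   VALUES (42, '2009-10-27 19:00:00', 1.23);
   COMMIT;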
The SQLClient library was developed specifically for performance-critical
database coding (in particular, pushing huge numbers of messages to
mobile phones) ...
My idea of performance-critical code is software which runs
consistently fast, and error generation/handling is something you take
great effort to avoid as it is fundamentally opposed to consistency
(when an error occurs performance changes) and speed (error handling
is slow because of the additional client-server messages and
transaction overheads). In fact, avoiding error generation would come
in at about number three on the list of essentials for high-performance
database programming (after use of indexes and batching of
inserts/updates).
The only times I use the design pattern of attempting an operation,
catching errors, and handling the errors separately are:
1. rarely, when performance is truly not an issue (in which case loss
of connection is irrelevant)
2. inside a stored procedure ... so the error handling is all done
within the database server and is therefore much faster as it's all in
a single transaction
The second is not really what we were talking about though ... server-
side error handling is a legitimate tool and means that the client
side doesn't receive an error.
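On postgres that sort of thing can be written as a plpgsql function,
something like this (a sketch only; the function, table and column
names are made up):

   CREATE FUNCTION put_measurement(p_device integer,
                                   p_start timestamp,
                                   p_value double precision)
   RETURNS void AS $$
   BEGIN
     -- Try the insert first; if the row already exists the server
     -- catches the duplicate key error itself and updates instead,
     -- all within one transaction, so the client never sees an error.
     BEGIN
       INSERT INTO measurement (device_id, interval_start, value)
       VALUES (p_device, p_start, p_value);
     EXCEPTION WHEN unique_violation THEN
       UPDATE measurement
          SET value = p_value
        WHERE device_id = p_device
          AND interval_start = p_start;
     END;
   END;
   $$ LANGUAGE plpgsql;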
So as I see it, the only case where this matters is where existing
code happens to be fast enough with the error catching, but not fast
enough if reconnects are required ... a fairly
rare situation, in which I'd see the ability to change the disconnect
behavior as a stop-gap to allow you to keep a system running while
rewriting and testing the critical section to handle heavier loads.