Troubleshooting when working with nodes
Task list displays progress incorrectly
-
Sometimes no progress is displayed for a task because PolyAnalyst cannot estimate how long it will take to complete the task. In this case the task is displayed just to show that the task is running. For certain types of these tasks, the progress may be displayed as 0% all the way up to the point in time that the task completes, at which point the task list may briefly display the progress as 100% and then the task is removed.
-
Sometimes progress appears frozen at 0 or 99% for an extended period of time because PolyAnalyst cannot accurately measure the progress and is instead only providing an estimate, which is, in the case that the bar appears not to move, an inaccurate estimate. Tasks routinely involve many sub-operations. The progress for some of these sub-operations may be perfectly measurable but not for others. Progress for such composite tasks may progress smoothly from 0 to 80% within the space of a few minutes, and then spend the next hour to grow from 81% to 100%. This is considered normal and expected behavior. Similarly, a task may discover halfway through that it has twice as much work left to do. PolyAnalyst then may recalculate its estimate of the progress. However, instead of reverting from a metric such as 50% complete to a metric of 25% complete, PolyAnalyst instead chooses to spend a different amount of time completing the remaining 50% than it did the first 50%.
-
Sometimes PolyAnalyst is busy performing several tasks at once and at this moment has prioritized working on something else (e.g. some other user’s task(s) in another project). Similarly, the computer running PolyAnalyst Server may be running other non-PolyAnalyst programs simultaneously and has chosen to de-prioritize PolyAnalyst’s processing. As a result, progress will appear to slow to a halt.
-
Sometimes a task never appears in the task list because, most likely, the task completed very quickly and was removed as quickly as it was added. This is routinely the case for very simplistic processes when working with small amounts of data.
-
Sometimes the progress for a task reaches 100% and then restarts at 0%. This is because PolyAnalyst sometimes measures the progress of sub-operations individually (and resets the progress to 0% at the start of each sub-operation in succession).
-
Sometimes PolyAnalyst will display several separate tasks for a single operation because PolyAnalyst considers these to be separate operations. In other cases, a second or later operation is waiting for some other operation to first complete before it begins.
Why do some tasks take significantly longer to complete than other tasks?
The length of time it takes to complete certain operations can take hours, or even days, whereas some other operations will complete in seconds or minutes. PolyAnalyst is an advanced statistical data analysis and data processing system. It can store hundreds of millions of records and generate highly intricate mathematical models. Certain processes require an immense amount of computing power and time. While PolyAnalyst was designed to carry out operations as quickly as possible, and many of the operations will complete within minutes, you should not develop the expectation that all tasks complete in seconds or minutes. The fact that an operation is taking hours to complete does not suggest that something went wrong. This is simply the experience of running complicated data processing and analytical operations. We at Megaputer are always working on making things faster and more efficient in order to minimize this effect.
PolyAnalyst does not stop tasks quickly
Sometimes PolyAnalyst does not halt its operations immediately after clicking the Stop tasks button. Sometimes it takes several seconds (or even minutes) as PolyAnalyst must perform several cleanup operations. During this time the task list may show no change and PolyAnalyst may appear to continue working as if you had not chosen to stop all tasks. Please be patient.
As an example of why this might take so long, consider the situation where a node is importing several hundred million records, has made some progress, and now has to delete all of that data it imported up to the point of being halted. In addition, in some cases PolyAnalyst may not stop a task even after clicking the Stop tasks button. For example, the task of saving a project cannot be stopped.
PolyAnalyst continues processing after exiting
Exiting the client while tasks are running does NOT stop tasks from running. Tasks are run by the server and not by the client. When you initiate a task using the client, you are merely sending a message to the server to start a task. The client is just displaying what tasks the server is running. If you close the client while tasks are running and then later reopen the client and reopen the project you were working with, any tasks that are still running in the server will automatically reappear in the task list.
Error when executing a node
In addition to checking whether the configuration of a node is valid at the time that you click the OK or Execute button in the properties window of the node, or at the time you select the Execute option from a node’s right-click menu, sometimes an incorrect configuration is not determined until the node actually starts to execute. Many nodes perform additional checks to determine whether they can successfully execute around the time they start executing. Therefore, it’s possible for one of these real time checks to fail. Sometimes a node may stop executing and display a large red X over its icon on the flowchart (and possibly in the task list). Right-click on the node and select Display error description. View the error description and determine how to correct the configuration so that the node can eventually execute.
Viewing errors in the node log
If there is a sign of an error on a node icon, you can open the node log and get information about the type of an error.
A conversion error can occur if PolyAnalyst guesses incorrectly the type of a column while importing the data, for example, in the CSV source node. For example, if a dataset contains 397 integer and 1 numerical value, while importing the node will automatically determine that the column contains data of the integer type (as integer values prevail) with one exception. To correct this error, it’s needed to manually change the type of data to the correct one on the Column specifications tab. To do this, click the Refresh button to display all the columns on the Column specifications tab and change the data type in the User column. To change the data type in a column, double-click on the required cell and open a drop-down menu with all the available data types. Then select the required data type from the list.
Node view window disappears
A node’s view, if open, is automatically closed when you change the properties of a node or reexecute a node. This can also happen if you modify or execute any node preceding the node in a sequence, because that subsequently causes the node you are viewing to update or execute.
Similarly, if another user modifies the node or executes the node or any earlier node while you have the project open and have concurrent project editing enabled, then the window automatically disappears.
Cannot view node when tasks are executing
When you decide to view the contents of a node, PolyAnalyst must load that information into your computer’s memory. Most of the time this is an extremely fast process, which leads many users to always expect this to happen quickly, but it is not always the case. When dealing with larger amounts of information, or with a less powerful computer, or with a poor connection to a server that is processing several other things simultaneously and not located on the same machine as the one you are using, you may notice a delay in opening the view. You can view the Sent and Received information in the status bar to view that the program is in fact performing something, even though nothing appears to be happening. If after waiting for a full minute, you still do not see any new window, it may be that you did not click the view, or you may have encountered an error and should contact Megaputer Support.
Should a chart view appear frozen while opening the view, this is typically due to the processing which must occur to render the chart on the screen. When graphing a very large number of classes (more than a few hundred), PolyAnalyst must perform a large amount of calculations and this can take a considerable time. This happens in competition with the process which displays information, which causes PolyAnalyst to appear as frozen. Users need to be patient in this case and wait for PolyAnalyst to complete processing, or consider not trying to graph an attribute with such a large number of distinct values/intervals. It is common practice to avoid attempting to graph such a large number, regardless of the processing constraint, as the output of the graph is difficult to interpret as it is hard to discern among slices and judge the sizes of individual slices.
Node view option is disabled
This is not actually an error, but is instead a common source of confusion. The reason a node cannot be viewed is generally because the node is not in the correct state. Revisit the earlier introduction to node states. Nodes that have not yet completed processing do not produce viewable results, so PolyAnalyst logically prevents you from trying to view such non-existent results. There are several reasons why a node might not be in the completed state. For example, the node may not be configured, or may not have been run. It is always necessary to first configure and then execute a node prior to being able to view it. Even then, if you execute the node, view its results, and then decide to reconfigure the node, its results are deleted at the time you reconfigure the node, so its results will no longer be viewable.
Column lists in node properties are slow to load
In rare situations involving a large number of columns, or a slower computer, the list of columns may take several seconds to update after typing a new character into the Filter area.
On the average computer, this effect generally only appears when working with a dataset containing more than 500 columns. If you are experiencing a slow column load for a smaller dataset of less than 50 columns, it is a sign that something else, other than the shear number of columns, is slowing down the process.
You can reduce the chances of this happening by filtering out unused columns using a Filter column.