Requirements break-out session ----------------------------------------------- NCH anounced goals for the session: - List the requirements of each project - Identify common requirements - Prioritise requirements We identified categories then went round the projects to identify project requirements: Data Access (access to data sources - SQL queries, DB management, translation) Data Integration Application Support (client) Performance/Reliability Security Usability/Support Metadata/Registration PD has a list of all participants in this discussion For each category, we brainstormed some sub-points and each participant voted for those important to his/her project. Key --- Item: No of votes [*** identifies major requirement] Data access ----------- Flat file access (structured/unstructured/arbitrary structure): 9 *** Moving process closer to data: 5 Exploiting structure in file: 1 Unhandled characters (&, >, <, etc): 1 More data sources: 1 Customisable/incremental streaming of data: 2 (dependency for Distributed Query) More data formats: 3 Large result sets (including large blobs): 6 *** Temporary table creation: 1 DB indexes: 1 Translation on input: 1 Distributed write: 1 Xpath/Xquery queries over xml files and streams (not inside an xml db): 1 Distributed transactions: 1 Data Integration ---------------- Schema integration: 7 Multi-model query (queries over relational, structured and semi-structured data at the same time): 7 Distributed query (joins across different data sources): 10 *** Mixed language queries (Xpath & SQL): 1 Data format transformation: 5 Application Support (client) ---------------------------- Client API: 7 *** Activities: 3 Authorization/Authentications/A...: 4 Instrumentation: 1 Diagnostic tools (performance analysis): 6 *** Packaging/Deployment: 5 Logging/Auditing: 2 Notification: 2 Polling properties: 1 Configuration wizard: 1 Performance/Reliability ----------------------- Size of results: 3 CPU: Memory: 4 Overheads (WS..): 1 Network bandwidth: 2 Latency: Number of users: Number of queries: Must not fall over: 5 *** Recovery/checkpoints: 2 Re-register: 3 Security -------- Authorization: 8 *** Authentication: 8 *** Accounting: 3 Privacy: 7 *** Role based access control: 8 *** Roles for file access: 9 *** Usability/Support ----------------- Installation: 6 Configuration: 6 Plug-in packages/integration with other tools: 6 Reference site: 7 *** Installation self test: 8 *** Support: Everyone *** Metadata/Registration --------------------- More information in the registry - schema: 4 - data: 3 - dms functions: 6 Tools to extract this data: 2 Matching algorithms: 1 Physical metadata: 6 (dependency for Distributed Query) Logical metadata: 6 (dependency for Distributed Query) Self-recovering registry: 7 *** Conclusion ---------- Reliability File access Big result sets