Skip to content
Snippets Groups Projects
  1. Nov 29, 2017
    • Ben Cumming's avatar
      support ion species concentration updates (#398) · fa7d99b6
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      * Add querying methods to modcc ion species type to simplify code generation.
      * Add `WriteBack` type that records per-mechanism ion usage metadata.
      * Add `write_back` implementation for C and CUDA printers that adds weighted concentration
        contributions to global concentration fields.
      * Extend `uses_ion` interface to provide information about which concentration fields are modified by mechanisms.
      * Update `update_ion_symbols` lambda that is responsible for adding metadata about interactions between mechanism fields and external ion species fields, creating a write-back when the mechanism field is a state variable or an indexed variable if it is a current/reversal potential, or else an error.
      * Add `test_ca.mod` mechanism that writes calcium ion concentration for testing purposes.
      * Add back-end callbacks that initialize concentration values.
      * Update `mechanism::uses_ion()` to return information regarding mechanism ion concentration updates.
      * Add `mechanism::write_back()` method.
      * Update stimulus mechanism specialization to use new mechanism interface.
      * Update `ion` type to calculate default concentration contributions in CVs where the concentration is only partially determined by mechanisms.
      * Update `fvm_multicell` to calculate the default ion concentration weights.
      * Add unit test for ion concentration weight determination.
      
      Fixes #373 
      fa7d99b6
  2. Nov 28, 2017
    • Sam Yates's avatar
      Tidy `modcc` driver, remove optimize flag. (#404) · 998ee724
      Sam Yates authored
      * Remove optimization option (use SIMD options for vectorization).
      * Remove arbor utility library dependencies from modcc (pending separation of utility lib from arbor lib source).
      * Split target (cpu, gpu) specification from vectorization architecture (avx2, avx512).
      * Remove `Options` singleton; replace with structure local to `modcc.cpp`.
      * Tidy `modcc` option parsing and main function; allow a single invocation of `modcc` to generate code for multiple backends.
      * Rename generated sources to include backend target in filename.
      * Always run a constant simplification pass on generated procedures.
      * Remove file i/o code from `Module` and `modcc` main function; move functionality to new functions in `io` namespace. (Note: in on-going mechanism revamp, other i/o utility code will reside in the `io` namespace and subdirectory.)
      * Remove classes `ConstantFolderVisitor` and `ExpressionClassifierVisitor` that are no longer used.
      * Modify CMakeLists.txt files, `backends/*/fvm.cpp` to reflect the new filenames of generated sources.
      * Small formatting changes in `modcc` source to reflect coding guidelines (incomplete).
      998ee724
    • Sam Yates's avatar
      Fix binary operator parentheses bug. (#400) · 67721f02
      Sam Yates authored
      Fixes #399.
      
      Update `test_printers.cpp`:
      * Fix compilation and scope management issues, and re-include in CMakeLists.txt.
      * Update to use `verbose_print` (run-time verbosity).
      * Apply expected output tests to `CPrinter` and `CUDAPrinter` outputs.
      
      Address binary operation rendering issue:
      * Move C-style expression rendering common to `CPrinter` and `CUDAPrinter` to a specialized `Visitor`, `CExprRenderer`.
      * Use operator associativity to determine need for parentheses around sub-expressions of a binary expression.
      67721f02
  3. Nov 20, 2017
    • Ben Cumming's avatar
      Fix GPU event delivery bugs (#394) · 8566655c
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      * Properly iterate over all event queues in the case where a cell group has more than once cell (triggered in e.g. GPU miniapp).
      * Update GPU unit test to use the new `cell_group::advance()` interface.
      8566655c
    • Ben Cumming's avatar
      modcc now enforces derivatives only on state variables (#392) · 87a52776
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      Derivatives should only appear on the left hand side of expressions that describe the time evolution of state variables. Without this check `modcc` segfaulted when processing a derivative expression of a non-state variable.
      
      * Specialize the semantic analysis of `DerivativeExpression` to enforce that derivatives are only applied to state variables.
      87a52776
  4. Nov 16, 2017
    • Ben Cumming's avatar
      Move event wrangling from cell_group to model (#390) · c76c0520
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      * Remove `cell_group::enqueue_events` interface: `cell_group::advance` is given the events to deliver in the next epoch directly.
      * Implement event lane merging in `model` with stand-alone `merge_events` function.
      * Ensure event-delivery order is preserved across differing domain decompositions by performing a lexicographic sort over all of the `postsynaptic_spike_event` fields: fixes #385. 
                                                                                                               
      Fixes #378.
      c76c0520
    • Sam Yates's avatar
      Add custom streambuf for indented output. (#391) · 2fe61d04
      Sam Yates authored
      These classes aim to provide an alternative to `TextBuffer` in `modcc` printers.
      
      * Implement a streambuf wrapper `prefixbuf` that prepends a prefix string to each line.
      * Adds stream manipulators `setprefix`, `indent`, `popindent`, `settab` for manipulating the prefix in a `prefixbuf`.
      * Provide a wrapper stream `pfxstringstream` around `std::stringbuf` with `std::ostringstream` semantics.
      
      As implemented, a prefix string set with the `setprefix` is applied to the underlying streambuf, and is not a property of the stream. This behaviour may be surprising, so consider storing this prefix with the stream as a possible enhancement.
      2fe61d04
  5. Nov 14, 2017
    • Ben Cumming's avatar
      Use Nernst equation to calculate reversal potentials (#387) · 256c5421
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      Replace constant values for ion species reversal potentials taken from HH model with values that are updated on each time step via the Nernst equation.
      
      * Implement Nernst equation in multicore and gpu back-ends.
      * Extend interface of `ion` type to proved a method `update_reversal_potential` which calls the back-end nernst routine.
      * Add `valency` and default concentration values to the ion species.
      * Add interface for resetting ion species state (for restarting simulations).
      
      Fixes #376 
      256c5421
  6. Nov 09, 2017
    • Ben Cumming's avatar
      Convert currents to current densities in FVM (#381) · bca33966
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      Update the FVM formulation to use current densities instead of currents.
      
      Modifications to modcc:
      * Update printers to store and use weights for point process mechanisms,
      * Scale ion species current contributions by area proportion, similarly to contributions to the accumulated current.
      
      Changes to FVM code:
      * Update weights calculation for density and point processes mechanisms:
          * density channels use relative proportion of CV area, i.e. "density",
          * point processes use the reciprocal of the CV area to convert to a density.
      * Add `cv_area` parameter for matrix constructor, which is used by matrix assembly to convert current densities to currents.
      * Update stimulus implementations (gpu and cpu backends) to contribute current densities.
      
      Other changes:
      * Update unit tests to use new interfaces.
      * Update units section in LaTeX docs.
      
      Fixes #374.
      bca33966
  7. Nov 08, 2017
  8. Nov 07, 2017
  9. Nov 03, 2017
    • Ben Cumming's avatar
      Optimise host-side event wrangling [8] (#369) · 0726685c
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      Optimize host-side postsynaptic event queues:
      
      * Have communicator store stage local postsynaptic events per cell instead of per cell group.
      * Pass (by reference) the relevant subrange of vectors of per-cell events to the cell group `enqueue_events` method.
      * Identify calls to cell group `enqueue_events` and `advance` with an integration epoch identification allowing the use of an efficient thread-safe data structure for the cell group event stores.
      * Perform merge of new events and currently stored events not for delivery in current epoch in parallel across cells in `mc_cell_group::enqueue_events`.
      * Store one event binner object per cell in `mc_cell_group` to avoid hash table lookup overheads in the `binning_kind::following` case, and simplifying the `event_binner` class.
      * Add convenience time comparison functional object `event_time_less`.
      
      Fixes #368.
      0726685c
  10. Nov 02, 2017
    • Sam Yates's avatar
      Add mechanism parameter setting/new implementation. (#377) · b7623d13
      Sam Yates authored
      Fixes #350 
      
      * Replace parameter_list with mechanism_spec.
      * Add prototype for mechanism parameter schema checking.
      * Allow mechanism weights to be set after construction.
      * Combine range parameters on density mechanisms by linear contribution in CVs.
      * Cable segment electrical parameters are now member variables.
      * Publish mechanism parameter information through new method `mechanism::field_info`; note this will be replaced/improved in upcoming dynamic mechanism catalog work.
      * Access mechanism parameter scalars and range data via `mechanism::field_view_ptr` and `mechanism::field_value_ptr` methods.
      * Allow mechanism 'global' parameters to be set via a method of specializing mechanisms (and giving them corresponding aliases).
      * Extend recipe interface to allow querying of per-cell-kind global information for use by cell group implementations.
      * Add unit tests for above - note that linear density mechanism parameter test is tightly coup...
      b7623d13
  11. Nov 01, 2017
  12. Oct 02, 2017
    • Ben Cumming's avatar
      Add micro-benchmark for event delivery setup (#359) · 1c86fdb2
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      Add a micro-benchmark that illustrates scaling issues with event-setup on cell groups with many cells.
      
      The benchmark also illustrates some alternative approaches that are significant optimizations over the current approach.
      
      Fixes #357.
      1c86fdb2
  13. Sep 28, 2017
    • Sam Yates's avatar
      Rename NestMC references, names etc. to Arbor. (#363) · d9f38b2a
      Sam Yates authored
      * Use ARB_ and arb_ as variable prefixes in place of NMC_ and nmc_.
      * Replace references to 'NestMC' and 'NEST MC' to refer instead to Arbor.
      * Use 'arbor' as the sim name in generated validation data.
      * Reflow long-line paragraphs in `tests/ubench/README.md`.
      * Change names of CUDA mechanism and CUDA kernel libraries to include arbor name.
      d9f38b2a
    • Sam Yates's avatar
      Change nest::mc namespaces to arb (#362) · d9f99489
      Sam Yates authored
      Change nest::mc namespaces to arb
      d9f99489
  14. Sep 27, 2017
  15. Sep 25, 2017
    • Ben Cumming's avatar
      Finish Seperable CUDA compilation (#356) · ddbece13
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      Separate compilation for all CUDA code.
      
      * Move all CUDA kernels to their own .cu files, together with C++ function wrappers.
      * Compile all CUDA .cu files to a single static library.
      * Merge gpu and multicore backend validation tests.
      * Simply and clean up cruft from CMakeLists.txt files.
      ddbece13
  16. Sep 21, 2017
    • Ben Cumming's avatar
      Seperable compilation of mechanism kernels on GPU (#353) · 3c283219
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      Separable compilation of the CUDA kernels generated by modcc from NMODL files.
      
      CMake scripts:
      * Update the `build_modules()` helper function to cleanly handle calls to modcc that generate multiple output files.
      * Add a new library target `gpu_mechanisms` for the separately compiled CUDA kernels and the implementation of their C wrappers.
      * Reduce verbosity of compilation messages.
      
      * Simplify mechanism C++ namespace use: move everything in nest::mc::mechanisms::gpu::_mechanism-name_ into `nest::mc::gpu`, and similarly for multicore mechanism implementations, ions.
      * Remove template parameters for `value_type` and `size_type` from all of the 
      mechanism implementations, and use `fvm_value_type` and `fvm_size_type` everywhere instead.
      
      modcc changes:
      * Modify `CUDAPrinter` to keep track of 3 text buffers, one each for 
        "implementation", "interface" and "implementation interface":
      * Write the CUDA implementation interface to `X_impl.hpp`, comprising the definition of the mechanism-specific 'X_ParamParck' struct used to pass function arguments to the CUDA kernels.
      * Write the CUDA kernels and C wrappers to `X_impl.cu`.
      * Write the public C++ mechanism interface (with calls to implementation wrappers) to `X.hpp`.
      * Modify modcc driver to support multiple generated output files.
      3c283219
    • Vasileios Karakasis's avatar
      Conditionally compile AVX2 transcendentals unit tests (#355) · 5dc77b6c
      Vasileios Karakasis authored
      Only compile AVX2 intrinsics unit tests if NMC_VECTORIZE_TARGET=AVX2 is defined.
      
      Fixes #351.
      5dc77b6c
  17. Sep 20, 2017
    • Ben Cumming's avatar
      Stand alone CUDA compilation for threshold_watcher in gpu backend (#345) · 180a7ace
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      Refactor the threshold_watcher and stack data structures in the gpu backend so that they are amenable to separable compilation.
      
      * Make `gpu::stack<T>` have a host-only interface that wraps a POD type `gpu::stack_base<T>`.
      * Implement a `push_back(stack_base, value)` method in `backends/gpu/kernels/stack.hpp` that is visible only to device code.
      * Move `test_thresholds` kernel to a .cu file, replacing template parameters with types provided by `backends/fvm_types.hpp`.
      * Add a simple C function interface, callable from host side code, defined in `backends/gpu/threshold_common.hpp`.
      * Simplify the `gpu::impl::padded_size` function (both to read and in terms of efficiency).
      * Use `typeid` as the default for pretty-printing types in the memory back end.
      * Update the `test_gpu_stack` unit test to support new gpu stack interface.
      * Fix bug in the `test_spikes` unit test, which was not running the GPU back end in the cuda unit tests.
      180a7ace
    • Ben Cumming's avatar
      Remove intrinsics unit test that does not compile (#352) · f937973f
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      The unit test for AVX2 intrinsics does not compile with gcc.
      This is a quick fix to get master to compile, while the test is fixed.
      f937973f
    • Vasileios Karakasis's avatar
      AVX2 transcendentals intrinsics (#329) · a2393eea
      Vasileios Karakasis authored
      Provides the following transcendentals intrinsics:
      
      * `nmc_mm256_exp_pd`
      * `nmc_mm256_log_pd`
      * `nmc_mm256_pow_pd`
      * `nmc_mm256_frexp_pd`
      
      The first three are the equivalent of the corresponding SVML intrinsics without the `nmc` prefix.
      
      The last one is used by the `log` function, but I decided it's nice to have it public.
      
      All results are tested against full precision standard library implementation and provide equal results (except for the `pow` case). `NaN`s and infinities are treated according to the standard.
      
      Limitations:
      
      * Subnormals are treated as zeros by `frexp` and as a result by `log`.
      a2393eea
    • Sam Yates's avatar
      Batched sampling. (#347) · a5ce1d3e
      Sam Yates authored
      Collect samples in FVM lowered cells across an integration period, prior to delivery to sampler callbacks.
      
      Fixes #283.
      
      * Make `multi_event_stream` a generic data structure, based on the event type. These are now used for both `deliverable_event` objects and `sample_event` objects.
      * Add generic accessors for event data, separating access to event metadata (cell index, time) from payload (weight, sampling info etc.).
      * Make a flat, CUDA-friendly data structure describing the `multi_event_queue` marked event state, for passing to mechanisms and kernels. This also brings the underlying representations of the `multicore` and `gpu` versions of `multi_event_queue` closer.
      * Implement batched sampling kernel for GPU.
      * Additional utility function: `util::is_sorted_by`.
      
      Overhead of implementation and sampling process is negligible for CPU, and with no samples less than 1% on GPU. Running the miniapp with 10'000 cells and sampling 1% of the somata at 0.01ms adds approximately 10% to simulation time. 
      a5ce1d3e
  18. Sep 11, 2017
    • Ben Cumming's avatar
      add Travis-CI build status badge to readme (#346) · 02c4ad37
      Ben Cumming authored
      Status of the master branch is displayed at the top of the README.
      The status is based on the result of the automated daily build.
      02c4ad37
    • Ben Cumming's avatar
      Basic CI support with TravisCI (#340) · 137c5b5f
      Ben Cumming authored
      Add support for continuous integration with Travis CI.
      This implements bare bones support that can be extended over time.
      
      Travis CI test environments:
      
          All use gcc 5.
          Test the serial distributed back end with serial and cthread threading backends.
          Test mpi with cthread.
          The tbb test failed sporadically because CMake, so it is disabled for now.
      
      The test script:
      
          Builds the unit tests, global_communication tests and miniapp.
          Asserts that all unit and global_communication tests pass.
          Asserts that the miniapp runs successfully.
              does not test miniapp output for now.
      
      There is plenty of scope for improving the tests.
      A key improvement will be to use validated output for the validation and miniapp
      to provide some validation.
      
      There were some small fixes required to make the tests pass on Travis
      
          communication/mpi.hpp now sets default size and rank values of 1 and 0 respectively
          to allow all unit tests to p...
      137c5b5f
  19. Sep 05, 2017
  20. Sep 04, 2017
    • Ben Cumming's avatar
      Move nvcc-only code from memory to backends::gpu (#342) · 40e2f523
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      * Move gpu-kernel code from memory to backends/gpu
      
      A small step towards seperate back end compilation for CUDA.
      Move the following code to the gpu backend:
      - the memory::fill* wrappers and fill kernel
      - the managed_ptr type
        - only used in backends::gpu
        - has `__device__ __host__` members for dual host-device use.
      
      * update unit&validation tests
      40e2f523
  21. Sep 01, 2017
    • Sam Yates's avatar
      Update global comms tests for new sampling API. (#338) · 3bcb8f97
      Sam Yates authored
      * Avoid format securiy warning/error in `mpi_listener.hpp`
      * Update recipe classes in `test_communicator.cpp` and `test_domain_decomposition.cpp`.
      * Align test names in `test_domain_decomposition.cpp` with those in unit tests.
      * Fix `spike_gids` bug in `test_all2all` routine.
      * Replace test assertions with `AssertionResult` returns in `test_ring` and `test_all2all`.
      * Simplify no-extra-events check in `test_ring` and `test_all2all`.
      3bcb8f97
    • Sam Yates's avatar
      New sampling API implementation. (#335) · 8739fd55
      Sam Yates authored
      Towards resolution of issue #283.
      
      * Replace probe/sample infrastructure with new API as outlined in the `sampling_api.rst` documentation.
      * Separate `cell` probe information from `cell` description object.
      * Add `--list` option to `tsplot`, to summarize available time series for plotting together with their metadata.
      * Add `--sample-dt` option to validation tests.
      * Change validation time series comparison behaviour: linf distance metric now estimated by comparing reference data points against linearly interpreted simulation samples, rather than the other way around.
      * Add utility class `any_ptr` which allows lightweight and type-checked access to a type-erased pointer.
      * Replace `singleton_recipe` with two simple recipe base classes used for unit and validation tests, in `tests/simple_recipes.hpp`.
      * Simplify RSS cell implementation.
      * Add statistical functions for testing of probabilistic methods in `tests/unit/stats.hpp` (and .cpp):
         * simple summary stats class;
         * Kolmogorov–Smirnov one-sided statistic and CDF;
         * approximate Poisson CDF.
      * Simplify and refactor miniapp `sample_trace` code.
      * Add new utility class `handle_set` for managing a collection of integer handles. (Near-simplest implementation.)
      * Relax const-ness constraints on `transform_iterator`.
      8739fd55
  22. Aug 24, 2017
    • Ben Cumming's avatar
      Basic Sphinx Documentation (#328) · 610fd857
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      Adds support for building documentation with Sphinx from reStructuredText-formatted files in the `doc` subdirectory. Automatic building has been verified with ReadTheDocs.
      
      * Add basic documentation to the `doc` path.
      * Use a git submodule and associated CMake to pull in ReadTheDocs theme at configuration time.
      610fd857
  23. Aug 23, 2017
  24. Aug 18, 2017
    • Ben Cumming's avatar
      Better TBB CMake integration (#331) · 6dce9fa4
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      * Add support for CMake scripts provided by TBB.
      * Update required cmake version to 3.0.
      
      * hack to get linking to work on Cray PE
      
      * improve comments and remove redundant include in CMakeLists
      
      * firewall the tbb cmake files
      
      * tbb threading back end to_string includes version number
      6dce9fa4
    • Ben Cumming's avatar
      Refactor domain decomposition for arbitrary gid distribution. (#326) · cba9d458
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      Changes to `domain_decomposition`:
        * `domain_decomposition` performs two pass load balancing in constructor:
            1. first pass performs global load balance,
            2. second pass distributes cells locally between cpu and gpu cell_groups.
          The current logic for this is very simple and naive, and will be replaced with a load balancer which returns a lighter domain decomposition description in a follow up pull request.
        * Provides a simple `group_description` type that contains gid, `cell_kind` and target backend information for `cell_group_factory`.
      
      Changes to `communicator`:
        * Constructor takes a `domain_decomposition` and recipe.
        * The interface for adding connections and constructing connection table has been removed, as this is now performed within the constructor.
        * Construction is more complicated, as connections are partitioned by source gid which requires multiple passes over the connection information in the recipe.
        * `make_event_queues` updated: spikes and connections are now partitioned by source domain, and an optimization dynamically chooses to iterate over either connection or spike list, whichever is shorter.
        * The `exchange` method now sorts `local_spikes` before global gather to facilitate the optimized spike/connection searching.
      
      Changes to `miniapp`:
        * Automatically use gpu if available and compiled with gpu support.
        * Banner prints out useful information about number of cores, gpus and ranks.
        * Remove -g cell group size flag.
      
      Changes to `cell_group`:
        * `cell_group` interface take a list of gid values instead of a range.
        * Updated internal `cell_group` logic to convert between gid and local indices: use a vector for local index to gid map, and a hash table for gid to local index in `cell_group` implementations that need this lookup.
      
      changes to unit tests
        * tests for the domain decomposition
        * tests for the communicator that test ring and all2all networks
      cba9d458
  25. Jul 29, 2017
  26. Jul 18, 2017
    • Ben Cumming's avatar
      Feature/node description (#325) · ecb1b049
      Ben Cumming authored and Sam Yates's avatar Sam Yates committed
      Refactor hardware-querying code.
      
      Comprises part of the work on issue #318.
      
      * Create `src/hardware` path for hardware-querying code.
      * Create `nest::mc::hw` namespace for hardware-querying code.
      * Move memory, affinity and power code from util and threading code to the new diretory and namespace.
      * Add `nest::mc::threading::num_threads` function that attempts to determine the number of threads to use according to an environment variable, or by using the low level hardware querying if no environment variable is set.
      * Add hardware query for counting the number of available GPUs.
      ecb1b049
  27. Jul 07, 2017
    • Wouter Klijn's avatar
      306 spike from file (#314) · 90b7966a
      Wouter Klijn authored
      Created a new cell kind that injects a user-suplied stream of spikes, along with corresponding cell_group implementation called dss_cell_group.
      
      The miniapp is updated to test this functionality
        * An ostream spike time parser
        * A new command line argument -I with a Path.
      90b7966a
  28. Jul 05, 2017